Time: 2022-05-11 09:00:29

I have an HTML (not XHTML) document that renders fine in Firefox 3 and IE 7. It uses fairly basic CSS to style it and renders fine in HTML.


I'm now after a way of converting it to PDF. I have tried:


  • DOMPDF: it had huge problems with tables. I factored out my large nested tables and it helped (before that it was just consuming up to 128M of memory and then dying; that's my memory limit in php.ini), but it makes a complete mess of tables and doesn't seem to fetch images. The tables were just basic stuff with some border styles to add some lines at various points;
  • HTML2PDF and HTML2PS: I actually had better luck with this. It rendered some of the images (all the images are Google Chart URLs) and the table formatting was much better but it seemed to have some complexity problem I haven't figured out yet and kept dying with unknown node_type() errors. Not sure where to go from here; and
  • Htmldoc: this seems to work fine on basic HTML but has almost no support for CSS whatsoever so you have to do everything in HTML (I didn't realize it was still 2001 in Htmldoc-land...) so it's useless to me.

I tried a Windows app called Html2Pdf Pilot that actually did a pretty decent job but I need something that at a minimum runs on Linux and ideally runs on-demand via PHP on the Webserver.


What am I missing, or how can I resolve this issue?


30 answers



Important: Please note that this answer was written in 2009 and it might not be the most cost-effective solution today in 2018. Online alternatives like PDFShift are better today at this than they were back then.


Have a look at PrinceXML.


It's definitely the best HTML/CSS to PDF converter out there, although it's not free. (But hey, your programming might not be free either, so if it saves you 10 hours of work, you're home free, since you also need to take into account that the alternative solutions will require you to set up a dedicated server with the right software.)


Oh yeah, did I mention that this is the first (and probably only) HTML2PDF solution that fully passes the Acid2 test?


PrinceXML Samples




Have a look at wkhtmltopdf. It is free, open source, and based on WebKit.


We wrote a small tutorial here.
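For the on-demand PHP case in the question, one way to drive wkhtmltopdf is to shell out to it. This is a minimal sketch, assuming the `wkhtmltopdf` binary is installed and on the PATH; the helper name `wkhtmltopdfCommand()` and the file names are made up for illustration.

```php
<?php
// Hypothetical helper: builds the shell command for the basic
// "wkhtmltopdf <input> <output>" invocation, escaping both arguments.
function wkhtmltopdfCommand(string $input, string $output): string
{
    return 'wkhtmltopdf ' . escapeshellarg($input) . ' ' . escapeshellarg($output);
}

$command = wkhtmltopdfCommand('report.html', 'report.pdf');
echo $command, "\n";

// To actually run it (assumes wkhtmltopdf is installed on the server):
// exec($command, $outputLines, $exitCode);
// if ($exitCode !== 0) { /* conversion failed; inspect $outputLines */ }
```

Escaping each argument with `escapeshellarg()` matters as soon as the input path or URL comes from user data.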


EDIT (2017):


If I were to build something today, I wouldn't go that route anymore, but would use http://pdfkit.org/ instead, probably stripping it of all its Node.js dependencies so it can run in the browser.




After some investigation and general hair-pulling the solution seems to be HTML2PDF. DOMPDF did a terrible job with tables, borders and even moderately complex layout and htmldoc seems reasonably robust but is almost completely CSS-ignorant and I don't want to go back to doing HTML layout without CSS just for that program.


HTML2PDF looked the most promising but I kept having this weird error about null reference arguments to node_type. I finally found the solution to this. Basically, PHP 5.1.x worked fine with regex replaces (preg_replace_*) on strings of any size. PHP 5.2 introduced a php.ini config directive called pcre.backtrack_limit. What this config parameter does is cap the number of backtracking steps PCRE may take during a match, which in practice limits how large a string a backtracking-heavy pattern can process. Why this was introduced I don't know. The default value was chosen as 100,000. Why such a low value? Again, no idea.


A bug was raised against PHP 5.2.1 for this, which is still open almost two years later.


What's horrifying about this is that when the limit is exceeded, the replace just silently fails. At least if an error had been raised and logged you'd have some indication of what happened, why and what to change to fix it. But no.
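The silent failure can at least be made visible in your own code: `preg_replace()` returns NULL when PCRE gives up, and `preg_last_error()` says why. The wrapper below is a sketch only; `checkedPregReplace()` is a hypothetical name and is not part of HTML2PDF, which would have to be patched to use something like it.

```php
<?php
// Hypothetical helper (not part of HTML2PDF): wraps preg_replace() so that a
// PCRE failure raises an exception instead of silently returning NULL.
function checkedPregReplace(string $pattern, string $replacement, string $subject): string
{
    $result = preg_replace($pattern, $replacement, $subject);
    if ($result === null) {
        // preg_last_error() reports why the last PCRE call failed, e.g.
        // PREG_BACKTRACK_LIMIT_ERROR when pcre.backtrack_limit was hit.
        $code = preg_last_error();
        $why = ($code === PREG_BACKTRACK_LIMIT_ERROR)
            ? 'pcre.backtrack_limit exceeded'
            : 'PCRE error code ' . $code;
        throw new RuntimeException('preg_replace failed: ' . $why);
    }
    return $result;
}

// Normal case: whitespace runs are collapsed and the string is returned.
echo checkedPregReplace('/\s+/', ' ', "some   <b>HTML</b>\n text"), "\n";
```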


So I have a 70k HTML file to turn into PDF. It requires the following php.ini settings:


  • pcre.backtrack_limit = 2000000; # probably more than I need but that's OK
  • memory_limit = 1024M; # yes, one gigabyte; and
  • max_execution_time = 600; # yes, 10 minutes.

Now the astute reader may have noticed that my HTML file is smaller than 100k. The only reason I can guess as to why I hit this problem is that html2pdf does a conversion into xhtml as part of the process. Perhaps that took me over (although nearly 50% bloat seems odd). Whatever the case, the above worked.
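If editing php.ini for the whole server is not desirable, the same three values can usually be raised only in the script that does the conversion. This is a sketch under the assumption that the host allows `ini_set()` and `set_time_limit()`; the values are simply the ones from the list above.

```php
<?php
// Per-script equivalents of the three php.ini settings listed above.
// Only this request is affected; the rest of the site keeps its defaults.
ini_set('pcre.backtrack_limit', '2000000');
ini_set('memory_limit', '1024M');
set_time_limit(600); // allow this request up to 600 seconds from this point

// Confirm the PCRE limit actually changed before starting the conversion.
echo ini_get('pcre.backtrack_limit'), "\n";
```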


Now, html2pdf is a resource hog. My 70k file takes approximately 5 minutes and at least 500-600M of RAM to create a 35-page PDF file. Unfortunately that is not quick enough (by far) for a real-time download, and the memory-to-input ratio works out to well over 8,000-to-1 (600M of RAM for a 70k file), which is utterly ridiculous.


Unfortunately, that's the best I've come up with.




Why don’t you try mPDF version 2.0? I used it to create a PDF document. It works fine.


Meanwhile mPDF is at version 5.7 and is actively maintained, in contrast to HTML2PS/HTML2PDF.


But keep in mind that the documentation can be really hard to handle. For example, take a look at this page: https://mpdf.github.io/.


Very basic HTML-to-PDF tasks can be done with this library, but more complex tasks will take some time reading and "understanding" the documentation.




1) Use mPDF!

a) extract it into yourfolder


b) create file.php in yourfolder and insert this code:


<?php
// Assumes the legacy mPDF class file (mPDF 6.x or earlier) has already been included.
$mpdf = new mPDF();
$mpdf->WriteHTML('<p style="color:red;">Hello World<br/>First sentence</p>');
$mpdf->Output();
exit;

c) open file.php from your browser


2) Use pdftohtml!

1) extract pdftohtml.exe to your root folder:


2) inside that folder, in an anyfile.php file, put this code (assuming there is a source example.pdf there too):



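$source = 'example.pdf';       // the PDF to convert
$output_fold = 'FinalFolder';  // folder that receives the converted files
if (!file_exists($output_fold)) { mkdir($output_fold, 0777, true); }
// passthru() prints the tool's output; its exit status is stored in $status
passthru('pdftohtml ' . escapeshellarg($source) . ' ' . escapeshellarg("$output_fold/new_filename"), $status);
//var_dump($status);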

3) open FinalFolder, where you will find the converted files (as many pages as the source PDF had)




Check out TCPDF. It has some HTML to PDF functionality that might be enough for what you need. It's also free!
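As a rough idea of what that HTML functionality looks like in use, here is a minimal sketch. It assumes TCPDF is installed and loadable (for example through Composer's autoloader); `htmlToPdfString()` is a made-up wrapper name, not part of TCPDF, and TCPDF only understands a subset of HTML and CSS.

```php
<?php
// Hypothetical wrapper around TCPDF's HTML rendering; assumes the TCPDF
// class can be loaded when the function is called.
function htmlToPdfString(string $html): string
{
    $pdf = new TCPDF();                   // defaults: portrait, millimetres, A4
    $pdf->AddPage();
    $pdf->writeHTML($html);               // renders the supported subset of HTML/CSS
    return $pdf->Output('doc.pdf', 'S');  // 'S' returns the PDF as a string
}

// Only attempt the conversion when TCPDF is really available.
if (class_exists('TCPDF')) {
    file_put_contents('hello.pdf', htmlToPdfString('<h1>Hello</h1><p>Generated by TCPDF.</p>'));
}
```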




I suggest DocRaptor (which uses PrinceXML as the "engine")




Just to bump the thread, I've tried DOMPDF and it worked perfectly. I've used DIV and other block level elements to position everything, kept it strictly CSS 2.1 and it played very nicely.




Good news! Snappy!!


Snappy is a very easy open source PHP5 library, allowing thumbnail, snapshot or PDF generation from a url or a html page. And... it uses the excellent webkit-based wkhtmltopdf


Enjoy! ^_^




Well if you want to find a perfect XHTML+CSS to PDF converter library, forget it. It's far from possible. Because it's just like finding a perfect browser (XHTML+CSS rendering engine). Do we have one? IE or FF?


I have had some success with DOMPDF. The thing is that you have to modify your HTML+CSS code to go with the way the library is meant to work. Other than that, I have pretty good results.


See below:


Original HTML


Converting HTML to PDF




It's already been mentioned, but I'd just like to confirm that mpdf is the easiest, most powerful and most free html to pdf converter out there. The sky's really the limit. You can even generate pdfs of dynamic, user generated data.


For instance, a client wanted a CMS so he could update the track list of the music he played at his club. That was no problem, but he also wanted users to be able to download a .pdf of the playlist, and so this downloadable PDF had to be updated by the CMS too. Thanks to mpdf, with some simple loops and interspersed variables I could do just that. Something that I thought would take me weeks literally took me minutes.
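The "simple loops and interspersed variables" part might look something like the sketch below. The data shape, the track names and the function name `playlistHtml()` are all invented for illustration; the resulting string is what would be handed to mPDF's WriteHTML().

```php
<?php
// Hypothetical data shape: each track is ['artist' => ..., 'title' => ...].
function playlistHtml(array $tracks): string
{
    $html = '<h1>Playlist</h1><ol>';
    foreach ($tracks as $track) {
        // Escape user-generated text before it goes into the markup.
        $html .= '<li>' . htmlspecialchars($track['artist'])
               . ' - ' . htmlspecialchars($track['title']) . '</li>';
    }
    return $html . '</ol>';
}

$html = playlistHtml([
    ['artist' => 'Artist A', 'title' => 'First Track'],
    ['artist' => 'B & C', 'title' => 'Second Track'],
]);
echo $html, "\n";
// The string would then be passed on, e.g. $mpdf->WriteHTML($html);
```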


Great article that helped me get started.




I am using fpdf to produce pdf files using php. It's working well for me so far to produce simple outputs.




The HTML2PDF and HTML2PS originally mentioned in the opening post refer to a 2009 package at this link


But there is a better HTML2PDF


It is based on TCPDF though it is partly in French.


You can have table headers or footers that repeat on the pages and have page numbers and total pages. See its examples. I have been using it for over three years and recommend it.




There's a tutorial on Zend's devzone on generating pdf from php (part 1, part 2) without any external libraries. I never implemented this sort of solution, but since it's all php, you might find it more flexible to implement and debug.




In terms of cost, using a web service (API) may in many cases be the more sensible approach. Plus, by outsourcing this process you unburden your own infrastructure/backend and, provided you are using a reputable service, get compatibility with evolving web standards, good uptime, short processing times and quick content delivery.


I've done some research on most of the web services currently on the market. Below are the APIs that I feel are worth mentioning in this thread, ordered by price/value ratio. All of them offer ready-made PHP classes and packages.


  1. pdflayer.com - Cost: $ - Quality: ☆☆☆☆
  2. docraptor.com - Cost: $$$ - Quality: ☆☆☆☆☆
  3. pdfcrowd.com - Cost: $$ - Quality: ☆☆☆



Having the high-quality engine PrinceXML as a backbone, DocRaptor clearly offers the best PDF quality, returning highly polished and well converted PDF documents. However, the pdflayer API service gets pretty close here. Pdfcrowd does not necessarily score with quality, but with processing speed.




pdflayer.com - As indicated above, the most cost-effective option here is pdflayer.com, offering an entirely free subscription plan for 100 monthly PDFs and premium subscriptions ranging between $9.99-$119.99. The price for 10,000 monthly PDF documents is $39.99.


docraptor.com - Offering a 7-Day Free Trial period. Premium subscription plans range from $15-$2250. The price for 10,000 monthly PDF documents is ~ $300.00.


pdfcrowd.com - Offering 100 PDFs once for free. Premium subscription plans range from $9-$89. The price for 10,000 monthly PDF documents is ~ $49.00.


I've used all three of them and this text is supposed to help anyone decide without having to pay for all of them. This text has not been written to endorse any one product and I have no affiliation with any of the products.




If you have access to the command line it's possible to use PhantomJS to create the PDF from a URL (remote or local).


It works really well, and is a free solution.


Take a look at this example script made for this exact problem.




Darryl Hein's mention above of TCPDF is likely a great idea. Nicola Asuni's code is pretty handy and powerful. The only killer is that it cannot merge existing PDF files with your generated PDF, if you ever plan on doing that. You would have to create the PDF and then merge it using something like PDFTK by Sid Steward (www.pdflabs.com/tools/pdftk-the-pdf-toolkit/).




This question is pretty old already, but I haven't seen anyone mention CutyCapt, so I will :)




CutyCapt is a small cross-platform command-line utility to capture WebKit's rendering of a web page into a variety of vector and bitmap formats, including SVG, PDF, PS, PNG, JPEG, TIFF, GIF, and BMP




Try grabbing the latest nightly dompdf build. I was using an older version that was a terrible resource hog and took forever to render my PDF. After grabbing a nightly from here, it only took a few seconds to generate the PDF, and it was just as nicely rendered as with PrinceXML / DocRaptor. Seems like they've seriously optimized the dompdf code since I last used it!



I developed a public API to build PDF files from web pages. It has a nice PHP client class that makes it super easy to use. It uses wkhtmltopdf to render the PDF in the cloud.


No need for anything special in the HTML. No need for absolute URLs in image/CSS/JS links. Works on localhost (dev machine) too.


Currently the service has endpoints in 4 Azure regions: US East, US West, EU North, Southeast Asia.


It's fast since it uses a proprietary protocol to send the web page contents to the API for conversion to PDF.


It's reliable because all endpoints are load balanced.


Free account available for testing or low usage. Details on the web site:






Perhaps you might try running Tidy on the file before handing it to the converter. If one of the renderers chokes on some HTML problem (like an unclosed tag), it might help.
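In PHP this can be done with the tidy extension. The following is a sketch that assumes the extension may or may not be loaded, so it falls back to returning the input unchanged; `repairHtml()` is a made-up helper name and the two configuration options are just a plausible starting point.

```php
<?php
// Hypothetical helper: repairs markup with PHP's tidy extension when it is
// loaded, and otherwise returns the input untouched.
function repairHtml(string $html): string
{
    if (!function_exists('tidy_repair_string')) {
        return $html; // tidy extension not available
    }
    // output-xhtml: emit well-formed XHTML; wrap 0: do not re-wrap long lines.
    $config = ['output-xhtml' => true, 'wrap' => 0];
    $repaired = tidy_repair_string($html, $config, 'utf8');
    return is_string($repaired) ? $repaired : $html;
}

// The unclosed <b> and <p> are closed by tidy (if the extension is present).
echo repairHtml('<p>Unclosed <b>tag'), "\n";
```

The repaired string, rather than the raw file, is then what gets passed to the PDF converter.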




I don't think a PHP class is the best way to render an XHTML page with CSS.


What happens when new CSS rules come out? (CSS 3.0 is coming soon...)

The best way to render an HTML page is, obviously, a browser. Firefox 3.0 can natively 'print' to PDF format, and torisugary developed an extension (command line print) to use it. Here you'll find it.

Anyway, there are still many problems with running Firefox just as a PDF converter...

At the moment, I think that wkhtmltopdf is the best (it is built on WebKit, the engine used by the Safari browser): fast, awesome, and open source as well. Give it a look.




Although there are many solutions offered already, I recommend the following two:


  1. HTM2PDF - offers an API to convert HTML to PDF and also has a PHP SDK, which makes it very easy to implement in PHP; it offers a choice of server locations in Europe, Asia and the USA
  2. PDFmyURL - offers an API that does URL and HTML to PDF as well, with roughly the same functionality as HTM2PDF, but works on a load balanced landscape and has been around a little longer

What sets these two APIs apart from all the previously mentioned solutions is that, besides converting HTML to PDF with CSS and JavaScript, they also offer PDF rights management, watermarking and encryption. That makes them all-in-one solutions for those who want to hit the ground running.


Disclaimer: I work for Kaiomi, a company that operates both of these websites.




Fine rendering doesn't mean anything. Does it validate?


All browsers do the most they can to just show something on the screen, no matter how bad the input. And of course they do not all do the same thing. If you want the same rendering as Firefox, you could use its rendering engine. There are PDF generators for it. It is an awful lot of work, though.




TCPDF works fine, has no dependencies, is free and gets constant bug fixes. It has reasonable speed if the supplied HTML/CSS content is well formatted. I normally feed it 50 - 300 kB of HTML input (including CSS) and get PDF output of 10 - 15 pages within 1-3 seconds.


I strongly recommend using the tidy library as an HTML pretty-formatter before sending anything to TCPDF.




I recommend TCPDF or DOMPDF, in that order.




I've tried a lot of different libraries for PHP, including all of those listed here. In my opinion the TCPDF library is the best compromise between performance and usability. It's very simple to install and use, and performs well in small to medium applications. If you need high performance and very big PDF documents, use the Zend_PDF module, but get ready for some hard coding!




Does the HTML to PDF conversion really need to occur server-side using PHP?


I just came across jsPDF, a client-side solution using HTML5/JavaScript. The MIT-licensed code is on GitHub, too.





For people who are still searching for this kind of thing, there is a free website which allows you to convert HTML code and pages to PDF. There is also a (very small) API which allows you to get a PDF file from a URL.


Check it here




Not PHP, but a Java library that does the job:


Flying Saucer takes XML or XHTML and applies CSS 2.1-compliant stylesheets to it, in order to render to PDF.

It is usable from PHP via system() or a similar call, although it requires the input to be well-formed XML.



