Convert PDF to HTML [closed]

我的未来我决定 提交于 2019-11-26 07:35:35

问题


What is the best solution to convert PDF documents to be viewed in the browser as HTML? The site has several PDF documents and the visitor can click on view as HTML and this should be viewed on the screen as an HTML file.

Standard website running PHP, Linux, Apache.


回答1:


pdftohtml works fine : fast, stable but the html result is ugly at best. I have used it for quite some time for a web site that has many job resumes.

It is a good solution for extracting textual content however.

I would give the scribd API a try

or the google apps document API. GOogle does a great job a displaying and converting pdf files




回答2:


Have you considered keeping the PDF data in a database and then either dynamically creating the PDF or the html page depending on what the visitors select?




回答3:


If you have command line access at your hosting provider, there is a utility called pdftohtml inside of the poppler_utils package.

http://poppler.freedesktop.org/

Looks quite easy to use, have not called it from inside of PHP, but it should work.




回答4:


If you are prepared to call Java from PHP you could have a look at http://www.jpedal.org/html_index.php



来源:https://stackoverflow.com/questions/956508/convert-pdf-to-html

易学教程内所有资源均来自网络或用户发布的内容,如有违反法律规定的内容欢迎反馈
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!