poppler

利用 Manjaro Linux 内置命令行工具准备复试材料

依然范特西╮ 提交于 2020-08-04 23:26:25
研究生考试复试材料需要把扫描文档合并为一个pdf文件(微软 Office Lens、万能扫描王、扫喵等可在手机上把纸质文档转换成图片,其中微软的 Office Lens 可以拍照直接保存为 PDF,还能检测并裁掉边角,还你规矩的图片文档,墙裂推荐),其中还涉及的一个中间环节是扫描图片转pdf文档。Windows 和 Android 下可以考虑使用格式工厂、迅捷PDF等做多种文档的相互转换;此外听说苹果手机自带PDF转换合并的应用(嗯,资本的力量);对于 Firefox 等浏览器,打开图片然后打印为PDF也不失为一种图片转pdf的办法;而作为英雄云集的 Manjaro Linux 的用户,我的 Linux 解法主要用到两个工具,确认过眼神,都是 Manjaro Linux 官方仓库中的工具(开源的力量!): imagemagick - 图片转pdf 参考资料 用法: 单图 convert page.png page.pdf 多图 convert page*.png mydoc.pdf poppler - pdf合并 参考资料 用法: pdfunite in-1.pdf in-2.pdf in-n.pdf out.pdf 或者用以下命令一网打尽: pdfunite *.pdf out.pdf 来源: oschina 链接: https://my.oschina.net/baytars

Unable to Import Poppler even after installing in conda

荒凉一梦 提交于 2020-05-17 06:46:24
问题 I am trying to use pdf rendering package Poppler and I found an Anaconda Installation for the same here https://anaconda.org/conda-forge/poppler I can see the Poppler package installed in my conda env when I do conda <env> list However when I try to import the package in my code by doing import poppler I get : ModuleNotFoundError: No module named 'poppler' How do I find if this is the right name of the module if its not the name shown in then conda env list.. I believe the package name is the

converting pdf to image but after zooming in

老子叫甜甜 提交于 2020-05-14 20:48:07
问题 This link shows how pdf s could be converted to images. Is there a way to zoom my pdf s before converting to images? In my project, i am converting pdf s to png s and then using Python-tesseract library to extract text. I noticed that if I zoom pdf s and then save parts as png s then OCR provides much better results. So is there a way to zoom pdfs before converting to pngs? 回答1: I think that raising the quality (resolution) of your image is a better solution than zooming into the pdf. using

Error installing python-poppler-qt5 on Windows10

只谈情不闲聊 提交于 2020-01-22 02:30:16
问题 I am trying to pip install pip install python-poppler-qt5, after working through many many errors I now recieve the error below: (I have poppler-0.68.0 in my program files and added to PATH). pip install python-poppler-qt5 Collecting python-poppler-qt5 Using cached https://files.pythonhosted.org/packages/6a/7d/65a14ece5dd6a1564b576c1ca30b0f5639be64cc55b62b4d2b497159ed43/python-poppler-qt5-0.75.0.tar.gz Installing collected packages: python-poppler-qt5 Running setup.py install for python

converting PDF to JPEG on AWS Lambda

Deadly 提交于 2020-01-02 09:37:10
问题 I've written a small lambda Python function that makes use of pdf2image to convert a PDF file's pages into separate JPG files. This library is a wrapper around poppler-utils , in particular pdftoppm . Works fine on my Ubuntu system, but of course AWS Lambda is different. So I went to look for a version of poppler that I could compile on an EC2 instance for AWS Lambda and found the slightly dated Poppler-build ansible playbook. With some tweaks, I got it to work, however the Poppler version

have you got a py-poppler-qt example?

混江龙づ霸主 提交于 2020-01-02 06:35:09
问题 I'm developing an application in PyQt4 that eventually has to open and show PDF files. For this task there is a python library: python-poppler (in various spelling flavours). The problem is that it is terribly under documented and the only simple working example I found so far uses Python+Gtk+Cairo, while the example with Python+Qt I found uses an older version of the library, and many major changes have occurred ever since, hence it doesn't work anymore. It's a week I'm trying to use the

Merge remote repository commits to the local

心已入冬 提交于 2019-12-24 13:34:41
问题 I have two remotes repositories with C++ Poppler library, say A (original-poppler) and B (another-poppler) . The main meaning of second B repository is maintaining Poppler library for some Linux distro and building for them rpm packages. B was created from A just by copying files - I see the first commit Ported latest version and many committed files - it should be from some A commit (version 0.20 ). Starting from this "Ported latest version" repository B lives his life - are added some

Has anyone been able to use poppler new_from_data in python?

杀马特。学长 韩版系。学妹 提交于 2019-12-24 09:03:56
问题 Using Python3, and Poppler, I can load files with new_from_file without problem, but new_from_data is problematic. Here is the code which is obviously a simple test, because it does not make sense to read from file and then use new_from_data, since new_from_file works perfectly, but I could not post here the full code generating the pdf file. from gi.repository import Poppler, Gtk def draw(widget, cr): # set background. cr.set_source_rgb(0.7, 0.6, 0.5) cr.paint() # set page background cr.set

Install poppler onto Heroku Server django

我的梦境 提交于 2019-12-24 04:02:35
问题 I am trying to install poppler on my Heroku server because I am using pdf2image as a python package. However, I can't just run brew install poppler like I did on my Mac. I have tried to add some heroku buildpacks off the internet but with no luck. Anytime pdf2image runs I get this error. pdf2image.exceptions.PDFInfoNotInstalledError: Unable to get page count. Is poppler installed and in PATH? Is there something I can do on the command line to get poppler installed while keeping heroku/python