How to extract charts/tables/graphs from PDF files using Python?

前端未结

I searched quite a bit but couldn't find a solution for this kind of problem, so I'm posting a clear question about it. Most answers cover image/text extraction which

2 Answers

旧时难觅i

2021-01-19 03:39

Try PyMuPDF (https://github.com/pymupdf/PyMuPDF/tree/1.18.3) to pull together the text, bars, lines and axes that make up a chart. It also comes with many extra utilities.
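
A minimal sketch of that idea, assuming a recent PyMuPDF release where the methods are spelled `get_drawings()` and `get_text("words")` (older releases such as 1.18.3 used camelCase spellings for these). The function names `union_bbox` and `collect_chart_parts` are my own, not part of the library, and treating every vector drawing on the page as part of one chart is a simplification.

```python
def union_bbox(rects):
    """Return the smallest (x0, y0, x1, y1) box covering all given boxes."""
    rects = list(rects)
    if not rects:
        return None
    x0 = min(r[0] for r in rects)
    y0 = min(r[1] for r in rects)
    x1 = max(r[2] for r in rects)
    y1 = max(r[3] for r in rects)
    return (x0, y0, x1, y1)


def collect_chart_parts(pdf_path, page_index=0):
    """Gather vector shapes (bars, lines, axes) and words from one page."""
    # Imported here so union_bbox can be used without PyMuPDF installed.
    import fitz  # PyMuPDF

    doc = fitz.open(pdf_path)
    try:
        page = doc[page_index]
        # Each drawing is a dict; "rect" is the bounding box of that path.
        shapes = [tuple(d["rect"]) for d in page.get_drawings()]
        # Each word is (x0, y0, x1, y1, text, block_no, line_no, word_no).
        labels = [(w[4], tuple(w[:4])) for w in page.get_text("words")]
    finally:
        doc.close()

    return {"shapes": shapes, "labels": labels, "bbox": union_bbox(shapes)}
```

Calling `collect_chart_parts("report.pdf")` (a hypothetical file name) gives you the shape boxes, the words with their positions, and one box around all shapes. On a real page you would probably cluster the shapes first, so that separate charts end up in separate boxes.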

甜味超标

2021-01-19 03:53

For extracting tables, you can use Camelot.

Here is an article about it.
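
As a rough sketch of what that looks like in code (not taken from the article), assuming Camelot is installed and the PDF is text-based rather than scanned. `pages_to_spec` and `extract_tables` are hypothetical helper names, and the 80% accuracy cutoff is an arbitrary choice.

```python
def pages_to_spec(pages):
    """Turn [1, 3, 4] into the "1,3,4" page string that Camelot expects."""
    return ",".join(str(p) for p in pages)


def extract_tables(pdf_path, pages=(1,), flavor="lattice", min_accuracy=80.0):
    """Return the detected tables as pandas DataFrames."""
    # Imported here so pages_to_spec can be used without Camelot installed.
    import camelot

    # "lattice" relies on ruling lines between cells; "stream" relies on
    # whitespace and is worth trying when the table has no visible borders.
    tables = camelot.read_pdf(pdf_path, pages=pages_to_spec(pages), flavor=flavor)

    # parsing_report carries Camelot's own accuracy estimate for each table.
    return [t.df for t in tables if t.parsing_report["accuracy"] >= min_accuracy]
```

Each returned DataFrame can then be written out with `df.to_csv(...)` or inspected directly.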

For images, I found this question and answer: "Extract images from PDF without resampling, in python?"
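
One way to do that (not necessarily the approach from the linked answer) is to write out the embedded image bytes as they are stored in the file. This sketch assumes a recent PyMuPDF release with `get_images()` and `extract_image()`. `image_filename` and `extract_images` are hypothetical names of my own.

```python
from pathlib import Path


def image_filename(page_index, xref, ext):
    """Build a file name such as "page1_xref12.png" (1-based page number)."""
    return f"page{page_index + 1}_xref{xref}.{ext}"


def extract_images(pdf_path, out_dir):
    """Save every embedded image without re-encoding it; return the paths."""
    # Imported here so image_filename can be used without PyMuPDF installed.
    import fitz  # PyMuPDF

    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    saved = []

    doc = fitz.open(pdf_path)
    try:
        for page_index in range(len(doc)):
            for img in doc[page_index].get_images(full=True):
                xref = img[0]  # first item is the image's cross-reference number
                info = doc.extract_image(xref)  # dict with raw bytes and extension
                target = out / image_filename(page_index, xref, info["ext"])
                target.write_bytes(info["image"])
                saved.append(target)
    finally:
        doc.close()

    return saved
```

Note that this only finds raster images embedded in the PDF. Charts drawn with vector graphics will not show up here.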
