How to extract charts/tables/graphs from PDF files using Python?

余生分开走 2021-01-19 03:03

I searched quite a bit but couldn't find a solution to this kind of problem, so I am posting a clear question about it. Most answers cover image/text extraction which

2 Answers
  • 2021-01-19 03:39

    Try using PyMuPDF (https://github.com/pymupdf/PyMuPDF/tree/1.18.3) to pull out the text, bars, lines, and axes that together make up a chart. It also has many extra utilities.
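As a rough sketch of that idea, the snippet below counts the vector primitives (lines, rectangles, curves) and words on one page, which can hint at whether the page holds a chart. It assumes a recent PyMuPDF release where the methods are named `get_text` and `get_drawings` (the 1.18.3 release linked above used camelCase names such as `getDrawings`); the helper names `summarize_drawings` and `page_vector_summary` are made up for illustration.

```python
from collections import Counter


def summarize_drawings(drawings):
    """Count drawing primitives by operator code.

    `drawings` is a list of path dicts as returned by PyMuPDF's
    Page.get_drawings(); each path has an "items" list whose entries
    start with a code such as "l" (line), "re" (rectangle), "c" (curve).
    """
    counts = Counter()
    for path in drawings:
        for item in path.get("items", []):
            counts[item[0]] += 1
    return dict(counts)


def page_vector_summary(pdf_path, page_number=0):
    """Hypothetical helper: summarize the words and vector shapes on one page."""
    import fitz  # PyMuPDF, imported lazily so the helper above works without it

    with fitz.open(pdf_path) as doc:
        page = doc[page_number]
        words = page.get_text("words")    # tuples: x0, y0, x1, y1, word, ...
        drawings = page.get_drawings()    # vector paths: lines, rects, curves
    return {"n_words": len(words), "primitives": summarize_drawings(drawings)}
```

Calling `page_vector_summary("report.pdf")` (the file name is a placeholder) returns the word count and a per-primitive tally. Many short lines plus a few rectangles and a handful of words often indicates a plot, but that is only a heuristic.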

  • 2021-01-19 03:53

    For extracting tables, you can use Camelot.

    Here is an article about it.
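A minimal sketch of table extraction with Camelot might look like the following. It assumes the `camelot-py` package is installed and that the tables have ruled cell borders (the `lattice` flavor); for tables separated only by whitespace, `flavor="stream"` is the usual alternative. The helper names `pages_arg` and `extract_tables` are hypothetical.

```python
def pages_arg(page_numbers):
    """Build the comma-separated page string that camelot.read_pdf expects."""
    return ",".join(str(n) for n in page_numbers)


def extract_tables(pdf_path, page_numbers=(1,), flavor="lattice"):
    """Hypothetical helper: return each detected table as a pandas DataFrame."""
    import camelot  # imported lazily; requires the camelot-py package

    tables = camelot.read_pdf(pdf_path, pages=pages_arg(page_numbers), flavor=flavor)
    return [table.df for table in tables]
```

For example, `extract_tables("report.pdf", page_numbers=[1, 2])` (placeholder file name) would return a list of DataFrames, one per table found on pages 1 and 2.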

    For images, I found this question and answer: "Extract images from PDF without resampling, in python?"
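One possible way to do this, not necessarily the method from the linked answer, is to copy the embedded image streams out with PyMuPDF. The sketch assumes a recent PyMuPDF release (`get_images`, `extract_image`); the function names and the output naming scheme are made up for illustration.

```python
import os


def image_filename(page_index, xref, ext):
    """Hypothetical naming scheme: 1-based page number plus the image xref."""
    return f"page{page_index + 1}_xref{xref}.{ext}"


def extract_embedded_images(pdf_path, out_dir="images"):
    """Write every embedded image to out_dir and return the saved paths."""
    import fitz  # PyMuPDF, imported lazily

    os.makedirs(out_dir, exist_ok=True)
    saved = []
    with fitz.open(pdf_path) as doc:
        for page_index in range(len(doc)):
            for img in doc[page_index].get_images(full=True):
                xref = img[0]                   # first field is the image xref
                info = doc.extract_image(xref)  # dict with "image" bytes and "ext"
                path = os.path.join(
                    out_dir, image_filename(page_index, xref, info["ext"])
                )
                with open(path, "wb") as fh:
                    fh.write(info["image"])
                saved.append(path)
    return saved
```

Note that this only recovers raster images embedded in the PDF. Charts drawn as vector graphics will not appear here and need the drawing-based approach instead.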
