solution to convert PDFs, DOCs, DOCXs into a textual format with python

后端 未结 4 1745
独厮守ぢ
独厮守ぢ 2021-01-16 17:29

I am developing a full text search engine for indexing popular binary formats. I know that there are hundereds of such questions (and solutions) already, but I found it toug

4条回答
  •  心在旅途
    2021-01-16 17:59

    Textract uses the default tools for every kind of file.

    https://github.com/deanmalmgren/textract

提交回复
热议问题