I am developing a full text search engine for indexing popular binary formats. I know that there are hundereds of such questions (and solutions) already, but I found it toug
Textract uses the default tools for every kind of file.
https://github.com/deanmalmgren/textract