Apache Tika Server - Request Header Parameters?
问题 The Apache Tika Server provides a Rest API to extract text from a document. It is also possible to set specific request header parameters like X-Tika-PDFOcrStrategy . e.g: $ curl -T test/Dokument01.pdf http://localhost:9998/tika --header "X-Tika-PDFOcrStrategy: ocr_only" From a lot of different documents about tika I found these documented additional header parameters: X-Tika-OCRLanguage: eng X-Tika-PDFextractInlineImages: true | false X-Tika-PDFOcrStrategy: ocr_only | ocr_and_text_extraction