PDFminer: extract text with its font information

后端 未结 6 1175
伪装坚强ぢ
伪装坚强ぢ 2021-02-08 03:26

I find this question, but it uses command line, and I do not want to call a Python script in command line using subprocess and parse HTML files to get the font information.

6条回答
  •  小蘑菇
    小蘑菇 (楼主)
    2021-02-08 03:49

    Some informations are in lower level, in the LTChar class. It seems logic because font size, italic, bold, etc, can be applied to a single character.

    More infos here : https://github.com/euske/pdfminer/blob/master/pdfminer/layout.py#L222

    But I'm still confuse about font color not in this class

提交回复
热议问题