How to get Unicode of the characters from PDF using java and PDFBox

后端 未结 2 680
死守一世寂寞
死守一世寂寞 2021-01-19 14:26

I am using Apache PDFBox and Java to parse the PDFs and get all the information from it. Extracting text is working fine for English only. For other languages I get only som

2条回答
  •  遥遥无期
    2021-01-19 14:54

    Try changing the Java system locale. From your Java program, this should be equivalent to changing the OS setting.

提交回复
热议问题