I am using Apache PDFBox with Java to parse PDFs and extract all the information from them. Text extraction works fine, but only for English. For other languages I get only som
Try changing the JVM's default locale. Setting it from within your Java program should have the same effect as changing the locale in the OS settings.
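A minimal sketch of what that could look like, using the standard `java.util.Locale` API. The class name `LocaleExample`, the helper `switchDefaultLocale`, and the `ru-RU` language tag are only placeholders for illustration; substitute the language of your documents. I can't promise this fixes the extraction, but it is cheap to try.

```java
import java.util.Locale;

public class LocaleExample {

    // Hypothetical helper: sets the JVM-wide default locale and returns
    // the previous one so the caller can restore it afterwards.
    static Locale switchDefaultLocale(Locale target) {
        Locale previous = Locale.getDefault();
        Locale.setDefault(target);
        return previous;
    }

    public static void main(String[] args) {
        // "ru-RU" is an assumed example; use the tag for your documents' language.
        Locale previous = switchDefaultLocale(Locale.forLanguageTag("ru-RU"));
        try {
            System.out.println("Default locale: " + Locale.getDefault().toLanguageTag());
            // Run your existing PDFBox text extraction here.
        } finally {
            // Restore the original locale so the rest of the program is unaffected.
            Locale.setDefault(previous);
        }
    }
}
```

If you would rather not touch the code, you can also start the JVM with `-Duser.language=ru -Duser.country=RU` on the command line, which sets the default locale at startup.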