tess4j

Tess4j on Windows 64-bit: exception on multiple threads

两盒软妹~` 提交于 2019-12-04 14:41:45
I am using tesseract 3 with Java 8 on Windows 64-bit to OCR scanned PDFs. I have followed the instructions on the Tess4j page and have used the 64-bit versions of the required DLLs, and have installed 64-bit Ghostscript. When I run my unit test with the normal @Test (no arguments), the code runs correctly , so I guess I have installed everything correctly. When I run it with 2 threads in parallel (see below) I get an exception. I have read the relevant thread here , but there it is suggested to use Tesseract1, which I am using (I have tried both). Any ideas? This is the code: // @Test // works

Tesseract user-pattern is not applied

心不动则不痛 提交于 2019-12-03 14:17:12
I want to do OCR on this image. This is pre-define format. ie first five will characters, then next four will be digits and last will be character. When I execute following command $ tesseract in.png stdout I get output as BDVPD474SQ So, I went for user-pattern. I created a file(in directory /usr/share/tesseract-ocr/tessdata/configs ) named as bazaar (its content is as follow) load_system_dawg F load_freq_dawg F user_patterns_suffix user-patterns I also created a file, named as eng.user-patterns in directory /usr/share/tesseract-ocr/tessdata (its content is as follow) \A\A\A\A\A\d\d\d\d\A

Why is tessdata throwing Exceptions?

流过昼夜 提交于 2019-12-02 08:38:35
问题 All is happening on MAC in SpringBoot WAR file deployed in Tomcat: I have tesseract 4.1.0 installed (via brew on Mac). For Tessdata - OCR on Mac <dependency> <groupId>net.sourceforge.tess4j</groupId> <artifactId>tess4j</artifactId> <version>4.4.0</version> </dependency> I also updated SpringBoot today to: <parent> <groupId>org.springframework.boot</groupId> <artifactId>spring-boot-starter-parent</artifactId> <version>2.1.6.RELEASE</version> <relativePath/> <!-- lookup parent from repository -