chinese character recognition using Tesseract OCR

后端 未结 1 815
慢半拍i
慢半拍i 2021-02-02 16:18

I have been using Tesseract 3.0.2 OCR SDK for image text extraction. But if I use Chinese text images and pass through OCR then Tesseract doesn\'t provide me the Chinese charact

相关标签:
1条回答
  • 2021-02-02 17:00

    You need to download chinese trained data (it will be a file like chi_sim.traineddata) and add it to your tessdata folder.

    To download the file https://github.com/tesseract-ocr/tessdata/raw/master/chi_sim.traineddata

    and use like this

    Tesseract* tesseract= [[Tesseract alloc] initWithDataPath:@"tessdata" language:@"chi_sim"];
    

    if you have any problem you can download my experiment with tessaract (with chinese language support) from https://github.com/aryansbtloe/ExperimentWithTesseract.git

    I have tested this one...Hope you will find this useful.

    0 讨论(0)
提交回复
热议问题