Pytesser set character whitelist

前端 未结 1 1049
野性不改
野性不改 2021-02-20 01:12

Does anyone know how to set the character whitelist for Pytesseract? I want it to only output A-z and 0-9. Is this possible? I have the following:

img = Image.op         


        
相关标签:
1条回答
  • 2021-02-20 02:00

    You can accomplish that with the below line. Or you can setup the config file for tesseract to do the same thing Limit characters tesseract is looking for

    pytesseract.image_to_string(question_img, config="-c tessedit_char_whitelist=0123456789abcdefghijklmnopqrstuvwxyz -psm 6")
    

    I am sure there are other ways to get it work, but this is what worked for me.

    0 讨论(0)
提交回复
热议问题