Tesseract ocr act weird while scalling up image size. How to know which scale factor is best for some particular types of image?
问题 I have this 006.jpg image and i tried following python code I downloaded "eng" from tessdata_best and renamed it to "eng_best" img = cv2.imread(file_path) lang = "eng_best" for img_scale_factor in range (1,8): print(file_path, img_scale_factor) img = cv2.resize(img,None,fx=img_scale_factor,fy=img_scale_factor) hocr_data = pytesseract.image_to_pdf_or_hocr(img, extension="hocr", lang=lang, config="--dpi 1") file_name = '{0:03d}_jpg_{1}_x{3}.{2}'.format(6, lang, "hocr", img_scale_factor) with