Background image cleaning for OCR
问题 Through tesseract-OCR I am trying to extract text from the following images with a red background. I have problems extracting the text in boxes B and D because there are vertical lines. How can I clean the background like this: input: output: some idea? The image without boxes: 回答1: Here are two methods to clean the image using Python OpenCV Method #1: Numpy thresholding Since the vertical lines, horizontal lines, and the background are in red we can take advantage of this and use Numpy