Extracting lines from an image to feed to OCR - Tesseract

后端 未结 3 1523
闹比i
闹比i 2021-02-10 20:57

I was watching this talk from pycon http://youtu.be/B1d9dpqBDVA?t=15m34s around the 15:33 mark the speaker talks about extracting lines from an image (receipt) and then feeding

相关标签:
3条回答
  • 2021-02-10 21:29

    Direct answer: you extract lines from an image with Hough Transform. You can find an analytical guide here. Text lines can be detected as well. Karlphillip's answer is based on Hough Transform too.

    enter image description here enter image description here

    0 讨论(0)
  • 2021-02-10 21:31

    i can tell u a simple technique to feed the images to OCR.. just perform some operations to get the ROI (Region of Interest) of ur image, and localize the area where the image after binarizing it.. then you may find contours, and by keeping the threasholding value, and setting the required contour area, you can feed the resulting image to OCR :) .. (sorry for bad way of explaination)

    0 讨论(0)
  • 2021-02-10 21:42

    Take a look at the technique used to detect the skew angle of a text.

    Groups are lines are used to isolate text on an image (this is the interesting part).

    From this result you can easily detect the upper/lower limits of each line of text. The text itself will be located inside them. I've faced a similar problem before, the code might be useful to you:

    All you need to do from here is crop each pair of lines and feed that as an image to Tesseract.

    0 讨论(0)
提交回复
热议问题