Connected Character segmentation in OpenCV

爱⌒轻易说出口 提交于 2019-11-29 15:16:13

问题


What is a good method to segment characters that are united as in the following figure, knowing that:

  • characters have this font, but the font size varies based on the image size
  • only isolated groups of characters from the image are connected

Also, how can i detect if in a given bounding box, there are 2 or more letters which are connected?

I tried with checking for width > height for detecting connected characters but it doesn't work for the blue groups in the image.

I also tried a segmentation method based on: Article section 3.4 for separating characters but got poor results.


回答1:


IDEA: if you have a good ocr already, you can try to apply ocr all these connected components (or contours). If ocr cant detect a letter; than there is not 1 letter, there are 2 or more.

IDEA: check convexity defects of these connected components, the closest defect points are where the bridges are.

IDEA: use a kernel having small width & big height for erosion+dilation (morphological opening)

IDEA: take y-derivative of the image. The smallest contours (or lines) left will be your bridges. Mark them and erase those pixels from the original image.

IDEA: search problem approach: Take 2 letters from alphabet (and this font), connect them horizontally with some tool and use matchShapes method (moment match) of opencv to find if that shape matches with your connected component. Or try to implement autocorrelation.

good luck.



来源:https://stackoverflow.com/questions/20194103/connected-character-segmentation-in-opencv

易学教程内所有资源均来自网络或用户发布的内容,如有违反法律规定的内容欢迎反馈
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!