Google Cloud Vision - Numbers and Numerals OCR

伪装坚强ぢ 2021-01-17 11:11

I've been trying to implement an OCR program with Python that reads numbers with a specific format, XXX-XXX. I used Google's Cloud Vision API Text Recognition, but the res

2 Answers
  •  北海茫月
    2021-01-17 11:36

    At the moment it is not possible to add constraints or to specify an expected number format in Vision API requests, as mentioned here (by the Project Manager of the Cloud Vision API).

    You can also check all the possible request parameters (in the API reference); none of them lets you specify a number format. Currently the only options are:

    • latLongRect: specifies the geographic location of the image
    • languageHints: indicates the expected language for text_detection (list of supported languages here)
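
    As a minimal sketch of where languageHints sits in a request, the snippet below builds the JSON body for the REST images:annotate method by hand. The function name build_text_detection_request is hypothetical and only for illustration; sending the body (authentication, HTTP call) is left out.

```python
import base64
import json


def build_text_detection_request(image_bytes, language_hints=None):
    """Build the JSON body for one TEXT_DETECTION request (hypothetical helper)."""
    request = {
        # The REST API expects the raw image bytes as a base64 string.
        "image": {"content": base64.b64encode(image_bytes).decode("ascii")},
        "features": [{"type": "TEXT_DETECTION"}],
    }
    if language_hints:
        # languageHints goes under imageContext. Note that there is no
        # comparable field for an expected number format such as XXX-XXX.
        request["imageContext"] = {"languageHints": list(language_hints)}
    return {"requests": [request]}


if __name__ == "__main__":
    # Placeholder bytes; a real call would read an image file instead.
    body = build_text_detection_request(b"fake image bytes", ["en"])
    print(json.dumps(body, indent=2))
```

    The hint only nudges language detection; it does not restrict the output to digits or to a pattern.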

    I assume you have already checked the multiple responses (each covering a different image region) to see whether you could reconstruct the text from the locations of the individual digits?
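
    A rough sketch of that reconstruction idea, assuming the word-level entries of the response are available as plain dicts in the REST shape (a description plus boundingPoly.vertices) and that the number sits on a single line, so sorting by horizontal position is enough. The function names and the sample data are made up for illustration.

```python
import re

# Expected format XXX-XXX, not embedded in a longer run of digits.
CODE_PATTERN = re.compile(r"(?<!\d)\d{3}-\d{3}(?!\d)")


def left_edge(annotation):
    """Smallest x coordinate of the fragment's bounding polygon."""
    vertices = annotation["boundingPoly"]["vertices"]
    # The REST response may omit coordinates that are zero, hence the default.
    return min(v.get("x", 0) for v in vertices)


def reconstruct_code(word_annotations):
    """Join fragments left to right and return the first XXX-XXX match, or None."""
    ordered = sorted(word_annotations, key=left_edge)
    joined = "".join(a["description"] for a in ordered)
    joined = re.sub(r"\s+", "", joined)  # drop stray whitespace between fragments
    match = CODE_PATTERN.search(joined)
    return match.group(0) if match else None


if __name__ == "__main__":
    # Made-up fragments, deliberately out of order.
    sample = [
        {"description": "456", "boundingPoly": {"vertices": [{"x": 120, "y": 5}, {"x": 180, "y": 5}]}},
        {"description": "123", "boundingPoly": {"vertices": [{"x": 10, "y": 5}, {"x": 70, "y": 5}]}},
        {"description": "-", "boundingPoly": {"vertices": [{"x": 90, "y": 5}, {"x": 100, "y": 5}]}},
    ]
    print(reconstruct_code(sample))
```

    Since the API itself cannot be constrained, a filter like this is applied after the fact; it cannot recover digits the OCR misread.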

    Note that the Vision API and text_detection are not optimized for your data specifically. If you have a lot of annotated data, building your own model with TensorFlow is also an option. This blog post explains a system setup to detect number plates (with a specific number format). All the code is available on GitHub, and the problem seems closely related to yours.
