How to group blocks that are part of a bigger sentence in Google Cloud Vision API?
Question: I am using the Google Cloud Vision API from Python to detect text on the hoarding boards that are usually found above a shop or store. So far I have been able to detect individual words and the coordinates of their bounding polygons. Is there a way to group the detected words based on their relative positions and sizes? For example, the name of the store is usually written in a single size, and its words are aligned. Does the API provide functions that group the words which are probably parts of a
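To make the idea of grouping by relative position and size concrete, here is a minimal sketch of one possible heuristic. It is not a feature of the Vision API: the `Word` class, the `group_words` function, and the `height_tol` and `max_gap` thresholds are all hypothetical names and values chosen for illustration. It assumes each detected word has already been reduced to its text plus the four (x, y) corners of its bounding polygon, and that the text is roughly horizontal. Note also that the response's `full_text_annotation` already nests words under paragraphs and blocks, which may be enough in some cases.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Word:
    """Hypothetical container: a detected word and its bounding polygon corners."""
    text: str
    vertices: List[Tuple[float, float]]  # four (x, y) corners

    @property
    def left(self) -> float:
        return min(x for x, _ in self.vertices)

    @property
    def right(self) -> float:
        return max(x for x, _ in self.vertices)

    @property
    def top(self) -> float:
        return min(y for _, y in self.vertices)

    @property
    def bottom(self) -> float:
        return max(y for _, y in self.vertices)

    @property
    def height(self) -> float:
        return self.bottom - self.top

    @property
    def center_y(self) -> float:
        return (self.top + self.bottom) / 2


def group_words(words: List[Word], height_tol: float = 0.3,
                max_gap: float = 1.5) -> List[str]:
    """Greedily join words that have similar height, sit on the same
    line, and are close together. Thresholds are illustrative guesses."""
    groups: List[List[Word]] = []
    # Visit words left to right so each group grows in reading order.
    for word in sorted(words, key=lambda w: w.left):
        for group in groups:
            last = group[-1]
            ref = max(last.height, word.height)
            similar_size = abs(last.height - word.height) <= height_tol * ref
            same_line = abs(last.center_y - word.center_y) <= 0.5 * ref
            close = word.left - last.right <= max_gap * ref
            if similar_size and same_line and close:
                group.append(word)
                break
        else:
            # No compatible group found: start a new one.
            groups.append([word])
    return [" ".join(w.text for w in g) for g in groups]


# Made-up boxes: a large store name above a smaller tagline.
sample = [
    Word("JOE'S", [(10, 10), (110, 10), (110, 50), (10, 50)]),
    Word("BAKERY", [(125, 12), (260, 12), (260, 52), (125, 52)]),
    Word("open", [(10, 70), (50, 70), (50, 85), (10, 85)]),
    Word("daily", [(56, 70), (100, 70), (100, 85), (56, 85)]),
]
print(group_words(sample))  # ["JOE'S BAKERY", 'open daily']
```

The thresholds are expressed relative to word height so the same values behave similarly on large and small lettering; rotated or curved text would need a different alignment test.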