How can this be done with the Google Vision API, please?
Currently, Label Detection does not provide this functionality. We are always looking at ways to enhance the API.
Two years later, it's still the same. I am facing similar challenges and am thinking of opting for other solutions. I think custom solutions like the TensorFlow Object Detection API or Darknet YOLO would do this job very easily.