I have a Image dataset of 500 images, which has 10 objects (labels) that have been annotated. The annotation has been made in Pascal VOC format ( 500 .xml files one for each