I have a set of 4 images that each have 1 of 6 classes. Each class can only be present in the set once. The model is trained to detect the viewing direction of objects (fron