I\'m working on a deep learning project trying to automatically detect the joints of people given an image, and I am stuck trying to feed the data in the correct format to m