My goal is to find unsupervised full body landmarks. For that purpose I am using an autoencoder structure to disentangle shape and appearance of full body images (deep fashion d