I have a very large dataset that needs to be used for classification, I sampled the data, but that does not guarantee that I will have the whole labels in my output. How can