Suppose we are training an MNIST classifier, so the network has 10 outputs, one per digit (0-9). When the NN outputs its probabilities, how do we know which probability corresponds to w