I\'m starting to study deep learning and I\'m practicing softmax classification with MNIST datset. It was okay when I used sigmoid function, but cost didn\'t de