I wrote a squared loss function for categorisation of one hot encoded data
def squared_categorical_loss(y_true, y_pred): return K.mean(K.square(1.0 - K.sum(y_