In every forward pass of the model, I want to implement l2 normalization on the softmax layer\'s columns, then set the weights back as per the imprinted weights paper and th