RNN Regularization: Which Component to Regularize?
Question:

I am building an RNN for classification (there is a softmax layer after the RNN). There are many options for what to regularize, and I am not sure whether I should just try all of them. Would the effect be the same? Which components should I regularize in which situation?

The components are:

- Kernel weights (layer input)
- Recurrent weights
- Bias
- Activation function (layer output)

Answer 1:

The regularizers that work best will depend on your specific architecture, data, and problem; as usual, there isn't a single