The original Grad-CAM paper (link) mentions obtaining the gradients of the class score y^c before softmax. However, the Keras implementation (link) does not seem to do