CS231n: How to calculate gradient for Softmax loss function?

后端 未结 2 878
野趣味
野趣味 2021-01-31 03:43

I am watching some videos for Stanford CS231: Convolutional Neural Networks for Visual Recognition but do not quite understand how to calculate analytical gradient for softmax l

2条回答
  •  南笙
    南笙 (楼主)
    2021-01-31 04:13

    I know this is late but here's my answer:

    I'm assuming you are familiar with the cs231n Softmax loss function. We know that: enter image description here

    So just as we did with the SVM loss function the gradients are as follows: enter image description here

    Hope that helped.

提交回复
热议问题