I would like to use Pytorch to learn the 6 parameters in this small CRF model. I can implement a forward method, loss function and optimizer - but it\'s unclear how to "