PyTorch - Sharing gradients between different iterations doesn't improve performance

前端 未结 0 1661
时光说笑
时光说笑 2020-12-30 06:16

I need to find the largest eigenvalue of the NN, so I\'m implementing the power method. The main function is a hessian-vector product (HVP), based on PyHessian.

The m

相关标签:
回答
  • 消灭零回复
提交回复
热议问题