Actor optimization for deep reinforcement learning: a toy model

前端 未结 0 1081
灰色年华
灰色年华 2020-12-01 13:21

Consider a function Q(s,a), and we are interested in a (very simple) task, which is to find:

Importantly, we want to do so by training a neural network that o

相关标签:
回答
  • 消灭零回复
提交回复
热议问题