Target values to train against in Deep Q Network

前端 未结 0 1621
庸人自扰
庸人自扰 2021-01-14 10:01

I just recently gotten myself into the concepts of reinforcement learning. I understand the whole gist of Q-learning and its update equation:

Q(s, a) = r + gamma * ma

相关标签:
回答
  • 消灭零回复
提交回复
热议问题