PPO implementation in tensorflow does not converge

后端 未结 0 1202
说谎
说谎 2021-02-04 03:27

Based on pytorch-a2c-ppo-acktr-gail and tf-a2c-ppo I based my implementation of PPO in tensorflow. The A2C and PPO share the same model which converges perfectly fine for A2C.

相关标签:
回答
  • 消灭零回复
提交回复
热议问题