RLlib offline data preparation for SAC

天涯浪人 2021-01-18 17:11

I have some offline experiences, (s, a, r, s') tuples, that were generated with a heuristic, and I want to use them when training SAC agents. Using the example saving_experiences
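To make the data layout concrete, here is a minimal sketch of how a list of (s, a, r, s') tuples could be turned into per-step rows keyed by the column names RLlib uses for sample batches (`obs`, `actions`, `rewards`, `new_obs`, `dones`, `t`, `eps_id`). The helper name `transitions_to_rows` is hypothetical, not part of RLlib. It assumes the tuples form a single episode in time order and that the last tuple ends the episode.

```python
from typing import Any, Dict, List, Sequence, Tuple

# One heuristic-generated transition: (s, a, r, s')
Transition = Tuple[Sequence[float], Sequence[float], float, Sequence[float]]


def transitions_to_rows(
    transitions: List[Transition], eps_id: int = 0
) -> List[Dict[str, Any]]:
    """Hypothetical helper: convert (s, a, r, s') tuples into per-step rows.

    Assumption: the tuples belong to one episode, in order, and the final
    tuple is terminal, so only the last row gets dones=True.
    """
    rows = []
    last = len(transitions) - 1
    for t, (s, a, r, s_next) in enumerate(transitions):
        rows.append(
            {
                "t": t,                    # step index within the episode
                "eps_id": eps_id,          # episode identifier
                "obs": list(s),            # s
                "actions": list(a),        # a
                "rewards": float(r),       # r
                "new_obs": list(s_next),   # s'
                "dones": t == last,        # assumed: episode ends at last tuple
            }
        )
    return rows


if __name__ == "__main__":
    demo = [
        ([0.0, 1.0], [0.5], 1.0, [1.0, 2.0]),
        ([1.0, 2.0], [-0.5], 0.0, [2.0, 3.0]),
    ]
    for row in transitions_to_rows(demo):
        print(row)
```

If I read the saving_experiences example correctly, each row could then be passed as keyword arguments to `SampleBatchBuilder.add_values(...)`, with the built batch written out by `JsonWriter`. Treat that as an assumption to check against the RLlib version in use, since the example may expect extra columns.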
