Reinforcement Learning With Variable Actions

后端 未结 3 1292
眼角桃花
眼角桃花 2021-02-05 20:28

All the reinforcement learning algorithms I\'ve read about are usually applied to a single agent that has a fixed number of actions. Are there any reinforcement learning algorit

3条回答
  •  面向向阳花
    2021-02-05 21:02

    What you describe is nothing unusual. Reinforcement learning is a way of finding the value function of a Markov Decision Process. In an MDP, every state has its own set of actions. To proceed with reinforcement learning application, you have to clearly define what the states, actions, and rewards are in your problem.

提交回复
热议问题