All the reinforcement learning algorithms I\'ve read about are usually applied to a single agent that has a fixed number of actions. Are there any reinforcement learning algorit
What you describe is nothing unusual. Reinforcement learning is a way of finding the value function of a Markov Decision Process. In an MDP, every state has its own set of actions. To proceed with reinforcement learning application, you have to clearly define what the states, actions, and rewards are in your problem.