I am building a simulation of a production process using simpy and an agent (Heuristic or Reinforcement Learning via tensorforce) that allocates orders to s