How can I predict the numbers in a list or 2D array using reinforcement learning? For example, if I need to assign the numbers of one list to a second list, I have to predict th