I'm trying to calculate the number of possible states in my reinforcement learning (Q-Learning) project, but I don't see how to work it out.
I have a network with 4 s