I am training a DQN that takes the previous two observations as its input. My model consists of a single Dense layer with 20 units, and the output is the probabilities
Dense
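
For concreteness, the setup described above might look roughly like the following. This is only a framework-free sketch in plain Python, not the actual model: the observation size (`OBS_DIM = 4`), the number of actions (`N_ACTIONS = 2`), the ReLU activation, the weight initialisation, and the helper names `init_dense`, `dense`, and `stack` are all assumptions made for illustration. It also assumes a separate linear output layer that produces one value per action, which is the usual DQN arrangement; a softmax would be needed on top to get probabilities.

```python
from collections import deque
import random

OBS_DIM = 4     # assumed observation size (not stated in the post)
N_ACTIONS = 2   # assumed number of actions (not stated in the post)
HIDDEN = 20     # the single 20-unit Dense layer from the post


def init_dense(n_in, n_out, rng):
    """Return (weights, biases); weights[j] holds the input weights of unit j."""
    scale = 1.0 / n_in ** 0.5
    weights = [[rng.uniform(-scale, scale) for _ in range(n_in)]
               for _ in range(n_out)]
    return weights, [0.0] * n_out


def dense(x, layer, relu=False):
    """Apply a fully connected layer to the flat input vector x."""
    weights, biases = layer
    out = [sum(w * xi for w, xi in zip(row, x)) + b
           for row, b in zip(weights, biases)]
    return [max(0.0, v) for v in out] if relu else out


def stack(history):
    """Concatenate the stored observations (oldest first) into one input."""
    return [v for obs in history for v in obs]


rng = random.Random(0)
hidden_layer = init_dense(2 * OBS_DIM, HIDDEN, rng)   # input: two observations
output_layer = init_dense(HIDDEN, N_ACTIONS, rng)     # assumed output layer

# Keep only the two most recent observations.
history = deque(maxlen=2)
first_obs = [0.1, -0.2, 0.3, 0.0]
history.append(first_obs)
history.append(first_obs)      # assumed convention: duplicate the first frame
next_obs = [0.2, -0.1, 0.25, 0.05]
history.append(next_obs)       # the oldest observation is dropped

x = stack(history)                                   # length 2 * OBS_DIM
q_values = dense(dense(x, hidden_layer, relu=True), output_layer)
action = max(range(N_ACTIONS), key=lambda a: q_values[a])  # greedy action
```

The `deque(maxlen=2)` is what keeps the "previous two observations" window: each new observation pushes the oldest one out, so the network input always has a fixed length of `2 * OBS_DIM`.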