I create an initial network model with the following acrchitecture.
def create_model(env): dropout_prob = 0.8 #aggresive dropout regularization num_u