I am training a model with two optimizers. While the optimizers are optimizing for the same loss function, one optimizer only optimizes for a small subset of parameters and the