So I am trying to implement quantization-aware training for a custom PyTorch model. I have been looking at the functionality PyTorch (and also other frameworks, such as Tens