I am trying to understand the difference between using SWA (the TensorFlow Addons implementation) with and without callbacks. I have two implementations, and both of them work fin