I\'d like to know the possible ways to implement batch normalization layers with synchronizing batch statistics when training with multi-GPU.
Caffe May
A specialized keras layer SyncBatchNormalization is available Since TF2.2 https://www.tensorflow.org/api_docs/python/tf/keras/layers/experimental/SyncBatchNormalization