I'd like to know the possible ways to implement batch normalization layers that synchronize batch statistics across GPUs during multi-GPU training.
Caffe May
I've figured out a way to implement sync batch norm in pure TensorFlow and pure Python.
The code makes it possible to train PSPNet on Cityscapes and get comparable performance.
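For anyone wondering what "synchronizing batch statistics" means in practice, here is a minimal NumPy sketch of the idea. It is not the code mentioned above, just an illustration under some assumptions: each list entry stands in for one GPU's share of the batch, the Python `sum` calls stand in for a cross-device all-reduce, and the helper names `sync_batch_stats` and `sync_batch_norm` are hypothetical.

```python
import numpy as np

def sync_batch_stats(shards):
    """Combine per-device sums into global per-channel mean and variance.

    shards: list of arrays shaped (n_i, channels), one per simulated device.
    """
    # Each device only needs to contribute three small quantities.
    count = sum(x.shape[0] for x in shards)
    total = sum(x.sum(axis=0) for x in shards)
    total_sq = sum((x ** 2).sum(axis=0) for x in shards)

    mean = total / count
    # Var[x] = E[x^2] - E[x]^2; clamp at 0 to guard against rounding error.
    var = np.maximum(total_sq / count - mean ** 2, 0.0)
    return mean, var

def sync_batch_norm(shards, gamma, beta, eps=1e-5):
    """Normalize every shard with the shared (global) statistics."""
    mean, var = sync_batch_stats(shards)
    return [gamma * (x - mean) / np.sqrt(var + eps) + beta for x in shards]

# Simulate a batch of 8 samples with 3 channels split across 2 devices.
rng = np.random.default_rng(0)
full_batch = rng.normal(size=(8, 3))
shards = np.split(full_batch, 2)
normalized = sync_batch_norm(shards, gamma=np.ones(3), beta=np.zeros(3))
```

The point is that the global statistics match what a single device would compute on the full batch, whereas unsynchronized batch norm would normalize each shard with its own mean and variance. In a real multi-GPU setup the sums would be exchanged with a collective operation, and gradients would also have to flow through that exchange.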