Ways to implement multi-GPU BN layers with synchronizing means and vars

前端 未结 3 1650
盖世英雄少女心
盖世英雄少女心 2021-02-05 22:53

I\'d like to know the possible ways to implement batch normalization layers with synchronizing batch statistics when training with multi-GPU.

Caffe May

3条回答
  •  攒了一身酷
    2021-02-05 23:20

    I've figured out a way to implement sync batch norm in pure tensorflow and pure python.

    The code makes it possible to train PSPNet on Cityscapes and get comparable performance.

提交回复
热议问题