Instance Normalisation vs Batch normalisation

前端未结

关注

 4  1379

攒了一身酷 2021-01-29 19:25

I understand that Batch Normalisation helps in faster training by turning the activation towards unit Gaussian distribution and thus tackling vanishing gradients problem. Batch

4条回答

生来不讨喜 (楼主)

2021-01-29 20:21
I wanted to add more information to this question since there are some more recent works in this area. Your intuition

use instance normalisation for image classification where class label should not depend on the contrast of input image

is partly correct. I would say that a pig in broad daylight is still a pig when the image is taken at night or at dawn. However, this does not mean using instance normalization across the network will give you better result. Here are some reasons:
1. Color distribution still play a role. It is more likely to be a apple than an orange if it has a lot of red.
2. At later layers, you can no longer imagine instance normalization acts as contrast normalization. Class specific details will emerge in deeper layers and normalizing them by instance will hurt the model's performance greatly.
IBN-Net uses both batch normalization and instance normalization in their model. They only put instance normalization in early layers and have achieved improvement in both accuracy and ability to generalize. They have open sourced code here.
0 讨论(0)

查看其它4个回答
发布评论:

提交评论
- 加载中...