Does Convolutional Neural Network possess localization abilities on images?

落花浮王杯 提交于 2019-11-30 07:44:43

Thats an open problem in image recognition. Besides sliding windows, existing approaches include predicting object location in image as CNN output, predicting borders (classifiyng pixels as belonging to image boundary or not) and so on. See for example this paper and references therein.

Also note that with CNN using max-pooling, one can identify positions of feature detectors that contributed to object recognition, and use that to suggest possible object location region.

There are some recent techniques to localize the objects in CNN's. See this paper http://cnnlocalization.csail.mit.edu/Zhou_Learning_Deep_Features_CVPR_2016_paper.pdf

It uses a layer called Global Average Pooling (GAP), and with no additional work, the CNN can localize the object it recognizes.

Also checkout this really good blog post: https://alexisbcook.github.io/2017/global-average-pooling-layers-for-object-localization/

易学教程内所有资源均来自网络或用户发布的内容,如有违反法律规定的内容欢迎反馈
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!