Does Convolutional Neural Network possess localization abilities on images?

问题

As far as I know, CNN rely on sliding window techniques and can only indicate if a certain pattern is present or not anywhere in given bounding boxes. Is that true?

Can one achieve localization with CNN without any help of such techniques?

回答1:

Thats an open problem in image recognition. Besides sliding windows, existing approaches include predicting object location in image as CNN output, predicting borders (classifiyng pixels as belonging to image boundary or not) and so on. See for example this paper and references therein.

Also note that with CNN using max-pooling, one can identify positions of feature detectors that contributed to object recognition, and use that to suggest possible object location region.

回答2:

There are some recent techniques to localize the objects in CNN's. See this paper http://cnnlocalization.csail.mit.edu/Zhou_Learning_Deep_Features_CVPR_2016_paper.pdf

It uses a layer called Global Average Pooling (GAP), and with no additional work, the CNN can localize the object it recognizes.

Also checkout this really good blog post: https://alexisbcook.github.io/2017/global-average-pooling-layers-for-object-localization/

来源：https://stackoverflow.com/questions/28178054/does-convolutional-neural-network-possess-localization-abilities-on-images

标签

computer-vision

neural-network

feature-detection

deep-learning

易学教程内所有资源均来自网络或用户发布的内容，如有违反法律规定的内容欢迎反馈！
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!