We are very familiar with regular image recognition and achived great performance, such as cat-dog recognition. But I come cross a slightly different task, say how to perform im