I recently started learning deep learning and am working on a crowd detection project. I thought of using R-FCN, but I don't have a clear enough idea about it, like how is it going to w