My dataset contains videos with varying length and frame rates (25 fps with max length of approx. 10 min). My goal is to classify videos by recognizing certain type of act