I am solving https://www.kaggle.com/c/ieee-fraud-detection problem with datasets which are huge. So before doing any machine learning stuffs I want to reduce the size of dataset