I have a 12 GB .tgz file. Inside it are .csv.gz files. I want to use this data for machine learning to classify users by category. Before I jump into this big fi