I have 100,000\'s of tar.gz files that contain a JSON file and a CSV file. I\'d like to load only the CSV files into a DataFrame using Pyspark. It needs to be done
tar.gz