Pyspark: add specific file in tar.gz file to DataFrame

前端 未结 0 810
日久生厌
日久生厌 2021-02-10 18:34

I have 100,000\'s of tar.gz files that contain a JSON file and a CSV file. I\'d like to load only the CSV files into a DataFrame using Pyspark. It needs to be done

相关标签:
回答
  • 消灭零回复
提交回复
热议问题