Pyspark: Load similar parquets from different directories and combine into one DataFrame with the folder name as a column

前端 未结 0 1753
小蘑菇
小蘑菇 2021-02-04 22:35

I have several parquets in a similar folder structure:

\'/raw-files/17001/result.parquet\'
\'/raw-files/17002/result.parquet\'
\'/raw-files/...../result.parquet\'         


        
相关标签:
回答
  • 消灭零回复
提交回复
热议问题