Repartitioning Skewed Dataframes in Spark

北恋 2021-01-24 17:18

I have a question about PySpark.

After aggregating, I have really skewed data (some partitions are massive).

If I repartition, it takes ages, as the data
