Apache Spark (PySpark): sort parquet data globally and divide in equal size

前端 未结 0 1277
走了就别回头了
走了就别回头了 2020-11-29 16:44

Is that possible to have parquet data sorted across all partitions and additionally divide the data in equal partitions (counting how much data I have and dividing into 64 o

相关标签:
回答
  • 消灭零回复
提交回复
热议问题