Pyspark num of actual partitions in groupby vs shuffle partitions for Dataframe

前端 未结 0 392
隐瞒了意图╮
隐瞒了意图╮ 2020-11-30 03:02

I have a Movielens CSV dataset file with columns as \'movieID\',\'UserID\', \'Rating\', \'Timestamp\'. I aggregated each movie rating by count and average. Below is my code.

相关标签:
回答
  • 消灭零回复
提交回复
热议问题