Does pyspark.sql GroupedData put each group into a single partition?

孤独总比滥情好 2021-02-03 10:26

When I use a group aggregate pandas UDF in GroupedData.agg(), how is the UDF applied to the data within a single group? I would think that Spark first partitions the data
