Performance of UDAF versus Aggregator in Spark

问题

I am trying to write some performance-mindful code in Spark and wondering whether I should write an Aggregator or a User-defined Aggregate Function (UDAF) for my rollup operations on a Dataframe.

I have not been able to find any data anywhere on how fast each of these methods are and which you should be using for spark 2.0+.

来源：https://stackoverflow.com/questions/45356452/performance-of-udaf-versus-aggregator-in-spark

标签

performance

apache-spark

spark-dataframe

aggregate-functions

apache-spark-2.0

易学教程内所有资源均来自网络或用户发布的内容，如有违反法律规定的内容欢迎反馈！
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!