Spark : How to append to cached rdd?

不羁的心 提交于 2019-12-13 05:18:37

问题


  • Distinct values are cached with every streamed batch of data.
  • How do i build the cache by adding the next distinct values in the next batch to the already cached RDD?

回答1:


You can not directly append your data with Rdd because its immutable. Using union to create new Rdd and then cache it.



来源:https://stackoverflow.com/questions/34077905/spark-how-to-append-to-cached-rdd

易学教程内所有资源均来自网络或用户发布的内容,如有违反法律规定的内容欢迎反馈
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!