How to filter dstream using transform operation and external RDD?

后端 未结 1 1764
清酒与你
清酒与你 2021-01-20 00:54

I used transform method in a similar use case as described in Transform Operation section of Transformations on DStreams:

spamI         


        
相关标签:
1条回答
  • 2021-01-20 01:21

    The difference between the Spark doc example and your code is the use of ssc.checkpoint().

    Although the specific code example you provided will work without checkpoint, I guess you actually require it. But the concept of introducing an external RDD into the scope of a checkpointed DStream is potentially invalid: when recovering from a checkpoint, the external RDD may have changed.

    I tried to checkpoint the external RDD, but I had no luck with it either.

    0 讨论(0)
提交回复
热议问题