Checkpointing In ALS Spark Scala

后端 未结 1 1179
予麋鹿
予麋鹿 2020-12-21 02:16

I just want to ask on the specifics how to successfully use checkpointInterval in Spark. And what do you mean by this comment in the code for ALS: https://github.com/apache/

相关标签:
1条回答
  • 2020-12-21 02:18

    How can we set checkPoint directory? Can we use any hdfs-compatible directory for this?

    You can use SparkContext.setCheckpointDir. As far as I remember in local mode both local and DFS paths work just fine, but on the cluster the directory must be a HDFS path.

    Is using setCheckpointInterval the correct way to implement checkpointing in ALS to avoid Stack Overflow errors?

    It should help. See SPARK-1006

    PS: It seems that in order to actually perform check-point in ALS, the checkpointDir must be set or check-pointing won't be effective [Ref. here.]

    0 讨论(0)
提交回复
热议问题