Is there a way to persist or save the pipeline model in pyspark 1.6?

问题

I understand that this is a duplicate question which was asked here saving pipeline model in pyspark 1.6 but there is still no definite answer to it. Can anyone please suggest anything?

joblib or cPickle doesn't work as it gives the same error which is given in the previous link. Is there a way to save the pipeline in PySpark 1.6 or there isn't? The questions that I saw regarding model persistence were mainly related to persisting ML models. Saving a pipeline is the altogether differnt issue. Is there any hack that is available? I need the Python (v2.7) implementation. Any help is appreciated. I am using the RandomForestClassifier from pyspark.ml as a classification algorithm and my environment is Spark 1.6 and Python 2.7 (if this is of any help)

来源：https://stackoverflow.com/questions/43111624/is-there-a-way-to-persist-or-save-the-pipeline-model-in-pyspark-1-6

标签

python

pyspark

pipeline

易学教程内所有资源均来自网络或用户发布的内容，如有违反法律规定的内容欢迎反馈！
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!