saving prediction from pyspark into HDFS is very slow on yarn (cluster or client)

后端 未结 0 1182
一个人的身影
一个人的身影 2021-02-02 06:53
df = rf_model.transform(df).select("id", "probability", "prediction")

df = df.withColumn("prediction", df["prediction"         


        
相关标签:
回答
  • 消灭零回复
提交回复
热议问题