How is the “hash” function used when splitting a dataset into ML training and test set in Pyspark?

前端 未结 0 1531
醉话见心
醉话见心 2021-01-24 05:05

I am going through a tutorial on how to apply ML in Pyspark. In particular, the tutorial starts with a dataset named iris_dataset, which has 150 rows, each represen

相关标签:
回答
  • 消灭零回复
提交回复
热议问题