I am trying to speed up a Spark ML pipeline on Databricks.
Currently, it takes 35 minutes to train the pipeline on a DataFrame with 2000 rows and 3 columns.
B