I have implemented Ridge Regression using PySpark. I use it on a dataset which size is approximately 1.7 millions rows and 231 columns. It occupies more or less 1 GB. My imp