I am currently building a logit regression model on Apache Spark using PySpark (I have no choice of platform). I have used the GeneralizedLinearRegression model, where the coefficien