I have a data frame of 217 columns. Till now I was sampling it by giving a specific number. The sampled data-frame was then used to apply a LinearRegression model to predict