I have an ML project in which some rows have missing values. In that case, I replace each missing value with the mean of its column and add a nan_mask column for each column, marking which entries were originally missing.
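
For concreteness, here is a minimal sketch of that preprocessing step, assuming the features sit in a NumPy float array with missing entries encoded as NaN. The function name `impute_with_mask` and the toy data are illustrative only, not taken from the actual project.

```python
import numpy as np

def impute_with_mask(X):
    """Mean-impute NaNs column-wise and append one mask column per feature.

    Hypothetical helper for illustration. Assumes every column has at least
    one observed value (an all-NaN column would yield a NaN mean).
    """
    X = np.asarray(X, dtype=float)
    nan_mask = np.isnan(X)                 # True where a value was missing
    col_means = np.nanmean(X, axis=0)      # per-column mean, ignoring NaNs
    # Broadcast the column means into the missing positions.
    X_filled = np.where(nan_mask, col_means, X)
    # Layout: [imputed features | 0/1 mask columns]
    return np.hstack([X_filled, nan_mask.astype(float)])

X = np.array([[1.0, np.nan],
              [3.0, 4.0],
              [np.nan, 8.0]])
out = impute_with_mask(X)
print(out)
```

With two feature columns, the result has four columns: the two imputed features followed by their two mask columns. If there is a train/test split, the column means would normally be computed on the training rows only and reused for the test rows.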