Normalize data before or after split of training and testing data?
Question

I want to separate my data into training and test sets. Should I apply normalization to the data before or after the split? Does it make any difference when building a predictive model? Thanks in advance.

Answer 1:

You first need to split the data into training and test sets (a validation set might also be required). Don't forget that the test data points represent real-world data. Feature normalization (or data standardization) of the explanatory (or predictor) variables is a technique used to center and