I have two files, train.csv and test.csv, and I am trying to clean the datasets and apply feature engineering to each of them separately. Suppose that in train.csv I have a column du