How to use scikit's preprocessing/normalization along with cross validation?

前端 未结 1 1015
生来不讨喜
生来不讨喜 2021-02-19 07:02

As an example of cross-validation without any preprocessing, I can do something like this:

    tuned_params = [{\"penalty\" : [\"l2\", \"l1\"]}]
    from sklearn         


        
1条回答
  •  自闭症患者
    2021-02-19 07:33

    Per the documentation, if you employ Pipeline, this can be done for you. From the docs, just above section 3.1.1.1, emphasis mine:

    Just as it is important to test a predictor on data held-out from training, preprocessing (such as standardization, feature selection, etc.) and similar data transformations similarly should be learnt from a training set and applied to held-out data for prediction [...] A Pipeline makes it easier to compose estimators, providing this behavior under cross-validation[.]

    More relevant information on pipelines available here.

    0 讨论(0)
提交回复
热议问题