Problematic Random Forest training runtime when using formula interface

前端 未结 2 1065
挽巷
挽巷 2021-02-09 01:50

Running the Random Forest example from http://www.kaggle.com/c/icdar2013-gender-prediction-from-handwriting/data, the following line:

forest_model <- randomFo         


        
2条回答
  •  庸人自扰
    2021-02-09 02:19

    Found the problem, using formula in randomForest has created a tremendous performance degradation.

    More on this and how to estimate random forest running time can found in: https://stats.stackexchange.com/questions/37370/random-forest-computing-time-in-r and in http://www.gregorypark.org/?p=286

    Here is final code:

    forest_model <- randomForest(y=train$male, x=train[,-2], ntree=10000,do.trace=T)
    

提交回复
热议问题