发表新帖

发表新帖

How to balance classification using DecisionTreeClassifier?

前端未结

关注

 2  1056

失恋的感觉 2021-02-15 13:41

I have a data set where the classes are unbalanced. The classes are either 0, 1 or 2.

How can I calculate the prediction error fo

2条回答

生来不讨喜 (楼主)

2021-02-15 13:54
If the frequency of class A is 10% and the frequency of class B is 90%, then the class B will become the dominant class and your decision tree will become biased toward the classes that are dominant

In this case, you can pass a dic {A:9,B:1} to the model to specify the weight of each class, like
```
clf = tree.DecisionTreeClassifier(class_weight={A:9,B:1})
```
The class_weight='balanced' will also work, It just automatically adjusts weights according to the proportion of each class frequencies

After I use class_weight='balanced', the record number of each class has become the same (around 88923)
0 讨论(0)

查看其它2个回答
发布评论:

提交评论
- 加载中...

热议问题