发表新帖

发表新帖

What's the difference between predict_proba and decision_function in scikit-learn?

后端未结

关注

 2  469

余生分开走 2021-01-31 03:14

I\'m studying a scikit-learn example (Classifier comparison) and got confused with predict_proba and decision_function.

They plot the classific

2条回答

太阳男子 (楼主)

2021-01-31 03:30
Your example is
```
if hasattr(clf, "decision_function"):
    Z = clf.decision_function(np.c_[xx.ravel(), yy.ravel()])
else:
    Z = clf.predict_proba(np.c_[xx.ravel(), yy.ravel()])[:, 1]
```
so the code uses decision_function if it exists. On the SVM case, predict_proba is computed (in the binary case)

using Platt scaling

which is both "expensive" and has "theoretical issues". That's why decision_function is used here. (as @Ami said, this is the margin - the distance to the hyperplane, which is accessible without much further computation). In the SVM case, it is advised to

use decision_function instead of predict_proba.

There are other decision_functions: SGDClassifier's. Here, predict_proba depends on the loss function, and decision_function is universally available.
0 讨论(0)

查看其它2个回答
发布评论:

提交评论
- 加载中...

热议问题