CountVectorizer: “I” not showing up in vectorized text


长情又很酷 2021-02-04 10:02

I'm new to scikit-learn, and currently studying Naïve Bayes (Multinomial). Right now, I'm working on vectorizing text with CountVectorizer from sklearn.feature_extraction.text, and for some reason

2 Answers

孤城傲影 (OP)

2021-02-04 10:51
This is because CountVectorizer converts all text to lowercase by default (lowercase=True), so capital letters are not preserved.

To keep the original case, use:
```
from sklearn.feature_extraction.text import CountVectorizer
vectorizer_train = CountVectorizer(min_df=0, lowercase=False)
```
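One way to check this suggestion is the minimal sketch below. The two sample sentences are made up for illustration and are not from the question. Note that the default token_pattern, `(?u)\b\w\w+\b`, only keeps tokens of two or more word characters, so a one-letter token such as "I" is likely to be dropped even with lowercase=False. The second vectorizer widens the pattern to `(?u)\b\w+\b` as one possible adjustment.

```python
from sklearn.feature_extraction.text import CountVectorizer

# Hypothetical sample documents, not taken from the original question
docs = ["I like tea", "I like coffee"]

# Setting suggested in the answer: preserve case only
case_only = CountVectorizer(lowercase=False)
case_only.fit(docs)

# Assumed adjustment: also accept single-character tokens
single_char = CountVectorizer(lowercase=False, token_pattern=r"(?u)\b\w+\b")
single_char.fit(docs)

print(sorted(case_only.vocabulary_))    # ['coffee', 'like', 'tea']
print(sorted(single_char.vocabulary_))  # ['I', 'coffee', 'like', 'tea']
```

If "I" should be counted together with a lowercase "i", leaving lowercase at its default and changing only token_pattern may be enough.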