发表新帖

发表新帖

How to include words as numerical feature in classification

前端未结

关注

 3  752

眼角桃花 2021-02-06 11:40

Whats the best method to use the words itself as the features in any machine learning algorithm ?

The problem I have to extract word related feature from a particular p

3条回答

别那么骄傲 (楼主)

2021-02-06 12:41

Standard approach is the "bag-of-words" representation where you have one feature per word, giving "1" if the word occurs in the document and "0" if it doesn't occur.

This gives lots of features, but if you have a simple learner like Naive Bayes, that's still OK.

"Index in the dictionary" is a useless feature, I wouldn't use it.

0 讨论(0)

查看其它3个回答
发布评论:

提交评论
- 加载中...

热议问题