Python stemmer issue: wrong stem

后端 未结 1 1275
小鲜肉
小鲜肉 2021-01-29 10:58

Hi i\'m trying to stem words with a python stemmer, i tried Porter and Lancaster, but they have the same problem. They can\'t stem correclty words that end with \"er\" or \"e\".

1条回答
  •  深忆病人
    2021-01-29 11:27

    To quote the page on Wikipedia, In computational linguistics, a stem is the part of the word that never changes even when morphologically inflected, whilst a lemma is the base form of the word. For example, given the word "produced", its lemma (linguistics) is "produce", however the stem is "produc": this is because there are words such as production. So your code is likely giving you correct results. You seem to expect a lemma which is not what a stemmer produces (except when the lemma happens to equal the stem)

    0 讨论(0)
提交回复
热议问题