nltk wordpunct_tokenize vs word_tokenize

前端 未结 2 1523
-上瘾入骨i
-上瘾入骨i 2021-02-05 09:07

Does anyone know the difference between nltk\'s wordpunct_tokenize and word_tokenize? I\'m using nltk=3.2.4 and there\'s noth

2条回答
  •  慢半拍i
    慢半拍i (楼主)
    2021-02-05 09:56

    Word_tokenize is for tokenizing a word in a sentence while wordpunct_tokenize is to remove the non-English words in a sentence.

提交回复
热议问题