How can I split a text into sentences?

傲寒 2020-11-22 06:33

I have a text file. I need to get a list of sentences.

How can this be implemented? There are a lot of subtleties, such as a dot being used in abbreviations.

13 answers

既然无缘 (original poster)

2020-11-22 06:39
You can try spaCy instead of a regex. I use it, and it does the job.
```
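import spacy

# spaCy 3.x loads models by full package name; the old 'en' shortcut was removed.
# Install the model first: python -m spacy download en_core_web_sm
nlp = spacy.load('en_core_web_sm')

text = '''Your text here'''
doc = nlp(text)

# doc.sents yields one Span per detected sentence
for sent in doc.sents:
    print(sent.text.strip())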
```
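If installing spaCy is not an option, here is a rough stdlib-only sketch of the abbreviation problem the question mentions. It is not how spaCy does it, just a regex plus a lookup. The `ABBREVIATIONS` set and the `split_sentences` helper are made up for illustration, and the list is deliberately tiny, so extend it for your own text.

```python
import re

# Hypothetical, deliberately small abbreviation list (lowercase, no trailing dot).
ABBREVIATIONS = {"mr", "mrs", "ms", "dr", "prof", "etc", "e.g", "i.e", "vs"}

# Candidate boundary: one or more of . ! ? followed by whitespace.
_BOUNDARY = re.compile(r"[.!?]+\s+")


def split_sentences(text):
    """Split text at ., ! or ? unless the dot ends a known abbreviation."""
    sentences = []
    start = 0
    for match in _BOUNDARY.finditer(text):
        # Word sitting immediately before the punctuation.
        words = text[start:match.start()].split()
        last_word = words[-1].lower() if words else ""
        # A single dot after a known abbreviation is not a sentence end.
        if match.group().strip() == "." and last_word in ABBREVIATIONS:
            continue
        sentences.append(text[start:match.end()].strip())
        start = match.end()
    # Whatever follows the last boundary is the final sentence.
    tail = text[start:].strip()
    if tail:
        sentences.append(tail)
    return sentences


if __name__ == "__main__":
    sample = "Dr. Smith met Mr. Jones. They talked, e.g. about NLP! Was it useful?"
    for sentence in split_sentences(sample):
        print(sentence)
```

Known limitations: an abbreviation that really does end a sentence ("... apples, pears, etc. Then we left.") is not split, and anything missing from the list (such as "p.m.") is split wrongly. That is the main reason a trained model tends to work better on real text.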