Splitting sentences with nltk while preserving quotes

后端未结

关注

 2  1601

囚心锁ツ 2021-02-15 13:50

I am using nltk to split a text into sentence units. However, I need the sentences that contain quotes to be extracted as a single unit. Right now each sentence, even if it is w

2条回答

再見小時候 (楼主)

2021-02-15 14:34
Just change your print statement to this:
```
print ' '.join(tokenizer.tokenize(text, realign_boundaries=True))
```
This will join the sentences with a space instead of \n-----\n.
0 讨论(0)

查看其它2个回答
发布评论:

提交评论
- 加载中...