How to split Text into paragraphs using NLTK nltk.tokenize.texttiling?
问题 I found this Split Text into paragraphs NLTK - usage of nltk.tokenize.texttiling? explaining how to feed a text into texttiling, however I am unable to actually return a text tokenized by paragraph / topic change as shown here under texttiling http://www.nltk.org/api/nltk.tokenize.html. When I feed my text into texttiling, I get the same untokenized text back, but as a list, which is of no use to me. tt = nltk.tokenize.texttiling.TextTilingTokenizer(w=20, k=10,similarity_method=0, stopwords