I have a text file. I need to get a list of sentences.
How can this be implemented? There are a lot of subtleties, such as a dot being used in abbreviations.
You can try using Spacy instead of regex. I use it and it does the job.
import spacy nlp = spacy.load('en') text = '''Your text here''' tokens = nlp(text) for sent in tokens.sents: print(sent.string.strip())