I have two sentences containing duplicate words. For example, here is the input data in the file my_text.txt:
my_text.txt
The Unix and Linux operating system.