How do I remove duplicate words from a list in python without using sets?

后端未结

关注

 8  1638

渐次进展 2021-02-11 04:24

I have the following python code which almost works for me (I\'m SO close!). I have text file from one Shakespeare\'s plays that I\'m opening: Original text file:

\"Bu

8条回答

深忆病人 (楼主)

2021-02-11 04:53

Use plain old lists. Almost certainly not as efficient as Counter.

fname = raw_input("Enter file name: ")  

Words = []
with open(fname) as fhand:
    for line in fhand:
        line = line.strip()
        # lines probably not needed
        #if line.startswith('"'):
        #    line = line[1:]
        #if line.endswith('"'):
        #    line = line[:-1]
        Words.extend(line.split())

UniqueWords = []
for word in Words:
    if word.lower() not in UniqueWords:
        UniqueWords.append(word.lower())

print Words
UniqueWords.sort()
print UniqueWords

This always checks against the lowercase version of the word, to ensure the same word but in a different case configuration is not counted as 2 different words.

I added checks to remove the double quotes at the start and end of the file, but if they are not present in the actual file. These lines could be disregarded.

0 讨论(0)

查看其它8个回答