I need the most exhaustive English word list I can find for several types of language processing operations, but I could not find anything on the internet that has good enough q
There aren't too many base words(171k according to this- oxford. Which is what I remember being told in my CS program in college. But if include all forms of the words- then it rises considerably.
That said, why not make one yourself? Get a Wikipedia dump and parse it and create a set of all tokens you encounter.
Expect misspellings though- like all things crowd-sources there will be errors.