Given a set of words tagged for part of speech, I want to find those that are obscenities in mainstream English. How might I do this? Should I just make a huge list, and check f
It depends on what your text source is, but I'd go for an established, proven pattern-matching approach: for example, store the list of words in a trie and look up each word from your text in it.
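To make the trie idea concrete, here is a minimal sketch in Python. It assumes your input is a list of `(word, tag)` pairs and that matching should be case-insensitive. The names `Trie`, `build_trie`, `find_flagged` and `BLOCKLIST` are made up for illustration, and the two list entries are mild placeholders standing in for whatever word list you end up using.

```python
class Trie:
    """A minimal character trie for exact word lookup."""

    def __init__(self):
        self.children = {}     # maps a character to the child node
        self.terminal = False  # True if a stored word ends at this node

    def insert(self, word):
        node = self
        for ch in word:
            node = node.children.setdefault(ch, Trie())
        node.terminal = True

    def __contains__(self, word):
        node = self
        for ch in word:
            node = node.children.get(ch)
            if node is None:
                return False
        return node.terminal


def build_trie(words):
    """Build a trie from an iterable of words, lowercased."""
    trie = Trie()
    for w in words:
        trie.insert(w.lower())
    return trie


def find_flagged(tagged_words, trie):
    """Return the (word, tag) pairs whose word is stored in the trie."""
    return [(w, t) for (w, t) in tagged_words if w.lower() in trie]


# Hypothetical placeholder list; substitute your real word list.
BLOCKLIST = ["darn", "heck"]

tagged = [("Darn", "UH"), ("the", "DT"), ("heckle", "VB"), ("heck", "NN")]
print(find_flagged(tagged, build_trie(BLOCKLIST)))
# prints [('Darn', 'UH'), ('heck', 'NN')]
```

Note that "heckle" is not flagged, because the lookup only succeeds when a stored word ends exactly at the last character. For plain whole-word lookup like this, a Python `set` would give the same results; the trie is mainly useful if you later want prefix matching or want to scan running text character by character.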