Natural Language Processing: Find obscenities in English?

后端 未结 11 1270
自闭症患者
自闭症患者 2021-02-09 21:15

Given a set of words tagged for part of speech, I want to find those that are obscenities in mainstream English. How might I do this? Should I just make a huge list, and check f

11条回答
  •  南笙
    南笙 (楼主)
    2021-02-09 22:13

    I would advocate a large list of simple regex's. Smaller than a list of the variants, but not trying to capture anything more than letter alternatives in any given expression: like "f[u_-@#$%^&*.]ck".

提交回复
热议问题