Currently best spam filter algorithm

暖寄归人 2021-01-01 03:24

What is currently the best method for detecting spam, especially in mobile text messages (SMS)? Are there any resources or comparative analyses?

4 answers
  •  孤城傲影
    2021-01-01 04:20

    Supervised learning techniques are a good place to start. A number of studies have used the Multinomial Naive Bayes classifier for email spam filtering with considerable success, and since it works for email, it should carry over to SMS filtering as well. What you need is a large dataset of labeled example messages, both spam and legitimate, to train the classifier on.

    It may also be worth looking into the Support Vector Machine, which, although less widely used in spam filtering, is often a more powerful technique.
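
    To make the SVM suggestion concrete, here is a minimal sketch of a linear SVM trained by subgradient descent on the regularized hinge loss over word counts. The function names, the hyperparameters, and the toy messages are all made up for illustration, and the bias term is left out to keep it short. For real use a library implementation would be the better choice.

```python
import re


def tokenize(text):
    """Lowercase the text and split it into simple word tokens."""
    return re.findall(r"[a-z0-9']+", text.lower())


def train_linear_svm(texts, labels, epochs=20, learning_rate=0.1, reg=0.01):
    """Train bias-free linear SVM weights; labels are +1 (spam) or -1 (ham).

    Hyperparameter values are illustrative assumptions, not tuned settings.
    """
    weights = {}
    for _ in range(epochs):
        for text, y in zip(texts, labels):
            tokens = tokenize(text)
            margin = y * sum(weights.get(t, 0.0) for t in tokens)
            # L2 regularization: shrink every weight a little on each step.
            for t in weights:
                weights[t] *= 1.0 - learning_rate * reg
            # Hinge loss: only examples inside the margin push the weights.
            if margin < 1.0:
                for t in tokens:
                    weights[t] = weights.get(t, 0.0) + learning_rate * y
    return weights


def predict(weights, text):
    """Classify by the sign of the linear score; unseen words contribute 0."""
    score = sum(weights.get(t, 0.0) for t in tokenize(text))
    return "spam" if score > 0 else "ham"


# Toy, made-up training data (assumption: real use needs a large labeled set).
TEXTS = [
    "WIN a FREE prize now, text CLAIM to 12345",
    "URGENT! You have won cash, call now to claim",
    "Free entry in our weekly prize draw, reply WIN",
    "Are we still meeting for lunch today?",
    "I'll call you when I get home tonight",
    "Can you send me the notes from class?",
]
LABELS = [1, 1, 1, -1, -1, -1]

svm_weights = train_linear_svm(TEXTS, LABELS)
```

    Calling `predict(svm_weights, "free prize, claim now")` classifies a new message. Words that never appeared in training are simply ignored.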

    Training the algorithms on raw text alone may also not be the best way forward. A 1998 study by Mehran Sahami and colleagues found that performance improved when other heuristics were taken into account (e.g. was the email sent to a mailing list? did the sender's domain name end in ".edu", ".com" or ".org"? did the email contain repeated punctuation marks such as "!!!"?).
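
    As a rough sketch of what such heuristics could look like in code, the function below turns a message into a few hand-crafted features that can be fed to a classifier alongside the word counts. The function name, its parameters, and the feature names are hypothetical. The uppercase ratio is my own addition and is not one of the heuristics listed above.

```python
import re

# Domain endings named in the heuristics above.
KNOWN_TLDS = (".edu", ".com", ".org")


def extract_features(body, sender_domain="", sent_to_list=False):
    """Return a dict of hand-crafted features for one message.

    `sender_domain` and `sent_to_list` are assumed to be supplied by the
    caller, e.g. parsed from the message headers.
    """
    domain = sender_domain.lower()
    features = {
        # Two or more consecutive "!" or "?" characters, e.g. "!!!".
        "repeated_punctuation": bool(re.search(r"[!?]{2,}", body)),
        "sent_to_mailing_list": bool(sent_to_list),
    }
    for tld in KNOWN_TLDS:
        features["sender_tld" + tld] = domain.endswith(tld)
    # Assumption (not from the study): share of letters that are uppercase.
    letters = [c for c in body if c.isalpha()]
    features["uppercase_ratio"] = (
        sum(c.isupper() for c in letters) / len(letters) if letters else 0.0
    )
    return features
```

    For SMS, the domain and mailing-list features would need analogues that fit text messages. Which ones actually help is something to test on your own data.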

    But start with the Multinomial Naive Bayes classifier. It is simple to implement and easy to use, and in my experience it also trains very quickly.
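
    To give an idea of how little code is involved, here is a minimal sketch of a Multinomial Naive Bayes classifier with Laplace smoothing, written in plain Python. The class name, the tokenizer, and the tiny training set are made up for illustration. A real filter would need a large labeled SMS corpus.

```python
import math
import re
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase the text and split it into simple word tokens."""
    return re.findall(r"[a-z0-9']+", text.lower())


class MultinomialNaiveBayes:
    def __init__(self, alpha=1.0):
        self.alpha = alpha                       # Laplace smoothing constant
        self.class_counts = Counter()            # messages seen per class
        self.word_counts = defaultdict(Counter)  # word counts per class
        self.vocabulary = set()

    def fit(self, texts, labels):
        """Count classes and words; training is a single pass over the data."""
        for text, label in zip(texts, labels):
            self.class_counts[label] += 1
            for token in tokenize(text):
                self.word_counts[label][token] += 1
                self.vocabulary.add(token)
        return self

    def log_scores(self, text):
        """Return log P(class) + sum of log P(word | class) for each class."""
        total_docs = sum(self.class_counts.values())
        vocab_size = len(self.vocabulary)
        tokens = [t for t in tokenize(text) if t in self.vocabulary]
        scores = {}
        for label, doc_count in self.class_counts.items():
            score = math.log(doc_count / total_docs)
            total_words = sum(self.word_counts[label].values())
            denominator = total_words + self.alpha * vocab_size
            for token in tokens:
                count = self.word_counts[label][token]
                score += math.log((count + self.alpha) / denominator)
            scores[label] = score
        return scores

    def predict(self, text):
        scores = self.log_scores(text)
        return max(scores, key=scores.get)


# Toy, made-up training data (assumption: real use needs far more examples).
TRAIN = [
    ("WIN a FREE prize now, text CLAIM to 12345", "spam"),
    ("URGENT! You have won cash, call now to claim", "spam"),
    ("Free entry in our weekly prize draw, reply WIN", "spam"),
    ("Are we still meeting for lunch today?", "ham"),
    ("I'll call you when I get home tonight", "ham"),
    ("Can you send me the notes from class?", "ham"),
]

model = MultinomialNaiveBayes().fit(*zip(*TRAIN))
```

    After training, `model.predict("Claim your free prize now")` returns the more likely label. Words never seen in training are skipped, and the smoothing constant keeps a word seen in only one class from zeroing out the other class's probability.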
