spam-prevention | 易学教程

How can I add items to collection.Counter? and then sort them into ASC?

阅读更多关于 How can I add items to collection.Counter? and then sort them into ASC?

问题 At the moment I'm trying to process lingspam dataset by counting the occurance of words in 600 files (400 emails and 200 spam emails). I've already made each word universal with the Porter Stemmer Aglorithm, I would also like for my result to be standardized across each file for further processing. But I'm unsure on how I can accomplish this.. Resources thus far 8.3. collections — Container datatypes How to count co-ocurrences with collections.Counter() in python? Bag of Words model In order

Simple techniques for preventing spamming of a web chat application

阅读更多关于 Simple techniques for preventing spamming of a web chat application

问题 I have a simple, custom rolled chat here: ( http://ninjawars.net - essentially: ajax chat, php backend, javascript listing of chat messages, logged-in user input only ) that suffers from being able to be spammed. What are some simple systems to prevent spamming of a chat? One thing (lowest level of protection) that I have already implemented: Ignore consecutive duplicate messages from the same user. Other ideas that I have: Add consecutive messages from the same user together, instead of

Blocking bad requests with .htaccess file

阅读更多关于 Blocking bad requests with .htaccess file

问题 I have a site that is getting hit by users hitting variations of this URL /index.php?s= I am trying to block all requests to the following string which should get rid of all the spammers php?s= The problem I seem to be having is the ? and = in the htaccess file are protected characters and I just can't get the syntax correct. How can I use php?s= in the following scenarios? I have tried <IfModule mod_rewrite.c> RewriteEngine On RewriteCond %{REQUEST_URI} ^.(*php?s=).* [NC] RewriteRule ^(.*)$

Apache shutting down unexpectedly

阅读更多关于 Apache shutting down unexpectedly

问题 I have a mongrel server running behind Apache. It works fine; however, every now and then the Apache server shuts downs seemingly by itself. I'm not sure if there is configuration issue or if it's an attack. Here is Apache error log: [Thu Apr 30 02:15:07 2009] [notice] SIGHUP received. Attempting to restart [Thu Apr 30 02:15:07 2009] [warn] NameVirtualHost *:0 has no VirtualHosts [Thu Apr 30 02:15:07 2009] [notice] Apache/2.2.3 (Debian) PHP/5.2.0-8+etch13 configured -- resuming normal

Negative Captchas - help me understand spam bots better

阅读更多关于 Negative Captchas - help me understand spam bots better

问题 I have to decide a technique to prevent spam bots from registering my site. In this question I am mainly asking about negative captchas. I came to know about many weaknesses of bots but want to know more. I read somewhere that majority of bots do not render/support javascript. Why is it so? How do I test that the visiting program can't evaluate javascript? I started with this question Need suggestions/ideas for easy-to-use but secure captchas Please answer to that question if you have some

Php: Check if email content is spam

阅读更多关于 Php: Check if email content is spam

问题 I'm creating a newsletter feature that will allow users to send emails. Since there are malicious people out there who would want to send spam, I'm wanting to be able to check and see if the message created is spam or not. I've looked at a couple different methods like trying spam assassin but you need the full email which I won't have until later. Or you need to install some other utilities like spamd, but I'm looking for a php class that does this for me anyone know of anything? Or am i

How SpamAssassin defines BAYES_99 ? How to avoid it?

阅读更多关于 How SpamAssassin defines BAYES_99 ? How to avoid it?

问题 SpamAssassin marks my email with reason BAYES_99 , which means the email is 99% ~ 100% spam email. What is the mechanism of SpamAssassin to detect this rule (by keyword?), and how can I avoid hitting this rule? 回答1: BAYES_99 is a statistical component of SpamAssasin. It is using some sort/variation of Naive Bayes which has (to keep it simple) some sort of probability saying that some word/token is likely to be spam or not to be spam. In "math" these could be expressed like this: P(Class_Spam

Does using CSRF form tokens help spam prevention?

阅读更多关于 Does using CSRF form tokens help spam prevention?

问题 I am trying to stop bots from (potentially) submitting fake data to my php registration file. I am creating a site that uses a signup/login system and I want to add email verification which I am capable of. However the problem is my webhost only allows x amount of emails per minute, if a bot were to spam this not only will my database be filled with spam accounts but I will also be suspended for breaking the email limit. I have been reading up about securing forms and CSRF came up, a term I

X-Message-Delivery in Hotmail Messages

阅读更多关于 X-Message-Delivery in Hotmail Messages

问题 Recently I stared deciphering the hotmail inbox/junk folder placement. After long searching I found out that Hotmail/Outlook is using their X-Message-Delivery tag they input in each mail, to determine alot of things. If you go to source of an e-mail you can see an X-Message-Delivery similar to this : Vj0xLjE7dXM9MDtsPTE7YT0wO0Q9MjtHRD0yO1NDTD00 which if you BASE64-Decode it you get V=1.1;us=0;l=1;a=0;D=2;GD=2;SCL=4 Based upon my research, I found out that following things are causing a

Malicious crawler blocker for ASP.NET

阅读更多关于 Malicious crawler blocker for ASP.NET

问题 I have just stumbled upon Bad Behavior - a plugin for PHP that promises to detect spam and malicious crawlers by preventing them from accessing the site at all. Does something similar exist for ASP.NET and ASP.NET MVC? I am interested in blocking access to the site altogether, not in detecting spam after it was posted. EDIT: I am interested specifically in solutions that will detect access patterns to the site - these would prevent screen scraping the site as a whole, or at least make it a