I have a forum on a website I run, which gets a daily dose of porn spam. Currently I delete the spam and block the IP, but this does not work very well. The list of blocked IPs is growing quickly, but so is the number of spam posts in the forum.
The forum is entirely my own code. It is built in PHP and MySQL.
What are some concrete ways of stopping the spam?
The thing I forgot to mention is that the forum needs to be open for unregistered users to post. Kinda like a blog comment.
In a guestbook app I wrote, I implemented two features which prevent most of the spam:
- Don’t allow POST as the first request in a session
- Require a valid HTTP Refer(r)er when posting
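A minimal sketch of those two checks, assuming the page that renders the form sets a session flag first. The function name `is_post_allowed`, the `form_seen` key, and the `$expectedHost` parameter are hypothetical, not taken from the guestbook app described above.

```php
<?php
// Hypothetical helper: decides whether an incoming POST should be accepted.
// $session is the session data (normally $_SESSION), $server is $_SERVER.
function is_post_allowed(array $session, array $server, string $expectedHost): bool
{
    // Rule 1: a POST must not be the first request in the session.
    // The form page is assumed to have set $_SESSION['form_seen'] = true.
    if (empty($session['form_seen'])) {
        return false;
    }

    // Rule 2: the Referer header must be present and point at our own host.
    $referer = $server['HTTP_REFERER'] ?? '';
    $host = parse_url($referer, PHP_URL_HOST);

    return is_string($host) && strcasecmp($host, $expectedHost) === 0;
}
```

On the form page you would call session_start() and set the flag; on the handler you would call is_post_allowed($_SESSION, $_SERVER, 'example.com') before saving. Some privacy tools strip the Referer header, so the second rule may reject a few legitimate visitors.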
In my experience, the best easy defenses come from just doing something “non-standard”. If you make your site non-standard, this makes it so that any automated spam would have to be coded specifically for your site, which (no offense) probably isn’t worth the effort. Note that if the spam is coming from human spammers, there’s not really anything you can do that won’t also stop legitimate posters. So the goal is to find a solution that will throw away any “standard” posts – that is, “fill out the whole form and push submit”.
A couple examples that come to mind of things that you could try:
- Have a hidden form field with a name that sounds like something a spammer would want to fill out, like “website” or “homepage” or something like that. If the form field gets filled out, throw away the message instead of posting it, because it was a bot automatically filling in the whole form, even invisible fields.
- You don’t have to use a “real” captcha, but even something simple like “Enter the following word backwards: <random backwards word>” or “What is the domain name of this website?”. Easy for a human to do, but it would require a fairly complex bot to figure out what to fill in.
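The hidden-field idea can be sketched roughly like this. The field name "website" comes from the suggestion above; the function name `honeypot_triggered` and the CSS class in the comment are assumptions for illustration.

```php
<?php
// The form is assumed to contain a field that humans never see, e.g.
//   <input type="text" name="website" class="hp" autocomplete="off">
// hidden with CSS such as: .hp { position: absolute; left: -9999px; }

// Hypothetical helper: returns true when the hidden field was filled in.
function honeypot_triggered(array $post, string $field = 'website'): bool
{
    $value = $post[$field] ?? '';

    // Anything other than an empty (or whitespace-only) string suggests a bot.
    return !is_string($value) || trim($value) !== '';
}
```

In the handler, something like `if (honeypot_triggered($_POST)) { /* silently discard */ }` would throw the message away without telling the bot why.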
You might want to look at this question, which has several answers that describe how you could implement a non-intrusive captcha.
Another thing to consider is to require time between posts to prevent massive spamming.
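One possible way to enforce a delay, assuming you record the time of each visitor's last post somewhere (in the session, or in MySQL keyed by IP). The function name and the 30-second default are hypothetical.

```php
<?php
// Hypothetical helper: $lastPostAt is the Unix timestamp of the visitor's
// previous post, or null if they have not posted yet.
function may_post_again(?int $lastPostAt, int $now, int $minIntervalSeconds = 30): bool
{
    if ($lastPostAt === null) {
        return true; // first post from this visitor
    }

    // Allow the post only once the minimum interval has elapsed.
    return ($now - $lastPostAt) >= $minIntervalSeconds;
}
```

A handler could call may_post_again($_SESSION['last_post_at'] ?? null, time()) and store time() back after a successful post.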
Include a CAPTCHA whose answer is always “orange”.
The spam may come from bots or humans; bots are more likely.
Don’t bother blacklisting IP addresses or using third-party blacklists. That will just generate false positives, because almost all bots use the same IP addresses as (some) legitimate users.
Another trick is to put in a text field with a plausible sounding name, which is made difficult to see with CSS – anyone filling this field in with anything is considered to be a bot.
You can try your luck with a non-standard form:
- fields that must stay empty, hidden with CSS
- fields with misleading names, e.g. <input name=email> for something that is not an e-mail.
For me CAPTCHA is like giving up to spammers and letting them damage your forum anyway – except that instead of spam damage, you get usability and accessibility damage.
Something I’ve found to be surprisingly effective: disallow comments that contain too many URLs (more than, say, 5). Since doing that, I’ve had zero comment spam.
Edit: Since writing the above, I’ve had recurring comment spam with only one link. I have since added some honeypot fields and have had no comment spam for a few months.
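A rough sketch of the URL-count check, assuming a link is recognisable by an "http://", "https://" or bare "www." prefix. The function names are hypothetical, and the limit of 5 is the figure mentioned above.

```php
<?php
// Hypothetical helper: counts things that look like URLs in a comment.
function count_urls(string $text): int
{
    // "http://" or "https://" anywhere, or a "www." that does not directly
    // follow "/", "." or a word character (so http://www.x counts once).
    return (int) preg_match_all('~https?://|(?<![/.\w])www\.~i', $text);
}

// Hypothetical helper: true when the comment exceeds the allowed number of URLs.
function has_too_many_urls(string $text, int $maxUrls = 5): bool
{
    return count_urls($text) > $maxUrls;
}
```

As the edit above notes, spam with a single link gets through a check like this, so it probably works best alongside a honeypot field.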
Don’t let anybody post until they respond to an email sent to their registered email address. You’ll see lots of forums and mailing lists generate a unique email address or web url that is sent to the new user’s given email address, and they have to respond to the email or click on the link to finalize their registration.
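A minimal sketch of the confirmation link, assuming the token is stored next to the pending user and compared when the link is clicked. All function names and the `token` query parameter are hypothetical; sending the mail and the storage itself are left out.

```php
<?php
// Hypothetical helper: creates an unguessable token (32 hex characters).
function new_confirmation_token(): string
{
    return bin2hex(random_bytes(16));
}

// Hypothetical helper: builds the link that goes into the email.
function confirmation_url(string $baseUrl, string $token): string
{
    return $baseUrl . '?token=' . urlencode($token);
}

// Hypothetical helper: compares the stored token with the one from the link.
function token_matches(string $stored, string $supplied): bool
{
    return hash_equals($stored, $supplied); // timing-safe comparison
}
```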
Captcha is definitely the easiest method – try KittenAuth if you want something bot-proof (Although I got pandas this time)
There is no single answer, since spam is really a matter of economics: how much is it worth to someone to put their stuff onto the web? There are, however, some solutions that seem pretty good:
- Use CSS to create an invisible field that robots fill in
- Create a time-specific hidden field in your form so the robot can’t use the same form over and over again.
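The time-specific hidden field could be sketched as a signed timestamp, assuming a server-side secret. The function names, the `$secret` parameter, and the minimum and maximum ages are assumptions for illustration; the minimum age (rejecting forms submitted implausibly fast) is an extra not mentioned above.

```php
<?php
// Hypothetical helper: value for a hidden form field, "<timestamp>:<hmac>".
function make_form_token(string $secret, int $issuedAt): string
{
    return $issuedAt . ':' . hash_hmac('sha256', (string) $issuedAt, $secret);
}

// Hypothetical helper: accepts the token only if it is untampered and its
// age lies between $minAge and $maxAge seconds.
function form_token_valid(string $token, string $secret, int $now, int $minAge = 3, int $maxAge = 3600): bool
{
    $parts = explode(':', $token, 2);
    if (count($parts) !== 2 || !ctype_digit($parts[0])) {
        return false;
    }
    [$issuedAt, $mac] = $parts;

    // Recompute the signature and compare in constant time.
    if (!hash_equals(hash_hmac('sha256', $issuedAt, $secret), $mac)) {
        return false;
    }

    $age = $now - (int) $issuedAt;
    return $age >= $minAge && $age <= $maxAge;
}
```

A token like this expires but can still be replayed inside its time window; making it strictly one-time would need server-side storage of used tokens.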
I’d say that most of the time, a CAPTCHA is enough to keep spammers out.
But do use a strong one, like http://www.captcha.net/.
Remember that spammers do not want to spend much time dealing with a particular site (except heavy-traffic sites); they use a tool to post ads on a lot of sites. So make your form a little unusual (e.g. show the user an image that says ‘1.5+2.4=?’ and ask for the answer). This will block most of the spam tools 🙂
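A simplified sketch of such an arithmetic question, assuming whole numbers and plain text instead of an image (rendering the question as an image is left out). The function names and array keys are hypothetical.

```php
<?php
// Hypothetical helper: builds a question and its expected answer.
function make_challenge(int $a, int $b): array
{
    return ['question' => "$a + $b = ?", 'answer' => $a + $b];
}

// Hypothetical helper: checks the visitor's reply against the stored answer.
function challenge_passed(array $challenge, string $reply): bool
{
    $reply = trim($reply);

    // Only plain digits are accepted, then compared as integers.
    return ctype_digit($reply) && (int) $reply === $challenge['answer'];
}
```

In practice you might call make_challenge(random_int(1, 9), random_int(1, 9)), show the question in the form, and keep the answer in $_SESSION until the post arrives.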
The easiest thing I’ve done to stop spammers with (so far) 100% consistency is to validate the submitted text. If you use the PHP function strstr() to check for “a href”, or even a non-clickable “http” or “www”, you can then just reroute the spammer elsewhere. I also have a script that then writes to my .htaccess file to deny the offending IP address. I’m not sure if there’s any other kind of spam to be concerned about, but links are all I’ve seen so far.
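A minimal sketch of that text check. It uses stripos() rather than strstr() so that “A HREF” and “HTTP” are caught regardless of case; that substitution and the function name `contains_link` are my assumptions, not the original script.

```php
<?php
// Hypothetical helper: true when the text contains anything link-like.
function contains_link(string $text): bool
{
    foreach (['a href', 'http', 'www'] as $needle) {
        // stripos() is the case-insensitive counterpart of strpos().
        if (stripos($text, $needle) !== false) {
            return true;
        }
    }
    return false;
}
```

A check this broad also rejects legitimate posts that mention a link, so it only suits forums where visitors have no need to post URLs.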