Google's Sly Battle on Comment Spam and Our Respective Policy #
It’s great to hear that Google has started a simple solidarity against web spam. The gist of it is:
<a rel="nofollow" href="http://spamjockey.com/">Spam Jockey!!</a>
You add the rel="nofollow"
to external links in your Wiki or blog. Google (and other cooperating search engines) ignore those links. In time, spammers will get the picture and leave Wikis and blogs alone. It’s a noble effort and I think it’s worthwhile.
For now, I’m not going to touch this, though. I want comments here to be weighed in Google’s search results. Our topic is just so specific and our readership rather small in contrast to so many sites.
When spam becomes a problem, though, external links here will be tagged with rel="nofollow"
. I’d like to couple it with a technique for approving links.
Jamis
For what it’s worth, I had an awful problem with comment spam when I was using MovableType. However, since moving to Rublog and using my own custom comment engine—with the poor-man’s captcha built into it—I get maybe one or two spam comments per month.
Don’t know if a captcha is more trouble than you’d like to bother with, though. Plus, I’m sure you have a much larger readership than I do, so you may be targetted more frequently by spammers.
sleeper
.. I’d like to couple it with a technique for approving links ..
Is this where the Bayesian Classifier you mentionned before could help ?
Speck
There was an interesting outlook on nofollow in a kuro5hin article recently.
why
Well, it’s like the guy on kuro5hin said, the
nofollow
stuff only kills the spammers who want PageRank.That’s why I’m saying, I’d probably have a process where:
nofollow
.nofollow
.The approval process would be very simple. A page with the unapproved comments listed. Check a box if it’s spam. Submit the page and the comments are all processed based on the checkbox states.
sleeper
Should the comment fail a bayesian filter,
You gave me the idea to work (for my pleasure) on a bayesian filter .. I’m not sure it will be as powerfull as already existing versions (in other languages though), but it’s very interesting to work on this kind of problem ;) Thnaks for this _why :)
daigo
How about bsfilter? This is targeted at mails, though.
sleeper
I wasn’t aware of bsfilter … I will have a look at it … In fact as it’s just a kind of ‘toy project’ for me, it can be interesting to see other real project implementations !
Comments are closed for this entry.