SoC improve the Spam-X plugin

From GeeklogWiki
Revision as of 09:26, 6 March 2010 by Dirk (talk | contribs) (typos and minor corrections)

Jump to: navigation, search
(This is an idea page for the Google Summer of Code)

Introduction

Comment spam doesn't need an introduction - pretty much every site gets it. Geeklog ships with its own spam filter, called Spam-X. This filter can easily be extended by adding modules so that it can either be updated for the spammer's latest tricks or to add support for new anti-spam services. This project is about the latter - creating modules for new services.


Incentive

Geeklog currently ships with a Spam-X module for LinkSleeve (aka SLV). At the time, this was the only free service available that didn't require creating an account, so that it is usable "out of the box".

Over time, more anti-spam services have appeared. One goal of this project is to evaluate these services. And to be able to do that, we would need Spam-X modules to support these services.

The second half of this project is then about creating a new anti-spam service (see details below) and compare it with the existing services.


Part 1: New modules for existing services

Services

Here's a quick rundown of some existing anti-spam services:

Akismet

Akismet is associated with the WordPress blog platform. The service initially required a wordpress.com account, which made it not suitable for use in Geeklog (asking our users to sign up with a competitor's website would have looked odd). This requirement has since been dropped: You still need to sign up but can do so now from the Akismet homepage. The service is free (commercial options available).

There is already an older version of an Akismet module for Spam-X. It will probably need a review to check for API changes.

Defensio

Defensio is a service owned by security firm Websense. The service requires signup and is free "for all personal bloggers" (commercial options available).

Mollom

Mollom is loosely associated with the Drupal CMS. It requires signup and offers both free and for-pay services.

One specialty of Mollom is that it has an "unsure" categorization for posts where it's not quite sure yet whether the post is spam or not. In this case, it displays a CAPTCHA, so that the poster will have to confirm that they are human.

TypePad AntiSpam

TypePad AntiSpam is, as the name implies, associated with the TypePad CMS. It is currently (still) in beta. The service requires signup and is free ("and will always be free" -- quote from the website).

Stop Forum Spam

Stop Forum Spam is a private (one-man?) project. It doesn not require signup and is free.

The API has an option to check for a poster's user name, which at first glance doesn't seem like a reliable criterion to detect spam (the API has other options as well). Michael Hampton (author of Bad Behavior) also expressed some doubts about the service.

Discussion

(more to come)

Part 2: SWOT

TBD

Level of Difficulty

easy to medium

Implementing modules for the existing services should be relatively straightforward. Some more thought and work will be required for the SWOT implementation.


Further Reading

TBD