The "sought" ruleset

Our spamtrap network collects hundreds of megabytes of spam per day. Wouldn't it be great if there were a way to feed that directly into a script that automatically extracts rules?

This is now possible, and the result is the "sought.cf" ruleset: an automatically generated ruleset that seeks out good rules directly from the SpamAssassin spamtraps and is updated every 4 hours.

[http://taint.org/2007/08/15/004348a.html Here are instructions on how to use it].

Gory Details

If you're curious, [http://taint.org/2007/03/05/134447a.html here is a technical explanation of the algorithm used], and [http://taint.org/2007/08/04/200125a.html here is an examination of the generated rules' efficiency against our test corpora].
