You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 24 Next »

Nightly Mass-Check Runs

What?

Nightly MassCheck runs are the way people submit data on the effectiveness of current rules on their recent spam and ham. It is used to generate the very rule scores that determine the effectiveness of SpamAssassin (distributed via sa-update), and to evaluate rules via the RuleQaApp. The accuracy of SpamAssassin is directly related to the number of people contributing to nightly MassChecks.

/*
Broken link:
(There's also an older, clunkier version of the analysis scripts running on DanielQuinlan's server; see http://www.pathname.com/~corpus .)
*/

How? (The Easiest Way)

If you rsync up your corpus to our server, as described in UploadedCorpora, it can be mass-checked there. Unfortunately you have to share your mail corpus with whoever might have access to that machine. It's not expected that anyone will ever actually look, but it's there nonetheless. If you are very concerned about privacy, you may be advised to strip out the more private mails before uploading, or mass-check on your own machine instead. (This is what I do --jm)

Details for PMC members on how to set up new accounts are at NewUploadedCorporaUser.

How? (most common, second easiest way)

It seems most people are using the auto-mass-check script.

How? (Less Easy, The Corpus-Nightly Script)

CorpusNightlyScript

How? (For Hackers, The DIY Version)

ManualNightlyMassCheck

  • No labels