Re: Bayesian filtering (Re: Suggestions for server side spam control)

From: Oded Arbel (oded-linux_at_nonexisting.hamakor.org.il)
Date: Thu 01 Jan 2004 - 12:15:20 IST


On Thursday 01 January 2004, 00:34, Gil Freund wrote:
> > occasionally scan users' inboxes by grepping for known keywords to
> > extract SPAM that they got and then feed it to the dictionary. I also
> > have some dummy accounts which exist for the sole purpose of attracting
> > SPAM.
>
> How do you feed it? I thought SA reads MBOX and Maildir formats only?

I don't use SA - I use bogofilter (see my previous message), which likes mbox
files (not Maildir, though) and can also read messages from STDIN.
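
To illustrate the STDIN route, here is a minimal Python sketch that pipes one
message into bogofilter for training. It assumes a `bogofilter` binary on the
PATH and uses its registration flags `-s` (spam) and `-n` (non-spam). The
helper names `build_command` and `register` are made up for this example and
are not part of bogofilter.

```python
import subprocess
from typing import Callable, List

# bogofilter registration flags: -s registers the input as spam,
# -n registers it as non-spam (ham).
FLAGS = {"spam": "-s", "ham": "-n"}


def build_command(kind: str, binary: str = "bogofilter") -> List[str]:
    """Return the argv used to train bogofilter on one message."""
    if kind not in FLAGS:
        raise ValueError("kind must be 'spam' or 'ham'")
    return [binary, FLAGS[kind]]


def register(message: bytes, kind: str,
             run: Callable = subprocess.run) -> None:
    """Feed a raw message to bogofilter on STDIN.

    `run` is injectable so the function can be exercised without
    bogofilter installed; by default it is subprocess.run.
    """
    run(build_command(kind), input=message, check=True)
```

Calling `register(raw_bytes, "spam")` would then train the dictionary on a
single message. Whether this matches any particular setup depends on how
bogofilter is configured locally.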

I actually have two mail targets that gobble up everything sent to them and
feed it to bogofilter's dictionary as either SPAM or HAM, respectively. I
almost never use them, though, because bogofilter also classifies IP addresses
and I fear it might classify the IP of the mail server itself (which will of
course appear in all the emails) as a SPAM source.
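
One possible way around that worry (not something I have set up, just a
sketch) would be to drop the Received headers added by the local server
before a message is used for training. The Python below uses the standard
`email` module; the function name `strip_local_received` and the marker list
are hypothetical, and the IP addresses in the usage note are documentation
placeholders.

```python
from email import message_from_bytes
from typing import Iterable


def strip_local_received(raw: bytes, local_markers: Iterable[str]) -> bytes:
    """Remove Received headers that mention any of the local markers.

    `local_markers` is a list of strings (hostnames or IPs) identifying
    the local mail server; this is an assumption of the sketch.
    """
    markers = list(local_markers)
    msg = message_from_bytes(raw)
    received = msg.get_all("Received", [])
    kept = [h for h in received if not any(m in h for m in markers)]
    # Deleting by name removes every Received header; the ones we want
    # to keep are appended again (their position among headers changes).
    del msg["Received"]
    for header in kept:
        msg["Received"] = header
    return msg.as_bytes()
```

For example, `strip_local_received(raw, ["192.0.2.1"])` would return the
message without the hop that mentions 192.0.2.1, ready to be piped to the
training command.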

-- 
Oded
::..
We're programmers. Programmers are, in their hearts, architects, and the first 
thing they want to do when they get to a site is to bulldoze the place flat 
and build something grand. We're not excited by incremental renovation: 
tinkering, improving, planting flower beds.
	-- Joel Spolsky / "Things you Should Never Do, Part I"