New User? Need help? Click here to register for free! Registering removes the advertisements.

Computer Cops
image image image image image image image image
Donations
If you found this site helpful, please donate to help keep it online
Don't want to use PayPal? Try our physical address
image
Prime Choice
· Head Lines
· Advisories (All)
· Dnld of the Week!
· CCSP News Ltrs
· Find a Cure!

· Ian T's (AR 24)
· Marcia's (CO8)
· Bill G's (CO12)
· Paul's (AR 5)
· Robin's (AR 2)

· Ian T's Archive
· Marcia's Archive
· Bill G's Archive
· Paul's Archive
· Robin's Archive
image
Security Central
· Home
· Wireless
· Bookmarks
· CLSID
· Columbia
· Community
· Downloads
· Encyclopedia
· Feedback (send)
· Forums
· Gallery
· Giveaways
· HijackThis
· Journal
· Members List
· My Downloads
· PremChat
· Premium
· Private Messages
· Proxomitron
· Quizz
· RegChat
· Reviews
· Google Search
· Sections
· Software
· Statistics
· Stories Archive
· Submit News
· Surveys
· Top
· Topics
· Web Links
· Your Account
image
CCSP Toolkit
· Email Virus Scan
· UDP Port Scanner
· TCP Port Scanner
· Trojan TCP Scan
· Reveal Your IP
· Algorithms
· Whois
· nmap port scanner
· IPs Banned [?]
image
Survey
How much can you give to keep Computer Cops online?

$10 up to $25 per year?
$25 up to $50 per year?
$10 up to $25 per month?
$25 up to $50 per month?
More than $50 per year?
More than $50 per month?
One time only?
Other (please comment)



Results
Polls

Votes: 1159
Comments: 21
image
Translate
English German French
Italian Portuguese Spanish
Chinese Greek Russian
image
 Forum FAQForum FAQ   SearchSearch   UsergroupsUsergroups   ProfileProfile   Login to check your private messagesLogin to check your private messages   LoginLogin 

Everything is spam?

 
Post new topic   Reply to topic       Computer Cops Forum Index -> Mailwasher - Troubleshooting / General
View previous topic :: View next topic  
Author Message
m4gician

Cadet
Cadet



Joined: Jun 01, 2004
Posts: 2
Location: Uk

PostPosted: Tue Jun 01, 2004 11:03 am    Post subject: Everything is spam?
Reply with quote

I have been using Mailwasher for a while now but recently it has developed a fault. Every email is listed as "Probable spam" or "known Spam" even when I know who is sending it. I want to teach MW to automatically bounce and delete known spam or probable spam but that's dangerous now because it thinks *everything* is spam. Also, in the 'Learning' column, only the envelope is visibale and the bin has disappeared.

If anyone can help me get back to normal, it would be mucha appreciated.

M4gician
Back to top
View users profile Send private message
rogerw

Major
Major
Premium Member
Premium Member


Joined: May 11, 2003
Posts: 857
Location: USA

PostPosted: Tue Jun 01, 2004 11:26 am    Post subject:
Reply with quote

Sounds like you really don't have an appreciation of what the Bayesian (learning) filters can do for you and how they work.

From your description, you've 'trained' the feature with a great number of 'junk' emails and not enough 'legitiamte' emails so that MW is tending to score everything as 'junk'.

If you turn the learning feature off, you'll get back the prior functionality. To turn it off, navigate Tools>Options>Learning then unset the Checkboxes on each of the 3 property sheets there.


Once the feature is off, you can read up on the Bayesian to see how it works and if you want to try using it later.
Back to top
View users profile Send private message
m4gician

Cadet
Cadet



Joined: Jun 01, 2004
Posts: 2
Location: Uk

PostPosted: Wed Jun 02, 2004 7:06 am    Post subject:
Reply with quote

The learning feature was fine and I was teaching it well but then it went haywire a week or so ago. Is it simply a case of training it with too many junk emails?

Do you happen to know why the dustbin icon has vanished from the learning column, leaving only the envelope?
Back to top
View users profile Send private message
Ikeb

General
General
Premium Member
Premium Member


Joined: Apr 20, 2003
Posts: 3555
Location: Canada

PostPosted: Wed Jun 02, 2004 9:38 am    Post subject:
Reply with quote

I had a similar thing happen but admittedly not the same implications. In my case, after several weeks of good Learning Spam Tool (LST) performance, for over a week most msgs were being left as "Unknown". I just kept training the LST and it is now behaving itself.

The 'envelope' (on my screen and with my poor eyesight I can't even make out the symbol) signifies an opportunity to "train" the LST to consider the msg as Legit. That the "trashbin" is missing indicates that the msg is currently considered Junk already and there's no need to train as Junk.

_________________
I like SPAM ... on my sandwich!
Back to top
View users profile Send private message Send email
rogerw

Major
Major
Premium Member
Premium Member


Joined: May 11, 2003
Posts: 857
Location: USA

PostPosted: Wed Jun 02, 2004 10:34 am    Post subject:
Reply with quote

m4gician wrote:
Is it simply a case of training it with too many junk emails?

That is likely the case.

The Learing tool doesn't match phrases in email like the filter tool does. Rather, it gathers statistics on individual words within what you train as 'good' mail and 'junk' - how freqently they occur in junk mail and legit mail - then evaluates incoming mail based upon a weighting factors of the same words in the incoming mail.

If you train with a disproportionate number of junk mails with respect to good mail, then the 'weighting' of all the words in the data built up will favor things being classified as junk.

When this happens, you'll need to add to the 'legitimate' training so that MW has a better statistical sample of junk/legit and a better database of words to better make choices for you.
Back to top
View users profile Send private message
stan_qaz

General
General
Premium Member
Premium Member


Joined: Mar 31, 2003
Posts: 4112
Location: USA

PostPosted: Wed Jun 02, 2004 11:34 am    Post subject:
Reply with quote

You might have a file corruption problem too, close MW and delete the training.dat file, MW will recreate it next time you start up.

If that doesn't help try deleting all the training files and start over.
Back to top
View users profile Send private message Visit posters website
stapel

Trooper
Trooper



Joined: Apr 23, 2004
Posts: 19
Location: USA

PostPosted: Fri Jun 04, 2004 12:45 pm    Post subject:
Reply with quote

I've had periodic problems with Mailwasher 4.1 suddenly marking obvious spam as "Probably Legitimate" (even if it is letter-for-letter the same as another message marked "Known Spam"), or not marking the "Known Spam" for blacklisting and deletion. But when I delete the Mailwasher "Training" files, this seems to reset Mailwasher to sensible behavior for another couple weeks. So try deleting the "Training" files and see if this helps.

Just my $0.02.

Eliz.
Back to top
View users profile Send private message Visit posters website
AlphaCentauri

Captain
Captain



Joined: Nov 20, 2003
Posts: 302
Location: USA

PostPosted: Tue Jun 08, 2004 2:24 pm    Post subject:
Reply with quote

rogerw wrote:
If you train with a disproportionate number of junk mails with respect to good mail, then the 'weighting' of all the words in the data built up will favor things being classified as junk.


Does that mean that if I get spam that has only a few quotes from Bartlett's quotations and links to an image and a URL, I should tell the learning filter that it is legit?
Back to top
View users profile Send private message
rogerw

Major
Major
Premium Member
Premium Member


Joined: May 11, 2003
Posts: 857
Location: USA

PostPosted: Tue Jun 08, 2004 3:01 pm    Post subject:
Reply with quote

AlphaCentauri wrote:
Does that mean that if I get spam that has only a few quotes from Bartlett's quotations and links to an image and a URL, I should tell the learning filter that it is legit?
Certainly not! ... and the uncommon words that might be culled from dictionaries aren't the problem.

I know you're pointing out that an image-based spam with a few words of text will be hard for Bayesian to classify in the first place - and it's a good point to make for those less familiar. The other tools may be better at trapping such an email.

For the benefit of that same group, I'll restate to:

As a statistical method of classifying, there needs to be a statistically significant sampling of good and junk mails so that common words (ones that appear in both junk and legit emails) don't wind up with a weighting that will tend to skew the assignments. Training with practically all junk and few legit mails will not allow the occurance of common words in in the good mail offset the junk, and you'll wind up having a majority of emails classified as junk.
Back to top
View users profile Send private message
stan_qaz

General
General
Premium Member
Premium Member


Joined: Mar 31, 2003
Posts: 4112
Location: USA

PostPosted: Wed Jun 09, 2004 10:13 am    Post subject:
Reply with quote

I just train anything that isn't classified and retrain anything that is misclassified with very good results.

Haven't had a misclassification in some time now.
Back to top
View users profile Send private message Visit posters website
Display posts from previous:   
Post new topic   Reply to topic       Computer Cops Forum Index -> Mailwasher - Troubleshooting / General All times are GMT - 5 Hours
Page 1 of 1

 
 
Jump to:  
You cannot post new topics in this forum
You cannot reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum
You cannot vote in polls in this forum
You cannot attach files in this forum
You can download files in this forum


Powered by phpBB 2.0.8a © 2001 phpBB Group

Version 2.0.6 of PHP-Nuke Port by Tom Nitzschner © 2002 www.toms-home.com
Version 2.2 by Paul Laudanski © 2003-2004 Computer Cops