Need help with regex filter.

Cowboy · Guest

I need a filter that will weed out comments placed in the middle of a word.

It would delete:
Buy my delicious sp<!kfgkh8899>am.

But it would not delete:
Buy my delicious <!kfgkh8899>spam.

I can not write the filter for myself, so if someone could help me with this it would do a lot for my spam filtering.

Thanks! Very Happy

stan_qaz · Posted: Sat Nov 22, 2003 1:09 pm Post subject:

Go to the search function and select the firetrust catagory and search on html, you will find plenty of discussion of the topic and several suggestions for filters.

Cowboy · Guest

There is no filter like I need. At least not that I can find.
The closest is the filter that counts the comments but I think that is not what I need.

stan_qaz · Posted: Sat Nov 22, 2003 5:26 pm Post subject:

That is as good as it is going to get, the problem isn't easy to solve as you saw from the posts you looked at.

Are you willing to pay to have a filter written? Make an offer and see if someone is willing to tackle the problem for some cash.

If not chip into the threads asking for a processed message option for the filters, the fix I like best.

Cowboy · Guest

Nonsense. It can not be as good as it gets until someone tries to write the filter. Nobody has tried yet!

denn988 · Guest

Ikeb · Posted: Sun Nov 23, 2003 1:12 am Post subject:

Cowboy · Guest

I have tried. I've read the help files. I've tried to put together the parameters to make such a filter. I've sat for hours trying everything I can think of. Never once did I get it to work. So I decided I needed help. All I got was a bunch of comments about as useful as my filter attempts.

In the best of worlds I would have gotten a "That's a good idea to filter out just html comments that are used to disguise words, instead of trying to count the comments. Here's your filter.", or I would get a "That's a bad idea because it's impossible to make such a filter." Or at least not a thread bogged down by the assume patrol.

denn988 · Guest

As long as you have already tried....

See if this will help:

The body....
contains Regular Expr...

denn988 · Guest

Cowboy,

I have had a day to see how the above filter works and it looks pretty good so far.

There are a couple of mods that I have made to it that have improved its trap rate.

Change the above RegExp to:

Ikeb · Posted: Mon Nov 24, 2003 1:54 am Post subject:

Denn988, thanks for another good one.

Do you think it's OK to have the filter fire on a single hit?

Also, instead of the [^<] negation, why not use [^>] since it's the closing ">" bracket that will follow this part of the match?

denn988 · Guest

Ikeb · Posted: Mon Nov 24, 2003 9:45 am Post subject:

denn988 · Guest

Guest · Posted: Mon Nov 24, 2003 10:22 am Post subject:

Sorry,

I forgot to turn th e HTML off when I posted

Those examples, if sent as HTML, would appear in the raw text as:

1 0 & l t ; 2 0 & l t ; 3 0

and

3 0 & g t ; 2 0 & g t ; 1 0

I had to place spaces between each charactor above to get them to post.

The brackets must be sustituted when converting them to the HTML raw text in order to keep the translator from being confused.[/quote]


	Home ˇ Topics ˇ Submit News ˇ Top 10 This entire site, Cops Themes, and Computer Cops are Š 2002 - 2004 Computer Cops, LLC. All rights reserved. You can syndicate our news using the file RSS 0.91, ultramode.txt, or RSS 1.0. Acceptable Use Policy. Use signifies your agreement. Engine Copyright Š 2002 by PHP-Nuke, GNU/GPL Licensed. ICRA Member. Paul Laudanski, Member of Computer Cops, LLC Server Load: 820 pages served in previous 5 minutes. Page Rendered: 0.867 seconds.