procmail
[Top] [All Lists]

Re: rule to catch a certain number of characters

2007-05-26 13:53:09
Dallman,

thanks for taking that challenge ...

I sent you a sample of those mails off list, for reference ...

In an older episode (Saturday, 26. May 2007 21:54), Dallman Ross wrote:
So, how would I - not familiar with scoring so far - match that
long line?

The starting place would be "man procmailsc",

OK.

but this is 
an interesting challenge, and I will try to help more specifically
now.

Gotcha ;)
Thanks.

I have an idea.  Can you imagine a legitimate message
with even a 1,000-char title string without whitespace?  I can't.
So why not trash at that level instead of looking for 100,000?

Agreed (actually, I can't think of legitimate <title>1,000 chars</title> 
messages at all, so maybe we don't even have to stick to non-space 
characters there?).

First of all, let's not bother unless it's that Content-Type.
Then, let's not bother unless it's bigger than 100K.

  SPACE = ' '
  TAB   = '   '
  WS    = $SPACE$TAB

Why $SPACE$TAB? Doesn't that mean a space followed by a tab?

  :0:

  * ^Content-Type:.*/html
  *   B ?? > 100000
  * $ B ?? $xWS1152.*$*.*<\title>
  spampile

Is <\title> a typo for </title>?


Actually, we could look for the full 100K if we wanted to now
without needing more LINEBUF. 

But this seems unnecessary to me. 

Same here.

Cheers,

wolfgang


____________________________________________________________
procmail mailing list   Procmail homepage: http://www.procmail.org/
procmail(_at_)lists(_dot_)RWTH-Aachen(_dot_)DE
http://MailMan.RWTH-Aachen.DE/mailman/listinfo/procmail