procmail

Re: recipe to dump chinese spam

2000-01-06 23:29:29
"John D. Hardin" <jhardin(_at_)wolfenet(_dot_)com> writes:
On Wed, 5 Jan 2000, era eriksson wrote:

  ### WARNING: I have substituted \200 and \377 for literal ASCII 128
  ###  and 255, respectively, in the copy I post to the list
  ###  (not in the next comment, though; it's supposed to be human-readable)
    :0 # four or more of \200-\377 in a row in Subject or From
    * ^\/(From|Subject):.*[\200-\377][\200-\377][\200-\377][\200-\377]

I'm trying much the same thing in a weighted recipe (e.g. if more than
100 high-bit characters, bounce it), but the pattern matching is
failing badly. For this:

:0
* -100^0
* 1^1 [\200-\376]      (again, the recipe has the literal characters)

I'm getting a score of 9 on a 200+ character message body. Trying
things like adding the D flag and using [^\001-\177] instead also
doesn't work.

Is the RE engine broken for high-bit characters?

Did you mean to include the 'B' flag?
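
If that is the cause, a minimal sketch of the fix would be the same
recipe with 'B' added. Without 'H' or 'B', procmail matches conditions
against the header only, so the scoring would be counting high-bit
characters in the header rather than in the body. The folder name
highbit-junk is a placeholder of mine, not from the original recipe,
and as in the quoted copy, \200 and \376 stand in for the literal
characters:

```
# Sketch only: \200 and \376 stand in for the literal characters 128 and 254.
# 'B' makes the conditions match against the body instead of the header.
# highbit-junk is a placeholder folder name (an assumption, not in the original).
:0 B
* -100^0
* 1^1 [\200-\376]
highbit-junk
```

With this, the recipe should fire only when the body contains more than
100 such characters, since the final score has to be positive.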


Philip Guenther
