procmail
[Top] [All Lists]

Re: recipe to dump chinese spam

2000-01-06 22:49:40
On Wed, 5 Jan 2000, era eriksson wrote:

  ### WARNING: I have substituted \200 and \377 for literal ASCII 128
  ###  and 255, respectively, in the copy I post to the list
  ###  (not in the next comment, though; it's supposed to be human-readable)
    :0 # four or more of \200-\377 in a row in Subject or From
    * ^\/(From|Subject):.*[\200-\377][\200-\377][\200-\377][\200-\377]

I'm trying much the same thing in a weighted recipe (e.g. if more than
100 high-bit characters, bounce it), but the pattern matching is
failing badly. For this:

:0
* -100^0
* 1^1 [\200-\376]      (again, the recipe has the literal characters)

I'm getting a score of 9 on a 200+ character message body. Trying
things like adding the D flag and using [^\001-\177] instead also
don't work.

Is the RE engine broken for high-bit characters?

procmail-3.13.1-1

--
 John Hardin KA7OHZ   ICQ#15735746   http://www.wolfenet.com/~jhardin/
 jhardin(_at_)wolfenet(_dot_)com      pgpk -a 
finger://gonzo.wolfenet.com/jhardin
  768: 0x41EA94F5 - A3 0C 5B C2 EF 0D 2C E5  E9 BF C8 33 A7 A9 CE 76 
 1024: 0xB8732E79 - 2D8C 34F4 6411 F507 136C  AF76 D822 E6E6 B873 2E79
-----------------------------------------------------------------------
   87 days until Daylight Savings Time begins

<Prev in Thread] Current Thread [Next in Thread>