"John D. Hardin" <jhardin(_at_)wolfenet(_dot_)com> writes:
On Wed, 5 Jan 2000, era eriksson wrote:
### WARNING: I have substituted \200 and \377 for literal ASCII 128
### and 255, respectively, in the copy I post to the list
### (not in the next comment, though; it's supposed to be human-readable)
:0 # four or more of \200-\377 in a row in Subject or From
* ^\/(From|Subject):.*[\200-\377][\200-\377][\200-\377][\200-\377]
I'm trying much the same thing in a weighted recipe (e.g. if more than
100 high-bit characters, bounce it), but the pattern matching is
failing badly. For this:
:0
* -100^0
* 1^1 [\200-\376] (again, the recipe has the literal characters)
I'm getting a score of 9 on a 200+ character message body. Trying
things like adding the D flag and using [^\001-\177] instead also
don't work.
Is the RE engine broken for high-bit characters?
Did you mean to include the 'B' flag?
Philip Guenther