I'm trying to count big letter words in the message body, but
I'm unable to contruct the score recipe right. Say, that
I tolerate 3 big letter words, and if there is more, then
I consider it UBE. The regexp should ignore some words like:
SMTP, AM, IP, base64-decoded-lines.
I started with simple word count, but it doesn't work.
The regexp is supposed to
- start at word border
- must have at least 3 big letters
- have trailing space
max = 3
# Count capitalized words
*$ B ?? 1^0 ()\<[A-Z][A-Z][A-Z]+[ ]
count = $=
dummy = "$count capitalized words"
Content-Type: text/plain; charset="iso-8859-1"
L=E4hett=E4j=E4: Jari Aalto
L=E4hetetty: Tuesday, December 16, 1997 11:04 AM
Vastaanottaja: xx xx
TAMAN PAIVAN OSALTA ALKAA PROJEKTITEHTAILU OLLA VAIHTEEKSI KASASSA. =
txt txt txt ...