procmail
[Top] [All Lists]

Re: regex syntax question

2004-03-02 20:05:53
At 15:56 2004-03-02 -0700, LuKreme wrote:
On 02 Mar 2004, at 14:14, Alan Clifford wrote:
# 10% of chars are &#
:0 BD
* -1^1 .
* 10^1 [&#]
action as spam line here

How big a hit do you take on counting every character like that? And you don't restrict it by size?

how long doe sit take on a 300K or 1500K message?

In the grand scheme of things, it's nominal (esp if you're running SA! <g>). Launch some processes, write a mailbox file - those are the things that have a propensity to chew mounds of processing time.

On a 4.3M mailbox, processed as one honking message (to reduce formail spit time and individual procmail invocations):

real    0m3.818s
user    0m3.550s
sys     0m0.230s

Answered my own question, eliminated the 'D' flag, and the user bumped up by about 1% (these over multiple runs each).

My own version eliminates the character class brackets (looking for &# as a character pair), and runs in:

real    0m3.711s
user    0m3.430s
sys     0m0.240s

This is on a not particularly screaming test box - just 550MHz, but it is plenty to the task, even with other duties. Heavier iron around here tends to have a heavier load.

Stub out the included filter, and the overhead for the sandbox is shown as:

real    0m0.287s
user    0m0.010s
sys     0m0.230s


Heavier duty SMP hardware, with a test file 8.3M in size (just the bare recipe, not in a sandbox environ):

real    0m3.930s
user    0m3.352s
sys     0m0.312s

---
 Sean B. Straw / Professional Software Engineering

 Procmail disclaimer: <http://www.professional.org/procmail/disclaimer.html>
 Please DO NOT carbon me on list replies.  I'll get my copy from the list.


_______________________________________________
procmail mailing list
procmail(_at_)lists(_dot_)RWTH-Aachen(_dot_)DE
http://MailMan.RWTH-Aachen.DE/mailman/listinfo/procmail

<Prev in Thread] Current Thread [Next in Thread>