procmail
[Top] [All Lists]

Re: Searching for html comments

2002-08-26 05:58:59
Recently I got spam that had most of the words in it broken up by html
comments.  We would have a few letters, then and html comment and
another couple of letters.  I tried devising a recipe to search for
that, but the log says its being skipped.  Can someone tell me what's
wrong with it?  Here it is.

:0 B:
* <\!--[0-9_a-zA-Z-->
junkmail

There are a couple of problems here.  '<' at the start of a condition
line is magic (it tests the size of a mail) so you need to put something
in front of it to tell procmail that the regular expression has started
- for example, empty parentheses.  The other problem is that you have no
closing right square bracket.  Once that is fixed you probably want a
'+' to match more than just one character in the HTML comment:

  :0 B:
  * ()<!--[0-9_a-z]+-->
  junkmail

procmail isn't case sensitive unless you tell it to be so, so you don't
need to test for both a-z and A-Z.  Are you sure you don't need to allow
for spaces in there also?

Hope that helps,
Martin
-- 
Martin McCarthy                 /</    http://procmail.non-prophet.org
    `Procmail Companion'        \>\
     Addison Wesley             /</                  PGP key available
_______________________________________________
procmail mailing list
procmail(_at_)lists(_dot_)RWTH-Aachen(_dot_)DE
http://MailMan.RWTH-Aachen.DE/mailman/listinfo/procmail

<Prev in Thread] Current Thread [Next in Thread>