procmail

Re: Filtering on ISO-8859-1 encoded Subject lines

2004-08-31 07:16:54
When we tickled J Britain, this came out:

When filtering UCE, quite often these days the Subject line contents
are encoded with single or multiple encoded iso-8859-1 strings.

What do you mean by 'multiple encoded'?


I've found: http://www.xs4all.nl/~rvtol/procmail/bq_head.rc

See also http://www.xs4all.nl/~rvtol/procmail --> bq_demo.rc


But I am unsure how to incorporate it as a filter such that I can save
messages to a file (mbox format) based upon filter words, such as
(pharmacy|medication|moneyback|pills) etc etc
and others to "pass through" (not-trapped).  Currently "not trapping"
is based upon a recognized address in the header, but this has to be
monitored for false positives, and manually added.

I would much rather have a specific list of reject words, so
monitoring is unnecessary.

# see http://www.xs4all.nl/~rvtol/procmail --> basic --> pm
#     H_vars.inc and globals.inc
# and http://www.xs4all.nl/~rvtol/procmail --> inc
#     bq.inc (etc.)

  SHELL     = /bin/sh
  INCLUDERC = $HOME/pm/H_vars.inc   # sets H_Subj and INCDIR, among others

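  # If the Subject holds an encoded-word (=?charset?b?text?= or =?charset?q?text?=),
  # pass the encoded text to bq.inc and put the result back into H_Subj.
  :0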
  * H_Subj ?? ()=\?[a-z0-9-]+\?[bq]\?\/[^?]+
  {
    bq_Input  = $MATCH
    INCLUDERC = $INCDIR/inc/bq.inc
    H_Subj    = $bq_Result
  }

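  # Deliver to mbox.suspect (with a local lockfile) when H_Subj contains
  # one of the reject words; everything else passes through.
  :0: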
  * H_Subj ?? (pharmacy|(m|rn)ed[i1]cat[i1][o0]n|money ?back|pills)
  mbox.suspect

(untested)
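
To check what a decoded Subject should look like before trusting the recipe, the same two steps (decode the encoded-words, then match the reject words) can be sketched outside procmail. This is only an illustration in Python using the standard email.header module; it is not how bq.inc works, and the names SUSPECT, decode_subject and is_suspect are made up for this example.

```python
import re
from email.header import decode_header, make_header

# Same reject-word pattern as in the recipe above. re.IGNORECASE mirrors
# procmail's default case-insensitive matching.
SUSPECT = re.compile(
    r"(pharmacy|(m|rn)ed[i1]cat[i1][o0]n|money ?back|pills)",
    re.IGNORECASE,
)


def decode_subject(raw):
    """Decode all RFC 2047 encoded-words in a raw Subject value.

    Several adjacent encoded-words are joined into one string, and
    plain (unencoded) subjects are returned unchanged.
    """
    return str(make_header(decode_header(raw)))


def is_suspect(raw):
    """Return True when the decoded Subject contains a reject word."""
    return SUSPECT.search(decode_subject(raw)) is not None


if __name__ == "__main__":
    samples = [
        "=?iso-8859-1?Q?Cheap_ph?= =?iso-8859-1?Q?armacy?=",  # two Q-encoded words
        "=?ISO-8859-1?B?cGlsbHM=?=",                           # one B-encoded word
        "Meeting notes for Tuesday",                           # not encoded
    ]
    for raw in samples:
        print(repr(decode_subject(raw)), is_suspect(raw))
```

The first sample is a Subject split over two encoded-words, which is one way to read 'multiple encoded'; the keyword only becomes visible after both parts are decoded and joined.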

-- 
Grtz, Ruud


____________________________________________________________
procmail mailing list   Procmail homepage: http://www.procmail.org/
procmail(_at_)lists(_dot_)RWTH-Aachen(_dot_)DE
http://MailMan.RWTH-Aachen.DE/mailman/listinfo/procmail
