procmail
[Top] [All Lists]

Re: Non-English character sets

2003-12-03 18:53:28
On Wed, 03 Dec 2003 14:46:50 -0500 (EST), Dave Stern - Former Rocket
Scientist <dave(_at_)umiacs(_dot_)umd(_dot_)edu> wrote:

On Wed, 3 Dec 2003, John Oliver wrote:

On Tue, Dec 02, 2003 at 06:52:58PM -0800, Professional Software Engineering 
wrote:
At 17:41 2003-12-02 -0800, John Oliver wrote:
I'm looking for a procmail rule which will catch all mail that is not in
an English character set.  Can anyone point me in the right direction?

Follow the .sig

---
  Sean B. Straw / Professional Software Engineering

  Procmail disclaimer: 
<http://www.professional.org/procmail/disclaimer.html>
  Please DO NOT carbon me on list replies.  I'll get my copy from the list.


_______________________________________________
procmail mailing list
procmail(_at_)lists(_dot_)RWTH-Aachen(_dot_)DE
http://MailMan.RWTH-Aachen.DE/mailman/listinfo/procmail

I'm assuming you're telling me to RTFM... :-)  However, I see no utility
to search the list archives.  Google isn't helping.  There certainly
isn't a "How to block non-English character sets" in the man pages.  I
could probably bumble around and figure out a recipe if I knew what I
was looking for in the mail.

I suppose I could download everything and start grepping around.  I was
kinda hoping, though, that someone could just point me to a URL, or
point me to a search facility I'm not seeing.  I'm just looking for a
nudge in the right direction.


One (possibly extreme) solution is to install spamassassin and have procmail
kick that off. You could then  tailor either your user_prefs or the system
local.cf to include

ok_locales  en

which will do what you want

I am not a procmail expert, but I see the following lines of code in
Junk Filter, and Spam bouncer, I don't know even if we are allowed to
post this here, but it is an open source code and I am not selling it:).
Maybe this can give a starting point.

Please give credit to the author not to me, this is not my code, I wish
that I am as advanced in procmail as the authors of this code.



# Chinese email
:0
* CHINESE ?? no
* LANGFILTER ?? yes
{
 :0
 *  ^(Content-Type:((.*$)+|.*;)( )*charset( )?=(
)?.?(hz-)?(big5|gb-?1988|gb-?2312)([^a-z0-9\.]|$)|\

(From|To|Message-ID):.*(hz-)?(big5|gb-?1988|gb-?2312)([^a-z0-9\.]|$)|\
      Subject:.*(hz-)?(big5|gb-?1988|gb-?2312))
 { CN=yes }

 :0
 * LEANTAG ?? no
 * ! CN ?? yes
 {
  :0 B
  * !--.*forwarded message --
  * !^forwarded message:
  * !^-----BEGIN PGP SIGNED MESSAGE-----
  * -1000^0
  *  -500^0   ^Subject: Re:
  *  -200^1   ^[:;#>]
  *  1100^0   ^Content-Type:.*;( )*charset( )?=(
)?.?(hz-)?(big5|gb-?1988|gb-?2312)([^a-z0-9\.]|$)
  *  1100^0   ^Content-Type:(.*$)+( )*charset( )?=(
)?.?(hz-)?(big5|gb-?1988|gb-?2312)([^a-z0-9\.]|$)
  { CN=yes }
 }

 :0
 * CN ?? yes
 {
  :0 f
  | ${FORMAIL} -A"X-SBRule: Chinese (big5|gb1988|gb2312)"




_______________________________________________
procmail mailing list
procmail(_at_)lists(_dot_)RWTH-Aachen(_dot_)DE
http://MailMan.RWTH-Aachen.DE/mailman/listinfo/procmail

<Prev in Thread] Current Thread [Next in Thread>