On Wed, 03 Dec 2003 14:46:50 -0500 (EST), Dave Stern - Former Rocket
Scientist <dave(_at_)umiacs(_dot_)umd(_dot_)edu> wrote:
On Wed, 3 Dec 2003, John Oliver wrote:
On Tue, Dec 02, 2003 at 06:52:58PM -0800, Professional Software Engineering
wrote:
At 17:41 2003-12-02 -0800, John Oliver wrote:
I'm looking for a procmail rule which will catch all mail that is not in
an English character set. Can anyone point me in the right direction?
Follow the .sig
---
Sean B. Straw / Professional Software Engineering
Procmail disclaimer:
<http://www.professional.org/procmail/disclaimer.html>
Please DO NOT carbon me on list replies. I'll get my copy from the list.
_______________________________________________
procmail mailing list
procmail(_at_)lists(_dot_)RWTH-Aachen(_dot_)DE
http://MailMan.RWTH-Aachen.DE/mailman/listinfo/procmail
I'm assuming you're telling me to RTFM... :-) However, I see no utility
to search the list archives. Google isn't helping. There certainly
isn't a "How to block non-English character sets" in the man pages. I
could probably bumble around and figure out a recipe if I knew what I
was looking for in the mail.
I suppose I could download everything and start grepping around. I was
kinda hoping, though, that someone could just point me to a URL, or
point me to a search facility I'm not seeing. I'm just looking for a
nudge in the right direction.
One (possibly extreme) solution is to install spamassassin and have procmail
kick that off. You could then tailor either your user_prefs or the system
local.cf to include
ok_locales en
which will do what you want
I am not a procmail expert, but I see the following lines of code in
Junk Filter, and Spam bouncer, I don't know even if we are allowed to
post this here, but it is an open source code and I am not selling it:).
Maybe this can give a starting point.
Please give credit to the author not to me, this is not my code, I wish
that I am as advanced in procmail as the authors of this code.
# Chinese email
:0
* CHINESE ?? no
* LANGFILTER ?? yes
{
:0
* ^(Content-Type:((.*$)+|.*;)( )*charset( )?=(
)?.?(hz-)?(big5|gb-?1988|gb-?2312)([^a-z0-9\.]|$)|\
(From|To|Message-ID):.*(hz-)?(big5|gb-?1988|gb-?2312)([^a-z0-9\.]|$)|\
Subject:.*(hz-)?(big5|gb-?1988|gb-?2312))
{ CN=yes }
:0
* LEANTAG ?? no
* ! CN ?? yes
{
:0 B
* !--.*forwarded message --
* !^forwarded message:
* !^-----BEGIN PGP SIGNED MESSAGE-----
* -1000^0
* -500^0 ^Subject: Re:
* -200^1 ^[:;#>]
* 1100^0 ^Content-Type:.*;( )*charset( )?=(
)?.?(hz-)?(big5|gb-?1988|gb-?2312)([^a-z0-9\.]|$)
* 1100^0 ^Content-Type:(.*$)+( )*charset( )?=(
)?.?(hz-)?(big5|gb-?1988|gb-?2312)([^a-z0-9\.]|$)
{ CN=yes }
}
:0
* CN ?? yes
{
:0 f
| ${FORMAIL} -A"X-SBRule: Chinese (big5|gb1988|gb2312)"
_______________________________________________
procmail mailing list
procmail(_at_)lists(_dot_)RWTH-Aachen(_dot_)DE
http://MailMan.RWTH-Aachen.DE/mailman/listinfo/procmail