On 12 February 2000, Walter Dnes <waltdnes(_at_)waltdnes(_dot_)org> wrote:
[...]
I'm doing a "normalized" character count. The filter is a two-step
process...
1) it counts the "total number of characters"
2) it then subtracts 20 times the count of high-bit characters
in the range 160..255. This allows a 5% safety margin for the
occasional "Copyright"/"Trademark"/"Registered"
symbol. If the safety margin is exceeded, the score is
positive, and the filter activates.
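
The arithmetic described in the two steps above can be sketched roughly as follows. This is only an illustration in Python, not the actual filter; the function names and the `weight` parameter are hypothetical. The sign convention is also an assumption: computed literally as "total minus 20 times the high-bit count", the score drops below zero once high-bit characters exceed 5% of the message, so the sketch treats a negative score as "margin exceeded". The real filter may negate the score.

```python
def normalized_score(body: bytes, weight: int = 20) -> int:
    """Total byte count minus `weight` times the count of bytes in 160..255."""
    total = len(body)
    high = sum(1 for b in body if 160 <= b <= 255)
    return total - weight * high


def exceeds_margin(body: bytes, weight: int = 20) -> bool:
    """True when high-bit bytes make up more than 1/weight of the body.

    With weight=20 that is the 5% safety margin from the description.
    Assumption: a negative score means the margin was exceeded.
    """
    return normalized_score(body, weight) < 0
```

For example, a 100-byte body with 5 high-bit bytes sits exactly on the margin (score 0) and is not flagged, while one with 10 high-bit bytes scores -100 and is.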
[...]
You don't happen to speak French, do you? :-)
Regards,
Liviu Daia
--
Dr. Liviu Daia e-mail: Liviu(_dot_)Daia(_at_)imar(_dot_)ro
Institute of Mathematics web page: http://www.imar.ro/~daia
of the Romanian Academy PGP key: http://www.imar.ro/~daia/daia.asc