namazu-users-en
[Top] [All Lists]

Re: Malformed UTF-8 character ...

2004-05-07 16:54:28
Earl Hood wrote:

You Instead of "en_US.UTF-8" You have to set "C".

probably "LC_ALL" or "LC_CTYPE" etc. It is en_US.UTF-8.
Please set up LC_ALL=C and use mknmz.

Is there any drawback of including the "use bytes" pragma to
avoid this problem?  

Since it is not tested at all, it may have a fault.
8-bit character sequence It is the basis of the assumption "ja_JP.*" 
and is made.

Besides mailnews.pl, it is man.pl etc. The 8-bit character sequence 
(EUC-JP) is used.

Now We recommend you to use "C" strongly.

Is there a need to support older versions
of perl that do not support the pragma?

Yes.
In however, that case "eval" could be used.

ex)
  eval 'use byte';
-- 
=====================================================================
TADAMASA TERANISHI  yw3t-trns(_at_)asahi-net(_dot_)or(_dot_)jp
http://www.asahi-net.or.jp/~yw3t-trns/index.htm
Key fingerprint =  474E 4D93 8E97 11F6 662D  8A42 17F5 52F4 10E7 D14E


<Prev in Thread] Current Thread [Next in Thread>