Quoting Earl Hood (earl(_at_)earlhood(_dot_)com):
> On May 24, 2004 at 20:38, David L. Dewey wrote:
>
> > Got lots of these...
> >
> > Malformed UTF-8 character (unexpected continuation byte
> > 0xa1, with no preceding start byte) at
> > /usr/local/share/namazu/pl/gfilter.pl line 99.
> > Malformed UTF-8 character (unexpected continuation byte
> > 0xa3, with no preceding start byte) at
> > /usr/local/share/namazu/pl/gfilter.pl line 99.
> > Malformed UTF-8 character (unexpected continuation byte
> > 0xa1, with no preceding start byte) at
> > /usr/local/share/namazu/pl/gfilter.pl line 99.
> > ...
>
> Check your *LANG* environment settings. Before running mharc scripts,
> or mknmz, set them to the C locale. Namazu does not support UTF-8
> locale settings.
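
For reference, the locale advice above might be applied in a Bourne-style shell roughly as follows. This is only a sketch. That LC_ALL or another LC_* variable was still set to a UTF-8 locale is an assumption, not something confirmed in this thread, but those variables take precedence over LANG, so setting LANG alone may not be enough. The indexing command itself is left out because its arguments depend on the local setup.

```sh
# Force the C locale for this session before running the mharc scripts or mknmz.
# LC_ALL overrides the individual LC_* variables and LANG, so set it as well
# (assumption: one of them may still have pointed at a UTF-8 locale).
LANG=C
LC_ALL=C
export LANG LC_ALL

# Print the effective settings so they can be checked by eye.
# Skipped quietly if the locale utility is not available.
if command -v locale >/dev/null 2>&1; then
    locale
fi
```

The exports only affect the current shell and the processes started from it, so the indexing run has to be launched from the same session (or the same lines placed at the top of whatever script or cron job starts it).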
Thanks, Earl, but that didn't seem to work. LANG is now
set to C. I began the reindex and it ran for a long time
without error, but then suddenly blew up with tens of thousands
of these again:
Malformed UTF-8 character (unexpected continuation byte
0xb8, with no preceding start byte) in pattern match (m//)
at /usr/local/share/namazu/filter/mailnews.pl line 216,
<GEN3> line 45191.
Could this be because Text::Kakasi is not installed? I cannot
get the module installed, either through cpan or manually.
All other namazu requirements should be met.
dave
---------------------------------------------------------------------
To sign-off this list, send email to majordomo(_at_)mhonarc(_dot_)org with the
message text UNSUBSCRIBE MHARC-USERS