IEM - network operating center wrote:
If the result is "All check passed.", there are no problems
in NMZ.w and NMZ.wi.
Unfortunately, the check fails for the said list.
The output looks like this:
<snip>
==============================
check 1
==============================
nul : 0
control : 1
cr : 0
0x80 - 0xff : 2573
One word containing a control character is in the index.
2573 words containing 8-bit characters are in the index.
==============================
check 3
==============================
152027: 1506819 1506800
152028: 1506871 1506852
A gap of 19 bytes appears at entry 152027.
Entry 152028 is therefore shifted by the same 19 bytes.
Everything after this point is probably shifted as well.
374169: 4514255 4514197
374170: 4514308 4514250
374171: 4514361 4514303
By this point the shift has accumulated to 58 bytes.
Since the first gap was only 19 bytes, there must be two or more
places where a gap is introduced.
I cannot say for certain that the 8-bit characters broke the index,
but it is possible.
fail !!
==============================
1 check failed.
In any case, because the index is broken, searches against it cannot
return correct results.
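
To make the arithmetic in check 3 concrete, here is a small Python sketch.
It is not part of Namazu or nmzchkw.pl. It assumes each line of that
output is an entry number followed by two byte offsets, and it only takes
their difference; the function names are made up for illustration.

```python
import re

# Assumed line shape: "<entry>: <offset_a> <offset_b>", as in check 3.
LINE = re.compile(r"^(\d+):\s+(\d+)\s+(\d+)$")


def parse_offsets(text):
    """Return (entry, gap) pairs, where gap = first offset - second offset."""
    pairs = []
    for line in text.splitlines():
        m = LINE.match(line.strip())
        if m:
            entry, a, b = (int(g) for g in m.groups())
            pairs.append((entry, a - b))
    return pairs


def gap_changes(pairs):
    """Keep only the entries whose gap differs from the previous gap."""
    changes = []
    previous = 0
    for entry, gap in pairs:
        if gap != previous:
            changes.append((entry, gap))
            previous = gap
    return changes


sample = """\
152027: 1506819 1506800
152028: 1506871 1506852
374169: 4514255 4514197
374170: 4514308 4514250
374171: 4514361 4514303
"""

for entry, gap in gap_changes(parse_offsets(sample)):
    print(entry, gap)
```

For the five lines quoted above this prints "152027 19" and "374169 58".
The growth from 19 to 58 between the listed entries is what suggests that
the damage occurs in more than one place.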
--- namazu-2.0.14/scripts/mknmz.in 2004-04-08 17:34:42.000000000 +0900
+++ mknmz.in 2005-11-25 14:21:26.000000000 +0900
@@ -2250,7 +2250,7 @@ sub count_words ($$$$) {
     $$contref =~ tr/A-Z/a-z/;
     # Remove control char.
-    $$contref =~ tr/\x00-\x08\x0b-\x0c\x0e-\x1a/ /;
+    $$contref =~ tr/\x00-\x08\x0b-\x0c\x0e-\x1a\x80-\xff/ /;
     # Do wakatigaki if necessary.
     if (util::islang("ja")) {
Please apply the patch above, delete the index, and run mknmz again.
Then check the index with nmzchkw.pl once more.
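
For anyone unsure what the patched tr/// line does, here is a rough
equivalent in Python (mknmz itself is Perl; this is only an illustration,
and the names in it are hypothetical). Every byte in the listed ranges,
including the newly added 0x80-0xff, is replaced by a space, so the length
of the text does not change.

```python
# Byte ranges copied from the patched tr/// line: 0x00-0x08, 0x0b-0x0c,
# 0x0e-0x1a, plus the newly added 0x80-0xff.
RANGES = [(0x00, 0x08), (0x0B, 0x0C), (0x0E, 0x1A), (0x80, 0xFF)]


def build_table(ranges):
    """Build a 256-entry translation table mapping the ranges to a space."""
    table = bytearray(range(256))
    for lo, hi in ranges:
        for b in range(lo, hi + 1):
            table[b] = 0x20
    return bytes(table)


TABLE = build_table(RANGES)


def blank_bytes(data: bytes) -> bytes:
    """Replace control and 8-bit bytes with spaces, one for one."""
    return data.translate(TABLE)


print(blank_bytes(b"caf\xc3\xa9\x01menu\tok\n"))
```

Here the two 8-bit bytes and the 0x01 byte each become a space, while the
tab and the newline are left alone. Note the side effect: words written
with 8-bit characters (accented letters, for example) will no longer be
indexed with those characters.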
--
=====================================================================
TADAMASA TERANISHI yw3t-trns(_at_)asahi-net(_dot_)or(_dot_)jp
http://www.asahi-net.or.jp/~yw3t-trns/index.htm
Key fingerprint = 474E 4D93 8E97 11F6 662D 8A42 17F5 52F4 10E7 D14E