Hi Leonardo,
(gdb) print wide_char $2 = 128374 L'\x1f576' (gdb) quit L'\x1f576' (in wide_char) is probably the `dark sunglasses' (U+1F576) unicode character
It is. One of three non-ASCII characters in that subject. $ uip/scan -file 8759.2.email -format '%(decode{subject})' | > iconv -t ucs-4le | > hexdump -ve '5/4 " % 8x" /0 "\n"' 1f576 53 75 6e 2019 73 20 6f 75 74 2c 20 73 61 76 69 6e 67 73 20 4f 4e 2014 73 68 6f 70 20 6d 61 6a 6f 72 20 61 70 70 6c 69 61 6e 63 65 20 64 65 61 6c 73 20 6e 6f 77 a $
and directly trying to: wcwidth(L'\x1f576') ...returns `-1'.
That would do it. Could you apply the attached patch and re-run? I'm basically interested in how that locale classes it, e.g. iswprint(3). -- Cheers, Ralph. https://plus.google.com/+RalphCorderoy
cpstripped.patch
Description: Text Data
_______________________________________________ Nmh-workers mailing list Nmh-workers(_at_)nongnu(_dot_)org https://lists.nongnu.org/mailman/listinfo/nmh-workers
Previous by Date: | Re: [Nmh-workers] nmh-1.7-RC1: scan with complex subjects dumps core, Ralph Corderoy |
---|---|
Next by Date: | Re: [Nmh-workers] nmh-1.7-RC1: scan with complex subjects dumps core, Leonardo Taccari |
Previous by Thread: | Re: [Nmh-workers] nmh-1.7-RC1: scan with complex subjects dumps core, Leonardo Taccari |
Next by Thread: | Re: [Nmh-workers] nmh-1.7-RC1: scan with complex subjects dumps core, Leonardo Taccari |
Indexes: | [Date] [Thread] [Top] [All Lists] |