Hi Leonardo,
(gdb) print wide_char
$2 = 128374 L'\x1f576'
(gdb) quit

L'\x1f576' (in wide_char) is probably the `dark sunglasses' (U+1F576) unicode character
It is. One of three non-ASCII characters in that subject.
$ uip/scan -file 8759.2.email -format '%(decode{subject})' |
> iconv -t ucs-4le |
> hexdump -ve '5/4 " % 8x" /0 "\n"'
1f576 53 75 6e 2019
73 20 6f 75 74
2c 20 73 61 76
69 6e 67 73 20
4f 4e 2014 73 68
6f 70 20 6d 61
6a 6f 72 20 61
70 70 6c 69 61
6e 63 65 20 64
65 61 6c 73 20
6e 6f 77 a
$
and directly trying to:

    wcwidth(L'\x1f576')

...returns `-1'.
That would do it. Could you apply the attached patch and re-run? I'm basically interested in how that locale classes it, e.g. iswprint(3).

-- 
Cheers, Ralph.
https://plus.google.com/+RalphCorderoy
cpstripped.patch
Description: Text Data