perl-unicode

Re: Need help writing a UCM file

2002-05-05 21:58:06
On Monday, May 6, 2002, at 11:34 , Andreas J. Koenig wrote:
Dan,

I have to work with an encoding that is not round trip safe. It has
both trema and umlaut, and it has been decided to map both to U0308
(COMBINING DIAERESIS). So I thought I'd write

    <U0308>  \xC8 |0   # (12/8) Trema
    <U0308>  \xC9 |3   # (12/9) Umlaut

in my ucm file.

Ucmlint reports

    MAB2.ucm:notice in line 192: <U0308>  \xC8 |0
    MAB2.ucm:notice in line 193: <U0308>  \xC9 |3
    MAB2.ucm:warning in line 193: dupe encode map: U0308 => C8 and C9
    MAB2.ucm:notice in line 194: <U030A>  \xCA |0

The resulting module complains

MAB2 "\xC9" does not map to Unicode at ../lib/Tie/MAB2/Raw.pm line 55.

when it is fed with 'xC9'.

When I change the |3 to |0, then the warning goes away, but is that
the correct way to deal with it?

Do you have ENCODING duplicate somewhere? Like \C9 defined not only to map <U0308> but also elsewhere? I would appreciate if you sent me a whole UCM file.

Dan the Encode Maintainer

<Prev in Thread] Current Thread [Next in Thread>