perl-unicode

Need help writing a UCM file

2002-05-05 19:35:04
Dan,

I have to work with an encoding that is not round trip safe. It has
both trema and umlaut, and it has been decided to map both to U0308
(COMBINING DIAERESIS). So I thought I'd write

    <U0308>  \xC8 |0   # (12/8) Trema
    <U0308>  \xC9 |3   # (12/9) Umlaut

in my ucm file.

Ucmlint reports

    MAB2.ucm:notice in line 192: <U0308>  \xC8 |0
    MAB2.ucm:notice in line 193: <U0308>  \xC9 |3
    MAB2.ucm:warning in line 193: dupe encode map: U0308 => C8 and C9
    MAB2.ucm:notice in line 194: <U030A>  \xCA |0

The resulting module complains

    MAB2 "\xC9" does not map to Unicode at ../lib/Tie/MAB2/Raw.pm line 55.

when it is fed with 'xC9'.

When I change the |3 to |0, then the warning goes away, but is that
the correct way to deal with it?

-- 
andreas

<Prev in Thread] Current Thread [Next in Thread>