perl-unicode

[mark(_dot_)davis(_at_)jtcsv(_dot_)com: Unicode 3.2 Released]

2002-03-27 20:10:18
----- Forwarded message from Mark Davis <mark(_dot_)davis(_at_)jtcsv(_dot_)com> 
-----

Subject: Unicode 3.2 Released
From: "Mark Davis" <mark(_dot_)davis(_at_)jtcsv(_dot_)com>
Date: Wed, 27 Mar 2002 18:46:39 -0800
Message-ID: <00f301c1d602$c9b203d0$68de2b09(_at_)DAVIS1>
To: <unicore(_at_)unicode(_dot_)org>, <unicode(_at_)unicode(_dot_)org>, 
<w3c-i18n-ig(_at_)w3(_dot_)org>
Errors-to: unicore-bounce(_at_)unicode(_dot_)org

Unicode 3.2 has been released! The data files and documentation are final
and posted on the Unicode site. For details, see UAX #28, Unicode 3.2 at
<http://www.unicode.org/unicode/reports/tr28/>.

Unicode 3.2 adds 1,016 additional characters, and now includes the most
extensive set of characters for mathematical and technical publishing yet
defined. Encodings for four indigenous scripts of the Philippines have been
added to the repertoire, as well as new special characters such as the Word
Joiner, Grapheme Joiner, and Variation Selectors. A number of characters are
also added for interoperating with JIS X 0213:2000.

Unicode 3.2 has also updated a number of conformance issues, including
tightening of the definition of UTF-8, when to replace ZWNBSP (BOM) with
Word Joiner, and clarifications of the use of Combining Marks, Hangul
Syllables, and Conjoining Jamo. Case Mappings (formerly UTR#21) is now part
of the Unicode Standard. Additional explanatory material has been added for
decomposition and for specific characters and scripts; in particular,
clarifications and charts have been added to the Khmer script block to
better explain the encoding.

New properties have been added to the Unicode Character Database. These
include: recommended UCD property names and property value names; an exact
specification of when variation selectors can be used; when each character
was added to the Unicode Standard; grapheme cluster information, ideographic
description categories; code points that are ignorable by default, and
deprecated characters.

Unicode 3.2 is fully synchronized with International Standard ISO/IEC
10646-1:2000, with its Amendment 1, and with ISO/IEC 10646-2:2001.

Please note that while all of the online data files and documentation for
the Unicode 3.2 have been updated, it will take a few days to complete the
rollup of all of the revised pdf code charts on the website. Charts for the
specific additional characters for Version 3.2 can, however, already be
accessed via the links in UAX #28.

Also note that material on default grapheme clusters has been moved from
the beta version of Unicode 3.2 into a new proposed draft UTR #29: Text
Boundaries at <http://www.unicode.org/unicode/reports/tr29/>.

Mark


----- End forwarded message -----

-- 
$jhi++; # http://www.iki.fi/jhi/
        # There is this special biologist word we use for 'stable'.
        # It is 'dead'. -- Jack Cohen

<Prev in Thread] Current Thread [Next in Thread>
  • [mark(_dot_)davis(_at_)jtcsv(_dot_)com: Unicode 3.2 Released], Jarkko Hietaniemi <=