Here's some general information for Perl-Unicode porters I gleaned
from the meeting mentioned below (IIRC).
The Mac OS 9 maintains in the file system table whether a file is 'text' or
'utxt' (Unicode). Thus the Perl open() command would know on that
platform when to expect Unicode if the OS had been told.
BOM is not needed.
Also, the filenames are 'maximally decomposed' UTF-16. This is to
facilitate sorting and comparison.
Peter Edberg is the guy to contact for more info.
The International Macintosh Users Group presents:
Unicode Support in Mac OS 9
Group: International Macintosh Users Group (IMUG)
(A Forum for Multilingual / Multiscript Computing)
Date: November 18, 1999, 7-9 p.m.
Speakers: Yasuo Kida, Peter Edberg (Apple Computer, Inc.)
Topic: Unicode Support in Mac OS 9
Location: Apple Computer, Apple Campus, 1 Infinite Loop, Cupertino
Take Saratoga/Sunnyvale exit off 280, turn South into
Cupertino, turn left onto Mariani Avenue, left into
Admission: $2, free for IMUG members
Contact: Roger Sherman, (650) 859-5981
Yasuo Kida and Peter Edberg will describe and demonstrate some of
the features of Unicode and multilingual support in Mac OS 9.0:
* Unicode input using the Text Services Manager, with Unicode-based
input methods and Unicode keyboard layouts. This area will be
covered in the greatest depth, including technical details.
* Unicode text utilities.
* Unicode imaging using ATSUI and Unicode fonts.
* The Multilingual Text Editor, an object-oriented text editor that
uses the above facilities to provide Unicode editing capabilities.
Yasuo Kida is the manager of Engineering Tokyo in Apple's Mac OS
international software group responsible for CJK technologies. His
group's recent work includes the Kotoeri input method, language
engine APIs, Japanese tokenization based on morphological analysis,
and fonts. He also participated in the design of the Java input
method framework for the Java 2 platform.
Peter Edberg has worked in Apple's Mac OS international software
group for 11 years, at sites in Cupertino, France, China, and Oregon.
Recent areas of focus include text encoding conversion, Unicode text
utilities, and Unicode input. Before Apple, he worked on Japanese
handwriting input for 6 years, preceded by a couple of years in SRI
International's bioinformation systems group.