perl-unicode

RE: Problem with Encode module

2006-07-07 16:21:06
Thanks John. The original characters came from Japan, don't know if they
use some proprietary extension of shift_jis. Attached is the zipped
file. Hope it come across correctly this time. It should contains
characters 0x 87408741. 

-----Original Message-----
From: John Delacour [mailto:JD(_at_)BD8(_dot_)COM] 
Sent: Friday, July 07, 2006 4:02 PM
To: Jianyang Tai; perl-unicode(_at_)perl(_dot_)org
Subject: RE: Problem with Encode module

At 2:40 pm -0700 7/7/06, Jianyang Tai wrote:

Thanks for the reply. Are you sure those characters don't exist n 
shift-jis?  Please take a look at the attached text file. It contains 
two characters ("1" in a circle and "2" in a circle). The file is in 
shift-jis encoding.

Not possible. Here is an extract from
 
<ftp://ftp.unicode.org/Public/MAPPINGS/OBSOLETE/EASTASIA/JIS/SHIFTJIS.TX
T>
which shows a large gap where you say there are characters assigned. 
I presume the file is produced using some proprietary extension of
Shift_JIS with extra characters added, as is the case with Mac-Japanese,
but I suspect also that your file was corrupted in transit since it
contains characters 0x BD40BD41.  You would need to send it zipped.

0x84BB  0x2530  # BOX DRAWINGS DOWN HEAVY AND HORIZONTAL LIGHT
0x84BC  0x2525  # BOX DRAWINGS VERTICAL LIGHT AND LEFT HEAVY
0x84BD  0x2538  # BOX DRAWINGS UP HEAVY AND HORIZONTAL LIGHT
0x84BE  0x2542  # BOX DRAWINGS VERTICAL HEAVY AND HORIZONTAL LIGHT
0x889F  0x4E9C  # <CJK>
0x88A0  0x5516  # <CJK>
0x88A1  0x5A03  # <CJK>
0x88A2  0x963F  # <CJK>
0x88A3  0x54C0  # <CJK>

JD

Attachment: test_shiftjis.zip
Description: test_shiftjis.zip

<Prev in Thread] Current Thread [Next in Thread>