perl-unicode

What to do with non-assigned points?

2002-03-18 03:13:15

There has been a bug reported that

use Encode qw(from_to); use strict;
my $s = join "", map { chr } 128..255;
for my $enc (qw( iso8859_3 ))
{
   from_to($s,$enc,"utf-8");
}

complains. (The complain will be cleaned up but problem still remains.)

The problem is that iso-8859-3 does not assign characters to all octet
values.

What should Encode:: do in such cases:
  A. U+FFFD
  B. Map octet to Unicode/iso-8859-1
  C. Use a "private use" page...


--
Nick Ing-Simmons
http://www.ni-s.u-net.com/



<Prev in Thread] Current Thread [Next in Thread>