I'm certain many of you have already seen this but I think rereading
what Bruce Schneier says never hurt anyone:
http://www.counterpane.com/crypto-gram-0007.html#9
See also Markus Kuhn's UTF-8 material, including the (in)famous UTF-8
decoder stress test:
http://www.cl.cam.ac.uk/~mgk25/unicode.html#utf-8
http://www.cl.cam.ac.uk/~mgk25/ucs/examples/UTF-8-test.txt (section 4)
ftp://sunsite.doc.ic.ac.uk/packages/rfc/rfc2279.txt (section 6)
See also ext/Encode/Todo in the latest Perl developer snapshots.
In the context of Perl I'll see what I can do.
--
$jhi++; # http://www.iki.fi/jhi/
# There is this special biologist word we use for 'stable'.
# It is 'dead'. -- Jack Cohen