Re: how to use int'l data in perl scripts?

I don't know the details of what you're trying to do, so I'll just tellyou what I have done to deal with multiple character sets. It may helpyou; it may not.

If all you want to do is recognize and save/pass along the strings inother character sets, try replacing

use utf8;
with
use bytes;

"use bytes;" works for utf8 strings as well as strings in othercharacter sets.

I think it is best to perform regular expressions on UTF-8 strings -then you can use general property classes such as \p{IsAlpha}. Forthese types of regular expressions I switch to "use utf8;" for that onestatement and then switch back to "use bytes;".

I use Text::Iconv for transferring data between UTF-8 and othercharacter sets. With the project I'm working on, we always do ourprocessing in UTF-8 and transfer to/from other character sets only forsaving and returning data, and only when absolutely necessary (e.g. HTTPfile downloads to OS's that only understand certain character sets forfile names and file contents). We want our inner modules to be asgeneric as possible, and UTF-8 solves our problem better than anythingelse - since it handles all languages. Some people might think this istoo much work, but for our complex framework it's the only way it will work.

For web forms: in order to always get UTF-8 from form posts, we displayour web pages in UTF-8 and use the following <meta> tag in the <head> tag.

<meta content="text/html; charset=UTF-8" http-equiv="Content-Type">

Thanks,
Mary

Mags Doheny wrote:

Hi,

i need to get my perl scripts to recognize strings encoded in other
charsets; the utf8 pragma does the trick for unicode; does anyone know
of other pragmas available for, say, the iso-8859-x charsets?

Thanks,
Mags/