Re: [Fwd: interest in the rules language]


On Fri, 2005/03/25 (MST), <info(_at_)utel(_dot_)net> wrote:

there is a "detail" which was not considered and which I focus on: it is the documentation of the language of the text to which the rules are to apply. And also in which language the program is to be entered. No use to have a P or Sieve language in ASCII for an Arabic, a Chinese, a Russian, etc. e-mail exchange.

I do not know how Sieve addresses this, but if we go with P, I would argue that P should use UTF-8 for encoding. Would using UTF-8 to write rules address the above concern?

A language is documented by a langtag. I was among those who blocked the second last call of the proposed RFC 3066 revamp and obtained a WG-ltru to consider it. One of the major points of contention was the refusal of the authors to consider the OPES requirements. Like Web Services, we need to have a clearly defined language to filter. This concerns both the header (lingual name, subject, etc.) and the content. The authors of the Draft (W3C and Unicode) are interested in two main things as far as I understand: documenting the language of the page for HTML and XML, and defining the language as part of the Unicode CLDR effort to define all the locales of all the OSes.

Are you talking about detecting the encoding and/or language of various message parts? If yes, then I think the noble efforts above are outside of the rules language core. There should be a mechanism to check what encoding/language is used, but how that information is stored in a message [part] is pretty much irrelevant to the core of the rules language, right?
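
To illustrate the separation I have in mind, here is a rough sketch in Python (not P or Sieve). It assumes the language is declared in a Content-Language header and the encoding in the Content-Type charset parameter, which is only one possible way to store that information. The function names part_language_info and any_part_in_language are made up for this example. A rule would only see the answer, not where it came from:

```python
from email import message_from_string


def part_language_info(raw_message):
    """Return (content_type, charset, language) for each non-multipart part.

    Assumption: language is taken from a Content-Language header and
    encoding from the Content-Type charset parameter. Either value is
    None when the part does not declare it.
    """
    msg = message_from_string(raw_message)
    info = []
    for part in msg.walk():
        if part.is_multipart():
            continue  # containers carry no text of their own
        info.append((
            part.get_content_type(),
            part.get_content_charset(),
            part.get("Content-Language"),
        ))
    return info


def any_part_in_language(raw_message, language):
    """Hypothetical rule-level check: does any part declare this language?"""
    return any(lang == language
               for _, _, lang in part_language_info(raw_message))
```

If the language were stored some other way (a langtag elsewhere in the message, or detection from the text itself), only part_language_info would change; the rule-level check would stay the same.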


Thanks,

Alex.