Re: From field in ISO-2022-JP

Earl Hood wrote:

There may be a use in having the ability to tell MHonArc how
to map characters, overriding the default tables.  Maybe something
like the following:

  <CharsetMap>
  euc-jp
  <UFF5E> \xA1\xC1
  </CharsetMap>

The first line gives the character/code set name.  Subsequent
lines associate Unicode code points with code points in the specified
set.
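
To make the proposed format concrete, here is a rough sketch of how
such a block might be read.  It is written in Python purely for
illustration; MHonArc itself is Perl, and nothing like this exists in
MHonArc today.  The function name parse_charset_map and the exact
entry syntax (a <Uxxxx> token followed by \xHH byte escapes) are
assumptions drawn from the single example above.

```python
import re

# One override entry: "<UFF5E> \xA1\xC1" (assumed syntax, taken from the example).
ENTRY_RE = re.compile(r"^<U([0-9A-Fa-f]{4,6})>\s+((?:\\x[0-9A-Fa-f]{2})+)$")


def parse_charset_map(text):
    """Parse a hypothetical <CharsetMap> block.

    Returns (charset_name, overrides), where overrides maps a Unicode
    code point (int) to the byte sequence in the named charset.
    """
    lines = [ln.strip() for ln in text.strip().splitlines() if ln.strip()]
    if len(lines) < 3 or lines[0] != "<CharsetMap>" or lines[-1] != "</CharsetMap>":
        raise ValueError("expected a <CharsetMap> ... </CharsetMap> block")

    # The first line inside the block names the character/code set.
    charset = lines[1].lower()

    overrides = {}
    for ln in lines[2:-1]:
        m = ENTRY_RE.match(ln)
        if m is None:
            raise ValueError(f"malformed entry: {ln!r}")
        code_point = int(m.group(1), 16)
        hex_pairs = re.findall(r"\\x([0-9A-Fa-f]{2})", m.group(2))
        overrides[code_point] = bytes(int(h, 16) for h in hex_pairs)
    return charset, overrides


EXAMPLE = r"""
<CharsetMap>
euc-jp
<UFF5E> \xA1\xC1
</CharsetMap>
"""

charset, overrides = parse_charset_map(EXAMPLE)
```

For the example block, charset is "euc-jp" and overrides holds a
single entry pairing U+FF5E with the two bytes 0xA1 0xC1.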

This would allow a user to customize MHonArc::CharEnt conversion
tables to resolve an ambiguity according to the preferences of
a particular locale.  It seems that visual appearance is the most
important issue, but one would have to be careful since making such
changes could affect other software, like search engines.

Would such a feature be useful?


Letting users choose their preferred charset map is basically a good
idea.  But allowing them to create their own maps seems likely to be
a source of confusion.
  Would it be difficult to let users choose a conversion from the
following instead?

        x-eucjp-unicode-0.9
        x-eucjp-jisx0221-1995
        x-eucjp-open-19970715-ms
        x-eucjp-open-19970715-0201
        x-eucjp-open-19970715-ascii

(http://www.w3.org/TR/japanese-xml/#AEN35663200)

  From a practical point of view, I recommend
x-eucjp-open-19970715-ms, because this mapping is supported by both
Mozilla (at least on Linux and Windows) and Internet Explorer.
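
  To illustrate what choosing a named conversion could look like,
here is a small Python sketch (not MHonArc code).  It decodes with
Python's built-in euc_jp codec and then applies a per-mapping
substitution.  The function decode_eucjp and the table
MAPPING_OVERRIDES are hypothetical.  Only two of the names above are
covered, and only the 0xA1C1 cell is shown; the assumption that the
"unicode-0.9" mapping yields U+301C (WAVE DASH) there and the "ms"
mapping yields U+FF5E (FULLWIDTH TILDE) should be checked against
the W3C note.

```python
# Hypothetical per-mapping substitutions applied after Python's "euc_jp"
# decoder, which maps the bytes 0xA1 0xC1 to U+301C (WAVE DASH).
# Assumption: the "ms" mapping differs at this cell by using U+FF5E.
MAPPING_OVERRIDES = {
    "x-eucjp-unicode-0.9": {},
    "x-eucjp-open-19970715-ms": {"\u301c": "\uff5e"},
}


def decode_eucjp(data, mapping="x-eucjp-open-19970715-ms"):
    """Decode EUC-JP bytes, then apply the substitutions of the named mapping."""
    try:
        overrides = MAPPING_OVERRIDES[mapping]
    except KeyError:
        raise ValueError(f"unknown mapping: {mapping}") from None
    text = data.decode("euc_jp")
    return text.translate(str.maketrans(overrides))


wave_default = decode_eucjp(b"\xa1\xc1", "x-eucjp-unicode-0.9")
wave_ms = decode_eucjp(b"\xa1\xc1", "x-eucjp-open-19970715-ms")
```

  The point of the design is that users pick from a fixed set of
names rather than editing individual code points, so an archive and
a search engine can agree on a mapping by name alone.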
