perl-unicode

ftp://ftp.math.ohio-state.edu/pub/users/ilya/Unicode/pictures/

1999-11-22 17:33:06
I put Version 2 of compressed unicode vtables into

  ftp://ftp.math.ohio-state.edu/pub/users/ilya/Unicode/pictures/

Somehow I could not read the Unicode tables without visual feedback, so I
combined glyphs downloaded from the Unicode site with the Unicode tables
into a small collection of images.

The directory ../bin contains the scripts used for this.  load was working
in August.  makeGIF.pl requires an old GD (1.19 or older).  I used the
Unicode tables supplied with Perl 5.005_62 to annotate the pictures.

(Thus the glyphs are older than the annotations; I hope this does not
create any confusion. ;-)

Hints: 

  *) xv displays these nicely on 1280x1024 screens;
  *) I printed these on Letter-size paper using 56% magnification in xv (128 dpi);
  *) green-framed glyphs needed scaling;
  *) ""-quoted names (=purple) are old names for control chars;
  *) Names of ranges (shown in the upper-left corner) are stripped from char names;
  *) The abbreviations used in names:
         lc SMALL LETTER
         uc CAPITAL LETTER
         \w LETTER
         w WITH
         sn SIGN
         mk MARK
         ac ACCENT
         \d DIGIT
  *) Format of the type (=pink):

    general [combining] [<decomposition>] [numeric] ["comment"] [casemap]

     x) "Numeric" looks like =9, ==9, ===9 depending on whether it is a
        decimal digit, digit, or number;
     x) "decomposition" has < and > stripped (but is put into <> after this);
     x) "casemap" is uc=... lc=... tc=... ("..." may contain several chars)
     x) "general" is mapped using
         Lu => 'uc',                  # Letter, Uppercase
         Ll => 'lc',                  # Letter, Lowercase
         Lt => 'Ttlc',                # Letter, Titlecase
         Mn => 'nonspacing Mark',     # Mark, Non-Spacing
         Mc => 'spacing combining mark',  # Mark, Spacing Combining
         Me => 'enclosing mark',          # Mark, Enclosing
         Nd => 'decimal',             # Number, Decimal Digit
         Nl => 'letter-number',       # Number, Letter
         No => 'other-number',        # Number, Other
         Zs => 'space',               # Separator, Space
         Zl => 'line-sep',            # Separator, Line
         Zp => 'paragr-sep',          # Separator, Paragraph
         Cc => 'control',             # Other, Control
         Cf => 'format',              # Other, Format
         Cs => 'surrog',              # Other, Surrogate
         Co => 'private-use',         # Other, Private Use
         Cn => '???',                 # Other, Not Assigned (no)
         Lm => 'modifier-letter',     # Letter, Modifier
         Lo => 'other-letter',        # Letter, Other
         Pc => 'connector',           # Punctuation, Connector
         Pd => 'dash',                # Punctuation, Dash
         Ps => 'punct-open',          # Punctuation, Open
         Pe => 'punct-close',         # Punctuation, Close
         Pi => 'ini-quote',           # Punctuation, Initial quote (may
                                      # behave like Ps or Pe depending
                                      # on usage)
         Pf => 'fin-quote',           # Punctuation, Final quote (may
                                      # behave like Ps or Pe depending
                                      # on usage)
         Po => 'other-punct',         # Punctuation, Other
         Sm => 'math',                # Symbol, Math
         Sc => 'currency',            # Symbol, Currency
         Sk => 'modifier-symbol',     # Symbol, Modifier
         So => 'other-symbol',        # Symbol, Other

      x) "combining" is mapped using
         0 => '',                     #: Spacing, split, enclosing, reordrant,
                                      #: and Tibetan subjoined
         1 => 'over/inter',           #: Overlays and interior
         7 => 'nukta',                #: Nuktas
         8 => 'voicing',              #: Hiragana/Katakana voicing marks
         9 => 'virama',               #: Viramas
         10 => 'fixed-start',         #: Start of fixed position classes
         199 => 'fixed-end',          #: End of fixed position classes
         200 => '|_',                 #: Below left attached
         202 => '_',                  #: Below attached
         204 => '_|',                 #: Below right attached
         208 => '<',                  #: Left attached (reordrant around
                                      #: single base character)
         210 => '>',                  #: Right attached
         212 => '|~',                 #: Above left attached
         214 => '~',                  #: Above attached
         216 => '~|',                 #: Above right attached
         218 => '||_',                #: Below left
         220 => '__',                 #: Below
         222 => '_||',                #: Below right
         224 => '<<',                 #: Left (reordrant around single base
                                      #: character)
         226 => '>>',                 #: Right
         228 => '||~',                #: Above left
         230 => '~~',                 #: Above
         232 => '~||',                #: Above right
         233 => 'vv',                 #: Double below
         234 => '^^',                 #: Double above
         240 => '_iota_',             #: Below (iota subscript)

       [handling of 10..199 should be improved; right now it prints comb?/num]
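
For reading the type strings on the pictures, here is a rough Python sketch
of how the name abbreviations and the type format above can be assembled.
It is not the makeGIF.pl code.  It uses Python's unicodedata module, whose
tables are much newer than the 5.005_62 ones.  The function names
abbreviate, numeric_field and type_string are made up for illustration, the
word-by-word matching in abbreviate and the exact form of the comb?/num
fallback are guesses, range names are not stripped, and the "comment" and
casemap fields are left out.

```python
import re
import unicodedata

# The "general" and "combining" tables, copied from the legend above.
GENERAL = {
    "Lu": "uc", "Ll": "lc", "Lt": "Ttlc",
    "Mn": "nonspacing Mark", "Mc": "spacing combining mark",
    "Me": "enclosing mark",
    "Nd": "decimal", "Nl": "letter-number", "No": "other-number",
    "Zs": "space", "Zl": "line-sep", "Zp": "paragr-sep",
    "Cc": "control", "Cf": "format", "Cs": "surrog",
    "Co": "private-use", "Cn": "???",
    "Lm": "modifier-letter", "Lo": "other-letter",
    "Pc": "connector", "Pd": "dash", "Ps": "punct-open",
    "Pe": "punct-close", "Pi": "ini-quote", "Pf": "fin-quote",
    "Po": "other-punct",
    "Sm": "math", "Sc": "currency", "Sk": "modifier-symbol",
    "So": "other-symbol",
}
COMBINING = {
    0: "", 1: "over/inter", 7: "nukta", 8: "voicing", 9: "virama",
    10: "fixed-start", 199: "fixed-end",
    200: "|_", 202: "_", 204: "_|", 208: "<", 210: ">",
    212: "|~", 214: "~", 216: "~|",
    218: "||_", 220: "__", 222: "_||", 224: "<<", 226: ">>",
    228: "||~", 230: "~~", 232: "~||",
    233: "vv", 234: "^^", 240: "_iota_",
}
# Two-word phrases come first so "SMALL LETTER" wins over bare "LETTER".
ABBREVIATIONS = [
    ("SMALL LETTER", "lc"), ("CAPITAL LETTER", "uc"), ("LETTER", "\\w"),
    ("WITH", "w"), ("SIGN", "sn"), ("MARK", "mk"), ("ACCENT", "ac"),
    ("DIGIT", "\\d"),
]


def abbreviate(name):
    """Shorten a character name; whole-word matching is an assumption."""
    for phrase, short in ABBREVIATIONS:
        # A function replacement avoids re treating "\w" as an escape.
        name = re.sub(r"\b%s\b" % phrase, lambda m, s=short: s, name)
    return name


def numeric_field(ch):
    """=9 for a decimal digit, ==9 for a digit, ===9 for any other number."""
    if unicodedata.decimal(ch, None) is not None:
        return "=%d" % unicodedata.decimal(ch)
    if unicodedata.digit(ch, None) is not None:
        return "==%d" % unicodedata.digit(ch)
    value = unicodedata.numeric(ch, None)
    return "" if value is None else "===%g" % value


def type_string(ch):
    """Build: general [combining] [<decomposition>] [numeric]."""
    general = GENERAL.get(unicodedata.category(ch), "???")
    cls = unicodedata.combining(ch)
    # Guessed rendering of the "comb?/num" fallback for unlisted classes.
    combining = COMBINING.get(cls, "comb?/%d" % cls)
    # Strip < and > from the raw field, then wrap the whole thing in <>.
    decomp = unicodedata.decomposition(ch).replace("<", "").replace(">", "")
    fields = [general, combining, "<%s>" % decomp if decomp else "",
              numeric_field(ch)]
    return " ".join(f for f in fields if f)
```

For example, type_string("A") gives uc and type_string("7") gives
decimal =7, while abbreviate("LATIN SMALL LETTER A WITH GRAVE") gives
LATIN lc A w GRAVE.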

Oops, I see now that I forgot to put BIDIR into the syntax type...

Enjoy,
Ilya