[tex-live] next encoding
James Cloos
cloos at jhcloos.com
Sun Mar 1 07:24:45 CET 2009
Markus Kuhn’s uniset¹ has this mapping file from Unicode Inc:
,----[ NEXTSTEP.TXT ]
| #
| # Name: NextStep Encoding to Unicode
| # Unicode version: 1.1
| # Table version: 0.1
| # Table format: Format A
| # Date: 1999 September 23
| # Authors: Rick McGowan
| #
| # Copyright (c) 1991-1999 Unicode, Inc. All Rights reserved.
| #
| # This file is provided as-is by Unicode, Inc. (The Unicode Consortium).
| # No claims are made as to fitness for any particular purpose. No
| # warranties of any kind are expressed or implied. The recipient
| # agrees to determine applicability of information provided. If this
| # file has been provided on optical media by Unicode, Inc., the sole
| # remedy for any claim will be exchange of defective media within 90
| # days of receipt.
| #
| # Unicode, Inc. hereby grants the right to freely use the information
| # supplied in this file in the creation of products supporting the
| # Unicode Standard, and to make copies of this file in any form for
| # internal or external distribution as long as this notice remains
| # attached.
| #
| # General notes:
| #
| # This table contains the data the Unicode Consortium has on how
| # NextStep Encoding characters map into Unicode. Since the first
| # 128 characters (0x0 - 0x7f) are identical to ASCII and Unicode,
| # this table only maps the NextStep range from 0x80 - 0xFF.
| #
| # This file is provided for historical reference only and pertains
| # to NextStep and OpenStep products shipped prior to the aquisition
| # of NeXT by Apple Computer, Inc. See http://www.apple.com for
| # further information.
| #
| # Format: Three tab-separated columns
| # Column #1 is the NextStep code (in hex as 0xXX)
| # Column #2 is the Unicode (in hex as 0xXXXX)
| # Column #3 NextStep name, Unicode name (follows a comment sign, '#')
| #
| # The entries are in NextStep order
| #
| # Any comments or problems, contact info at unicode.org
| #
| 0x80 0x00a0 # NO-BREAK SPACE
| 0x81 0x00c0 # LATIN CAPITAL LETTER A WITH GRAVE
| 0x82 0x00c1 # LATIN CAPITAL LETTER A WITH ACUTE
| 0x83 0x00c2 # LATIN CAPITAL LETTER A WITH CIRCUMFLEX
| 0x84 0x00c3 # LATIN CAPITAL LETTER A WITH TILDE
| 0x85 0x00c4 # LATIN CAPITAL LETTER A WITH DIAERESIS
| 0x86 0x00c5 # LATIN CAPITAL LETTER A WITH RING
| 0x87 0x00c7 # LATIN CAPITAL LETTER C WITH CEDILLA
| 0x88 0x00c8 # LATIN CAPITAL LETTER E WITH GRAVE
| 0x89 0x00c9 # LATIN CAPITAL LETTER E WITH ACUTE
| 0x8a 0x00ca # LATIN CAPITAL LETTER E WITH CIRCUMFLEX
| 0x8b 0x00cb # LATIN CAPITAL LETTER E WITH DIAERESIS
| 0x8c 0x00cc # LATIN CAPITAL LETTER I WITH GRAVE
| 0x8d 0x00cd # LATIN CAPITAL LETTER I WITH ACUTE
| 0x8e 0x00ce # LATIN CAPITAL LETTER I WITH CIRCUMFLEX
| 0x8f 0x00cf # LATIN CAPITAL LETTER I WITH DIAERESIS
| 0x90 0x00d0 # LATIN CAPITAL LETTER ETH
| 0x91 0x00d1 # LATIN CAPITAL LETTER N WITH TILDE
| 0x92 0x00d2 # LATIN CAPITAL LETTER O WITH GRAVE
| 0x93 0x00d3 # LATIN CAPITAL LETTER O WITH ACUTE
| 0x94 0x00d4 # LATIN CAPITAL LETTER O WITH CIRCUMFLEX
| 0x95 0x00d5 # LATIN CAPITAL LETTER O WITH TILDE
| 0x96 0x00d6 # LATIN CAPITAL LETTER O WITH DIAERESIS
| 0x97 0x00d9 # LATIN CAPITAL LETTER U WITH GRAVE
| 0x98 0x00da # LATIN CAPITAL LETTER U WITH ACUTE
| 0x99 0x00db # LATIN CAPITAL LETTER U WITH CIRCUMFLEX
| 0x9a 0x00dc # LATIN CAPITAL LETTER U WITH DIAERESIS
| 0x9b 0x00dd # LATIN CAPITAL LETTER Y WITH ACUTE
| 0x9c 0x00de # LATIN CAPITAL LETTER THORN
| 0x9d 0x00b5 # MICRO SIGN
| 0x9e 0x00d7 # MULTIPLICATION SIGN
| 0x9f 0x00f7 # DIVISION SIGN
| 0xa0 0x00a9 # COPYRIGHT SIGN
| 0xa1 0x00a1 # INVERTED EXCLAMATION MARK
| 0xa2 0x00a2 # CENT SIGN
| 0xa3 0x00a3 # POUND SIGN
| 0xa4 0x2044 # FRACTION SLASH
| 0xa5 0x00a5 # YEN SIGN
| 0xa6 0x0192 # LATIN SMALL LETTER F WITH HOOK
| 0xa7 0x00a7 # SECTION SIGN
| 0xa8 0x00a4 # CURRENCY SIGN
| 0xa9 0x2019 # RIGHT SINGLE QUOTATION MARK
| 0xaa 0x201c # LEFT DOUBLE QUOTATION MARK
| 0xab 0x00ab # LEFT-POINTING DOUBLE ANGLE QUOTATION MARK
| 0xac 0x2039 # LATIN SMALL LETTER
| 0xad 0x203a # LATIN SMALL LETTER
| 0xae 0xfb01 # LATIN SMALL LIGATURE FI
| 0xaf 0xfb02 # LATIN SMALL LIGATURE FL
| 0xb0 0x00ae # REGISTERED SIGN
| 0xb1 0x2013 # EN DASH
| 0xb2 0x2020 # DAGGER
| 0xb3 0x2021 # DOUBLE DAGGER
| 0xb4 0x00b7 # MIDDLE DOT
| 0xb5 0x00a6 # BROKEN BAR
| 0xb6 0x00b6 # PILCROW SIGN
| 0xb7 0x2022 # BULLET
| 0xb8 0x201a # SINGLE LOW-9 QUOTATION MARK
| 0xb9 0x201e # DOUBLE LOW-9 QUOTATION MARK
| 0xba 0x201d # RIGHT DOUBLE QUOTATION MARK
| 0xbb 0x00bb # RIGHT-POINTING DOUBLE ANGLE QUOTATION MARK
| 0xbc 0x2026 # HORIZONTAL ELLIPSIS
| 0xbd 0x2030 # PER MILLE SIGN
| 0xbe 0x00ac # NOT SIGN
| 0xbf 0x00bf # INVERTED QUESTION MARK
| 0xc0 0x00b9 # SUPERSCRIPT ONE
| 0xc1 0x02cb # MODIFIER LETTER GRAVE ACCENT
| 0xc2 0x00b4 # ACUTE ACCENT
| 0xc3 0x02c6 # MODIFIER LETTER CIRCUMFLEX ACCENT
| 0xc4 0x02dc # SMALL TILDE
| 0xc5 0x00af # MACRON
| 0xc6 0x02d8 # BREVE
| 0xc7 0x02d9 # DOT ABOVE
| 0xc8 0x00a8 # DIAERESIS
| 0xc9 0x00b2 # SUPERSCRIPT TWO
| 0xca 0x02da # RING ABOVE
| 0xcb 0x00b8 # CEDILLA
| 0xcc 0x00b3 # SUPERSCRIPT THREE
| 0xcd 0x02dd # DOUBLE ACUTE ACCENT
| 0xce 0x02db # OGONEK
| 0xcf 0x02c7 # CARON
| 0xd0 0x2014 # EM DASH
| 0xd1 0x00b1 # PLUS-MINUS SIGN
| 0xd2 0x00bc # VULGAR FRACTION ONE QUARTER
| 0xd3 0x00bd # VULGAR FRACTION ONE HALF
| 0xd4 0x00be # VULGAR FRACTION THREE QUARTERS
| 0xd5 0x00e0 # LATIN SMALL LETTER A WITH GRAVE
| 0xd6 0x00e1 # LATIN SMALL LETTER A WITH ACUTE
| 0xd7 0x00e2 # LATIN SMALL LETTER A WITH CIRCUMFLEX
| 0xd8 0x00e3 # LATIN SMALL LETTER A WITH TILDE
| 0xd9 0x00e4 # LATIN SMALL LETTER A WITH DIAERESIS
| 0xda 0x00e5 # LATIN SMALL LETTER A WITH RING ABOVE
| 0xdb 0x00e7 # LATIN SMALL LETTER C WITH CEDILLA
| 0xdc 0x00e8 # LATIN SMALL LETTER E WITH GRAVE
| 0xdd 0x00e9 # LATIN SMALL LETTER E WITH ACUTE
| 0xde 0x00ea # LATIN SMALL LETTER E WITH CIRCUMFLEX
| 0xdf 0x00eb # LATIN SMALL LETTER E WITH DIAERESIS
| 0xe0 0x00ec # LATIN SMALL LETTER I WITH GRAVE
| 0xe1 0x00c6 # LATIN CAPITAL LETTER AE
| 0xe2 0x00ed # LATIN SMALL LETTER I WITH ACUTE
| 0xe3 0x00aa # FEMININE ORDINAL INDICATOR
| 0xe4 0x00ee # LATIN SMALL LETTER I WITH CIRCUMFLEX
| 0xe5 0x00ef # LATIN SMALL LETTER I WITH DIAERESIS
| 0xe6 0x00f0 # LATIN SMALL LETTER ETH
| 0xe7 0x00f1 # LATIN SMALL LETTER N WITH TILDE
| 0xe8 0x0141 # LATIN CAPITAL LETTER L WITH STROKE
| 0xe9 0x00d8 # LATIN CAPITAL LETTER O WITH STROKE
| 0xea 0x0152 # LATIN CAPITAL LIGATURE OE
| 0xeb 0x00ba # MASCULINE ORDINAL INDICATOR
| 0xec 0x00f2 # LATIN SMALL LETTER O WITH GRAVE
| 0xed 0x00f3 # LATIN SMALL LETTER O WITH ACUTE
| 0xee 0x00f4 # LATIN SMALL LETTER O WITH CIRCUMFLEX
| 0xef 0x00f5 # LATIN SMALL LETTER O WITH TILDE
| 0xf0 0x00f6 # LATIN SMALL LETTER O WITH DIAERESIS
| 0xf1 0x00e6 # LATIN SMALL LETTER AE
| 0xf2 0x00f9 # LATIN SMALL LETTER U WITH GRAVE
| 0xf3 0x00fa # LATIN SMALL LETTER U WITH ACUTE
| 0xf4 0x00fb # LATIN SMALL LETTER U WITH CIRCUMFLEX
| 0xf5 0x0131 # LATIN SMALL LETTER DOTLESS I
| 0xf6 0x00fc # LATIN SMALL LETTER U WITH DIAERESIS
| 0xf7 0x00fd # LATIN SMALL LETTER Y WITH ACUTE
| 0xf8 0x0142 # LATIN SMALL LETTER L WITH STROKE
| 0xf9 0x00f8 # LATIN SMALL LETTER O WITH STROKE
| 0xfa 0x0153 # LATIN SMALL LIGATURE OE
| 0xfb 0x00df # LATIN SMALL LETTER SHARP S
| 0xfc 0x00fe # LATIN SMALL LETTER THORN
| 0xfd 0x00ff # LATIN SMALL LETTER Y WITH DIAERESIS
| 0xfe 0xfffd # .notdef, REPLACEMENT CHARACTER
| 0xff 0xfffd # .notdef, REPLACEMENT CHARACTER
`----
1] http://www.cl.cam.ac.uk/~mgk25/download/uniset.tar.gz
-JimC
--
James Cloos <cloos at jhcloos.com> OpenPGP: 1024D/ED7DAEA6
More information about the tex-live
mailing list