1 #=======================================================================
2 # FTP file name: CORPCHAR.TXT
4 # Contents: Registry (external version) of Apple use of
5 # Unicode corporate-zone characters.
7 # Copyright: (c) 1994-1998 by Apple Computer, Inc., all rights
10 # Contacts: Peter Edberg <pedberg@apple.com>
11 # Julio Gonzalez <juliog@apple.com>
12 # John Jenkins <jenkins@apple.com>
16 # b02 1998-Aug-18 Expanded usage of 0xF8A0. Matches internal
18 # n11 1998-Feb-05 Minor update to header comments
19 # n09 1997-Dec-14 Update to match internal registry <n23>:
20 # Add source hint 0xF850, transcoding hints 0xF860-
21 # 0xF86B and 0xF870-0xF872, deprecate almost all
22 # other non-hint corporate characters.
23 # n08 1997-Jul-17 Update to match internal registry <n13>:
24 # Add characters for Mac OS Chinese, Korean & Farsi.
25 # Add CJK source hints.
26 # Deprecate some characters in favor of combinations
27 # of standard characters and transcoding hints.
28 # Change header format.
29 # n04 1995-Nov-15 Update to match internal registry <n8>:
30 # Add characters for Mac OS Hebrew and Thai.
31 # n02 1995-Apr-18 First version. Matches internal registry <n5>.
36 # Apple, the Apple logo, and Macintosh are trademarks of Apple
37 # Computer, Inc., registered in the United States and other countries.
38 # Unicode is a trademark of Unicode Inc. For the sake of brevity,
39 # throughout this document, "Macintosh" can be used to refer to
40 # Macintosh computers and "Unicode" can be used to refer to the
43 # Apple makes no warranty or representation, either express or
44 # implied, with respect to these tables, their quality, accuracy, or
45 # fitness for a particular purpose. In no event will Apple be liable
46 # for direct, indirect, special, incidental, or consequential damages
47 # resulting from any defect or inaccuracy in this document or the
48 # accompanying tables.
50 # These mapping tables and character lists are subject to change.
51 # The latest tables should be available from the following:
53 # <ftp://ftp.unicode.org/Public/MAPPINGS/VENDORS/APPLE/>
54 # <ftp://dev.apple.com/devworld/Technical_Documentation/Misc._Standards/>
56 # For general information about Mac OS encodings and these mapping
57 # tables, see the file "README.TXT".
62 # Two tab-separated columns;
63 # '#' begins a comment which continues to the end of the line.
64 # Column #1 is the Unicode corporate character code point
66 # Column #2 is a comment containing:
67 # 1) an informal name describing the Unicode corporate character,
68 # or if it is deprecated, information about what to use
70 # 2) optionally, another '#', followed by information on which
71 # Mac OS encodings use the Unicode corporate character, and -
72 # if relevant - the Mac OS code points that correspond to the
73 # corporate character.
75 # The entries are in Unicode order.
76 #_______________________________________________________________________
78 # The block of 16 characters 0xF850-0xF85F is for source hint characters.
79 # These have no display (like zero-width no-break space). If they appear
80 # in text, they can only be mapped to tables that include them. If a run
81 # of Unicode characters such as Han characters could otherwise be mapped
82 # to any of several encodings, including one of these hint characters can
83 # force the text to be mapped only to an encoding whose mapping table
84 # includes the hint character. Once they have forced mapping to a particular
85 # encoding, they no longer apply (they don't need to be cancelled); if a
86 # subsequent character cannot be mapped to that encoding, it may be mapped
87 # to another encoding. Currently source hints are mainly defined for CJK
88 # source disambiguation.
89 # NOTE: These are only defined for application developers who have requested
90 # them. The Mac OS Text Encoding Converter does not generate these when
91 # converting from other CJK encodings to Unicode. However, it will handle
92 # these characters correctly when converting from Unicode to other encodings.
93 0xF850 # source hint: Reset, try all candidate encodings in preferred order.
94 0xF85C # source hint: Chinese simplified
95 0xF85D # source hint: Chinese traditional
96 0xF85E # source hint: Japanese
97 0xF85F # source hint: Korean
99 # The block of 32 characters 0xF860-0xF87F is for transcoding hints.
100 # These are used in combination with standard Unicode characters to force
101 # them to be treated in a special way for mapping to other encodings;
102 # they have no other effect.
104 # 0xF870-0xF87F are "variant tags" - they are like combining characters,
105 # and can follow a standard Unicode (or a sequence consisting of a base
106 # character and other combining characters) to tag it so that it will be
107 # unique, treated in a special way for transcoding. These always terminate
108 # a sequence of combining characters.
110 # 0xF860-0xF86B are "grouping hints" - they precede a group of two to
111 # four standard Unicode characters to indicate that they are treated as a
112 # group for transcoding. This grouping overrides any other combining
115 # Here are the ones defined so far:
116 0xF860 # transcoding hint: group next 2 characters # Japanese,Korean
117 0xF861 # transcoding hint: group next 3 characters # Japanese,Korean
118 0xF862 # transcoding hint: group next 4 characters # Japanese,Korean
119 0xF863 # transcoding hint: group next 4 characters, alt1 # Korean
120 0xF864 # transcoding hint: group next 4 characters, alt2 # Korean
121 0xF865 # transcoding hint: group next 4 characters, alt3 # Korean
122 0xF866 # transcoding hint: group next 4 characters, alt4 # Korean
123 0xF867 # transcoding hint: group next 2 characters, alt1 # Korean
124 0xF868 # transcoding hint: group next 2 characters, alt2 # Korean
125 0xF869 # transcoding hint: group next 2 characters, alt3 # Korean
126 0xF86A # transcoding hint: group next 2 characters, RL # Hebrew
127 0xF86B # transcoding hint: group next 4 characters, RL # Farsi variant
129 0xF870 # transcoding hint: variant tag 16 # Symbol, Korean
130 0xF871 # transcoding hint: variant tag 15 # Symbol, Korean
131 0xF872 # transcoding hint: variant tag 14 # Symbol
132 0xF873 # transcoding hint: variant tag 13 # Korean, Thai
133 0xF874 # transcoding hint: variant tag 12 # Korean, Thai
134 0xF875 # transcoding hint: variant tag 11 # Korean, Thai
135 0xF876 # transcoding hint: variant tag 10 # Korean
136 0xF877 # transcoding hint: variant tag 9 # Korean
137 0xF878 # transcoding hint: variant tag 8 # Korean
138 0xF879 # transcoding hint: variant tag 7 # Korean
139 0xF87A # transcoding hint: variant tag 6 # Korean
140 0xF87B # transcoding hint: variant tag 5 # Korean
141 0xF87C # transcoding hint: variant tag 4 # ChineseTrad, Korean, Dingbats
142 0xF87D # transcoding hint: variant tag 3 # ChineseTrad
143 0xF87E # transcoding hint: variant tag 2 # Chinese,Japanese
144 0xF87F # transcoding hint: variant tag 1 # CJK,Symbol,Dingbats,Hebrew
146 # The following (2) are metrics "characters" so applications can get the
147 # height and width of double-byte character glyphs by measuring the glyph of a
148 # one-byte character (e.g. calling CharWidth for character 0x82 in a Chinese
149 # Traditional font); this approach assumes that the glyphs for all double-byte
150 # characters in a font have the same metrics, which is currently true. Note
151 # that the width-metric character glyphs are used differently for TrueType and
152 # old-style bitmap fonts; for TrueType fonts the metric glyph width is equal
153 # to the full width of a double-byte character glyph, while for FBIT/FDEF
154 # bitmap fonts the metric glyph width is half the width of a double-byte
156 0xF880 # height-metric character for double-byte fonts # Chinese Simp&Trad-0x81
157 0xF881 # width-metric character for double-byte fonts # Chinese Simp&Trad-0x82
159 # The following (2) are for the TrueType variant of Mac OS Farsi.
160 # NOTE: 0xF883 is deprecated in favor of a combination of standard
161 # characters and transcoding hint. The deprecated character will still
162 # be loosely mapped to the appropriate Mac OS Farsi character.
163 0xF882 # Arabic ligature "peace on him" # Farsi(TrueType variant)-0x8B
164 0xF883 # deprecated, use 0xF86B+0x0631+0x064A+0x0627+0x0644 # Farsi(TrueType variant)-0xA4
166 # The following (22) are for the Mac OS Thai encoding.
167 # In this encoding, positional variants of upper vowels, tone marks,
168 # and other marks are normally handled automatically by WorldScript I.
169 # However, the Thai-DTP keyboard allows the codes for the positional
170 # variants to be entered directly, so they must be treated as
171 # characters. When the abstract character is treated as a positional
172 # variant, it has the right (and high, if relevant) position.
173 # NOTE: These are now all deprecated in favor of combinations of standard
174 # characters and transcoding hints. The deprecated characters will still
175 # be loosely mapped to the appropriate Mac OS Thai character.
176 0xF884 # deprecated, use 0x0E31+0xF874 # Thai-0x92
177 0xF885 # deprecated, use 0x0E34+0xF874 # Thai-0x94
178 0xF886 # deprecated, use 0x0E35+0xF874 # Thai-0x95
179 0xF887 # deprecated, use 0x0E36+0xF874 # Thai-0x96
180 0xF888 # deprecated, use 0x0E37+0xF874 # Thai-0x97
181 0xF889 # deprecated, use 0x0E47+0xF874 # Thai-0x93
182 0xF88A # deprecated, use 0x0E48+0xF874 # Thai-0x98
183 0xF88B # deprecated, use 0x0E48+0xF873 # Thai-0x88
184 0xF88C # deprecated, use 0x0E48+0xF875 # Thai-0x83
185 0xF88D # deprecated, use 0x0E49+0xF874 # Thai-0x99
186 0xF88E # deprecated, use 0x0E49+0xF873 # Thai-0x89
187 0xF88F # deprecated, use 0x0E49+0xF875 # Thai-0x84
188 0xF890 # deprecated, use 0x0E4A+0xF874 # Thai-0x9A
189 0xF891 # deprecated, use 0x0E4A+0xF873 # Thai-0x8A
190 0xF892 # deprecated, use 0x0E4A+0xF875 # Thai-0x85
191 0xF893 # deprecated, use 0x0E4B+0xF874 # Thai-0x9B
192 0xF894 # deprecated, use 0x0E4B+0xF873 # Thai-0x8B
193 0xF895 # deprecated, use 0x0E4B+0xF875 # Thai-0x86
194 0xF896 # deprecated, use 0x0E4C+0xF874 # Thai-0x9C
195 0xF897 # deprecated, use 0x0E4C+0xF873 # Thai-0x8C
196 0xF898 # deprecated, use 0x0E4C+0xF875 # Thai-0x87
197 0xF899 # deprecated, use 0x0E4D+0xF874 # Thai-0x8F
199 # The following (6) are for the Mac OS Hebrew encoding. Four of
200 # these are for the obsolete "canoral" codes that were used before
201 # System 7.1/Worldscript to control positioning of nikud marks (points).
202 # In the future these 4 code points may be redefined.
203 # NOTE: Some of these are deprecated in favor of a combination of standard
204 # character and transcoding hint. The deprecated characters will still
205 # be loosely mapped to the appropriate Mac OS Hebrew character.
206 0xF89A # deprecated, use 0xF86A+0x05DC+0x05B9 # Hebrew-0xC0
207 0xF89B # Hebrew canoral 1 # Hebrew-0xC2
208 0xF89C # Hebrew canoral 2 # Hebrew-0xC3
209 0xF89D # Hebrew canoral 3 # Hebrew-0xC4
210 0xF89E # Hebrew canoral 4 # Hebrew-0xC5
211 0xF89F # deprecated, use 0x05B8+0xF87F # Hebrew-0xDE
213 # The following (1) is for mapping the single undefined code point in
214 # the Mac OS Greek and Turkish encodings, thus permitting full
215 # round-trip fidelity. This character is also used for mapping EURO SIGN
216 # when mapping to Unicode 1.1 (e.g. for Mac OS Roman and Symbol).
217 0xF8A0 # undefined1 # Greek-0xFF, Turkish-0xF5
218 # also EURO SIGN for Unicode 1.1 # Roman-0xDB, Symbol-0xA0
220 # The following (54) are for the Mac OS Japanese encoding.
221 # part 1 - Apple corporate Unicode chars for Mac OS Japanese extended
222 # characters not in Unicode.
223 # NOTE: These are now all deprecated in favor of combinations of standard
224 # characters and transcoding hints. The deprecated characters will still
225 # be loosely mapped to the appropriate Mac OS Japanese character.
226 0xF8A1 # deprecated, use 0xF860+0x0030+0x002E # Jpn-0x8591
227 0xF8A2 # deprecated, use 0xF862+0x0058+0x0049+0x0049+0x0049 # Jpn-0x85AB
228 0xF8A3 # deprecated, use 0xF861+0x0058+0x0049+0x0056 # Jpn-0x85AC
229 0xF8A4 # deprecated, use 0xF860+0x0058+0x0056 # Jpn-0x85AD
230 0xF8A5 # deprecated, use 0xF862+0x0078+0x0069+0x0069+0x0069 # Jpn-0x85BF
231 0xF8A6 # deprecated, use 0xF861+0x0078+0x0069+0x0076 # Jpn-0x85C0
232 0xF8A7 # deprecated, use 0xF860+0x0078+0x0076 # Jpn-0x85C1
233 0xF8A8 # deprecated, use 0xFF4D+0xF87F # Jpn-0x8645
234 0xF8A9 # deprecated, use 0xFF47+0xF87F # Jpn-0x864B
235 0xF8AA # deprecated, use 0xFF4C+0xF87F # Jpn-0x8650
236 0xF8AB # deprecated, use 0xF860+0x0054+0x0042 # Jpn-0x865D
237 0xF8AC # deprecated, use 0xF861+0x0046+0x0041+0x0058 # Jpn-0x869E
238 0xF8AD # deprecated, use 0xF860+0x2193+0x2191 # Jpn-0x86CE
239 0xF8AE # deprecated, use 0x21E8+0xF87A # Jpn-0x86D3
240 0xF8AF # deprecated, use 0x21E6+0xF87A # Jpn-0x86D4
241 0xF8B0 # deprecated, use 0x21E7+0xF87A # Jpn-0x86D5
242 0xF8B1 # deprecated, use 0x21E9+0xF87A # Jpn-0x86D6
243 0xF8B2 # deprecated, use 0xF862+0x6709+0x9650+0x4F1A+0x793E # Jpn-0x87FB
244 0xF8B3 # deprecated, use 0xF862+0x8CA1+0x56E3+0x6CD5+0x4EBA # Jpn-0x87FC
245 0xF8B4 # deprecated, use 0x301E # Jpn-0x8855
246 # part 2 - Apple corporate Unicode chars for Mac OS Japanese vertical
247 # forms not in Unicode.
248 # NOTE: These are now all deprecated in favor of combinations of standard
249 # characters and transcoding hints. The deprecated characters will still
250 # be loosely mapped to the appropriate Mac OS Japanese character.
251 0xF8B5 # deprecated, use 0x3001+0xF87E # Jpn-0xEB41
252 0xF8B6 # deprecated, use 0x3002+0xF87E # Jpn-0xEB42
253 0xF8B7 # deprecated, use 0x203E+0xF87E # Jpn-0xEB50
254 0xF8B8 # deprecated, use 0x30FC+0xF87E # Jpn-0xEB5B
255 0xF8B9 # deprecated, use 0x2010+0xF87E # Jpn-0xEB5D
256 0xF8BA # deprecated, use 0x301C+0xF87E # Jpn-0xEB60
257 0xF8BB # deprecated, use 0x2016+0xF87E # Jpn-0xEB61
258 0xF8BC # deprecated, use 0xFF5C+0xF87E # Jpn-0xEB62
259 0xF8BD # deprecated, use 0x22EF+0xF87E # Jpn-0xEB63
260 0xF8BE # deprecated, use 0xFF3B+0xF87E # Jpn-0xEB6D
261 0xF8BF # deprecated, use 0xFF3D+0xF87E # Jpn-0xEB6E
262 0xF8C0 # deprecated, use 0xFF1D+0xF87E # Jpn-0xEB81
263 0xF8C1 # deprecated, use 0x3041+0xF87E # Jpn-0xEC9F
264 0xF8C2 # deprecated, use 0x3043+0xF87E # Jpn-0xECA1
265 0xF8C3 # deprecated, use 0x3045+0xF87E # Jpn-0xECA3
266 0xF8C4 # deprecated, use 0x3047+0xF87E # Jpn-0xECA5
267 0xF8C5 # deprecated, use 0x3049+0xF87E # Jpn-0xECA7
268 0xF8C6 # deprecated, use 0x3063+0xF87E # Jpn-0xECC1
269 0xF8C7 # deprecated, use 0x3083+0xF87E # Jpn-0xECE1
270 0xF8C8 # deprecated, use 0x3085+0xF87E # Jpn-0xECE3
271 0xF8C9 # deprecated, use 0x3087+0xF87E # Jpn-0xECE5
272 0xF8CA # deprecated, use 0x308E+0xF87E # Jpn-0xECEC
273 0xF8CB # deprecated, use 0x30A1+0xF87E # Jpn-0xED40
274 0xF8CC # deprecated, use 0x30A3+0xF87E # Jpn-0xED42
275 0xF8CD # deprecated, use 0x30A5+0xF87E # Jpn-0xED44
276 0xF8CE # deprecated, use 0x30A7+0xF87E # Jpn-0xED46
277 0xF8CF # deprecated, use 0x30A9+0xF87E # Jpn-0xED48
278 0xF8D0 # deprecated, use 0x30C3+0xF87E # Jpn-0xED62
279 0xF8D1 # deprecated, use 0x30E3+0xF87E # Jpn-0xED83
280 0xF8D2 # deprecated, use 0x30E5+0xF87E # Jpn-0xED85
281 0xF8D3 # deprecated, use 0x30E7+0xF87E # Jpn-0xED87
282 0xF8D4 # deprecated, use 0x30EE+0xF87E # Jpn-0xED8E
283 0xF8D5 # deprecated, use 0x30F5+0xF87E # Jpn-0xED95
284 0xF8D6 # deprecated, use 0x30F6+0xF87E # Jpn-0xED96
286 # The following (14) are for the Mac OS Dingbats encoding.
287 # NOTE: These are now all deprecated in favor of standard characters or
288 # combinations of standard characters and transcoding hints. The
289 # deprecated characters will still be loosely mapped to the appropriate
290 # Mac OS Dingbats character.
291 0xF8D7 # deprecated, use 0x0028 # Dingbats-0x80
292 0xF8D8 # deprecated, use 0x0029 # Dingbats-0x81
293 0xF8D9 # deprecated, use 0x0028+0xF87F # Dingbats-0x82
294 0xF8DA # deprecated, use 0x0029+0xF87F # Dingbats-0x83
295 0xF8DB # deprecated, use 0x3008 # Dingbats-0x84
296 0xF8DC # deprecated, use 0x3009 # Dingbats-0x85
297 0xF8DD # deprecated, use 0x2039 # Dingbats-0x86
298 0xF8DE # deprecated, use 0x203A # Dingbats-0x87
299 0xF8DF # deprecated, use 0x3008+0xF87C # Dingbats-0x88
300 0xF8E0 # deprecated, use 0x3009+0xF87C # Dingbats-0x89
301 0xF8E1 # deprecated, use 0x3014 # Dingbats-0x8A
302 0xF8E2 # deprecated, use 0x3015 # Dingbats-0x8B
303 0xF8E3 # deprecated, use 0x007B # Dingbats-0x8C
304 0xF8E4 # deprecated, use 0x007D # Dingbats-0x8D
306 # The following (26) are for the Mac OS Symbol encoding.
307 # NOTE: Some of these are deprecated in favor of combinations of standard
308 # characters and transcoding hints. The deprecated characters will still
309 # be loosely mapped to the appropriate Mac OS Symbol character.
310 0xF8E5 # radical extender # Symbol-0x60
311 0xF8E6 # vertical arrow extender # Symbol-0xBD
312 0xF8E7 # horizontal arrow extender # Symbol-0xBE
313 0xF8E8 # deprecated, use 0x00AE+0xF87F # Symbol-0xE2
314 0xF8E9 # deprecated, use 0x00A9+0xF87F # Symbol-0xE3
315 0xF8EA # deprecated, use 0x2122+0xF87F # Symbol-0xE4
316 0xF8EB # deprecated, use 0x0028+0xF870 # Symbol-0xE6
317 0xF8EC # deprecated, use 0x0028+0xF871 # Symbol-0xE7
318 0xF8ED # deprecated, use 0x0028+0xF872 # Symbol-0xE8
319 0xF8EE # deprecated, use 0x005B+0xF870 # Symbol-0xE9
320 0xF8EF # deprecated, use 0x005B+0xF871 # Symbol-0xEA
321 0xF8F0 # deprecated, use 0x005B+0xF872 # Symbol-0xEB
322 0xF8F1 # deprecated, use 0x007B+0xF870 # Symbol-0xEC
323 0xF8F2 # deprecated, use 0x007B+0xF871 # Symbol-0xED
324 0xF8F3 # deprecated, use 0x007B+0xF872 # Symbol-0xEE
325 0xF8F4 # curly bracket extender # Symbol-0xEF
326 0xF8F5 # deprecated, use 0x222B+0xF871 # Symbol-0xF4
327 0xF8F6 # deprecated, use 0x0029+0xF870 # Symbol-0xF6
328 0xF8F7 # deprecated, use 0x0029+0xF871 # Symbol-0xF7
329 0xF8F8 # deprecated, use 0x0029+0xF872 # Symbol-0xF8
330 0xF8F9 # deprecated, use 0x005D+0xF870 # Symbol-0xF9
331 0xF8FA # deprecated, use 0x005D+0xF871 # Symbol-0xFA
332 0xF8FB # deprecated, use 0x005D+0xF872 # Symbol-0xFB
333 0xF8FC # deprecated, use 0x007D+0xF870 # Symbol-0xFC
334 0xF8FD # deprecated, use 0x007D+0xF871 # Symbol-0xFD
335 0xF8FE # deprecated, use 0x007D+0xF872 # Symbol-0xFE
337 # The following (1) is for the Mac OS Roman encoding
338 # (also used in Symbol & Croatian).
339 # NOTE: The graphic image associated with the Apple logo character is
340 # not authorized for use without permission of Apple, and unauthorized
341 # use might constitute trademark infringement.
342 0xF8FF # Apple logo # Roman-0xF0, Symbol-0xF0, Croatian-0xD8