# $NetBSD: UCS%GB12345.src,v 1.1 2006/11/23 03:25:24 tnozaki Exp $
# $DragonFly: src/share/i18n/csmapper/GB/UCS%GB12345.src,v 1.1 2008/04/10 10:21:07 hasso Exp $
SRC_ZONE 0x00A4 - 0xFFE5
# This mapping data is made from the mapping data provided by Unicode, Inc.
# Name: GB12345-80 to Unicode table (complete, hex format)
# Unicode version: 1.1
# Table version: 0.0d1
# Table format: Format A
# Date: 6 December 1993
# Author: Glenn Adams <glenn@metis.com>
# John H. Jenkins <John_Jenkins@taligent.com>
# Copyright (c) 1991-1994 Unicode, Inc. All Rights reserved.
# This file is provided as-is by Unicode, Inc. (The Unicode Consortium).
# No claims are made as to fitness for any particular purpose. No
# warranties of any kind are expressed or implied. The recipient
# agrees to determine applicability of information provided. If this
# file has been provided on magnetic media by Unicode, Inc., the sole
# remedy for any claim will be exchange of defective media within 90
# Recipient is granted the right to make copies in any form for
# internal distribution and to freely use the information supplied
# in the creation of products supporting Unicode. Unicode, Inc.
# specifically excludes the right to re-distribute this file directly
# to third parties or other organizations whether for profit or not.
# This table contains the data Metis and Taligent currently have on how
# GB12345-90 characters map into Unicode.
# Format: Three tab-separated columns
# Column #1 is the GB12345 code (in hex as 0xXXXX)
# Column #2 is the Unicode code point (in hex as 0xXXXX)
# Column #3 is the Unicode name (follows a comment sign, '#')
# The official names for Unicode characters U+4E00
# to U+9FA5, inclusive, are "CJK UNIFIED IDEOGRAPH-XXXX",
# where XXXX is the code point. Including all these
# names in this file increases its size substantially
# and needlessly. The token "<CJK>" is used for the
# name of these characters. If necessary, it can be
# expanded algorithmically by a parser or editor.
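#
# The expansion of the "<CJK>" token described above can be sketched as
# follows. This is a minimal illustration in Python, not part of the
# original table: the helper names (expand_name, parse_line) are
# hypothetical, and the sample line in the usage note is illustrative,
# not copied from the table.

```python
# Range for which the header says the name is "CJK UNIFIED IDEOGRAPH-XXXX".
CJK_FIRST, CJK_LAST = 0x4E00, 0x9FA5


def expand_name(ucs: int, name: str) -> str:
    """Replace the "<CJK>" token with the full algorithmic name."""
    if name == "<CJK>" and CJK_FIRST <= ucs <= CJK_LAST:
        return f"CJK UNIFIED IDEOGRAPH-{ucs:04X}"
    return name


def parse_line(line: str):
    """Split one three-column, tab-separated entry into (gb, ucs, name)."""
    gb_hex, ucs_hex, comment = line.rstrip("\n").split("\t", 2)
    ucs = int(ucs_hex, 16)             # int() accepts the "0x" prefix in base 16
    name = comment.lstrip("#").strip()  # the name follows the comment sign
    return int(gb_hex, 16), ucs, expand_name(ucs, name)
```

# Usage, assuming an entry of the form "0x3021<TAB>0x554A<TAB># <CJK>":
# parse_line() would return the two codes as integers and the name
# "CJK UNIFIED IDEOGRAPH-554A". Names other than "<CJK>" pass through
# unchanged.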
# The entries are in GB12345 order
# The following algorithms can be used to change the hex form
# of GB12345 to other standard forms:
# To change hex to EUC form, add 0x8080
# To change hex to kuten form, first subtract 0x2020. Then
# the high and low bytes correspond to the ku and ten of
# the kuten form. For example, 0x2121 -> 0x0101 -> 0101;
# 0x777E -> 0x575E -> 8794
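#
# The two conversions above can be written out as a short sketch. The
# function names (to_euc, to_kuten) are hypothetical and chosen only for
# this illustration; the arithmetic follows the rules stated in the
# comment, and the kuten form is rendered as four decimal digits as in
# the examples given there.

```python
def to_euc(gb: int) -> int:
    """Hex form -> EUC form: add 0x8080 (sets the high bit of both bytes)."""
    return gb + 0x8080


def to_kuten(gb: int) -> str:
    """Hex form -> kuten form: subtract 0x2020, then read each byte in decimal."""
    value = gb - 0x2020
    ku = value >> 8       # high byte is the ku (row)
    ten = value & 0xFF    # low byte is the ten (cell)
    return f"{ku:02d}{ten:02d}"
```

# With these definitions, to_kuten(0x2121) gives "0101" and
# to_kuten(0x777E) gives "8794", matching the worked examples, while
# to_euc(0x2121) gives 0xA1A1.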
# Any comments or problems, contact <John_Jenkins@taligent.com>
0x2015 = 0x212A # fallback -> 0x2014
0x30FB = 0x2124 # fallback -> 0x00B7
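#
# A reader of entries in the "source = destination" form used above might
# look like the sketch below. It assumes, from the file name UCS%GB12345,
# that the left-hand value is the UCS code and the right-hand value is the
# GB12345 code, and that SRC_ZONE bounds the left-hand values. The
# trailing "# fallback" comment is skipped, not interpreted. All names
# here (ENTRY, SRC_ZONE, parse_entry, in_src_zone) are hypothetical.

```python
import re

# Matches "0xXXXX = 0xYYYY", ignoring anything after the second number.
ENTRY = re.compile(r"^\s*(0x[0-9A-Fa-f]+)\s*=\s*(0x[0-9A-Fa-f]+)")

# Bounds copied from the SRC_ZONE line of this file.
SRC_ZONE = (0x00A4, 0xFFE5)


def parse_entry(line: str):
    """Return (source, destination) as integers, or None for other lines."""
    match = ENTRY.match(line)
    if match is None:
        return None
    return int(match.group(1), 16), int(match.group(2), 16)


def in_src_zone(code: int) -> bool:
    """Check that a source code lies inside the declared zone (inclusive)."""
    return SRC_ZONE[0] <= code <= SRC_ZONE[1]
```

# For the two entries above, parse_entry() yields (0x2015, 0x212A) and
# (0x30FB, 0x2124); both source values fall inside the declared zone.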