1 # $NetBSD: src/share/i18n/csmapper/KS/KSC5601%UCS.src,v 1.3 2003/09/26 17:58:42 tshiozak Exp $
2 # $DragonFly: src/share/i18n/csmapper/KS/Attic/KSC5601%UCS.src,v 1.1 2005/03/10 16:19:55 joerg Exp $
6 SRC_ZONE 0x21-0x7E / 0x21-0x7E / 8
13 # This mapping data is made from the mapping data provided by Unicode, Inc.
16 # Name: Unified Hangul (KS X 1001) to Unicode table
17 # Unicode version: 2.0
19 # Table format: Format A
21 # Authors: Jungshik Shin at jshin@pantheon.yale.edu
24 # This file is provided as-is by Unicode, Inc. (The Unicode Consortium).
25 # No claims are made as to fitness for any particular purpose. No
26 # warranties of any kind are expressed or implied. The recipient
27 # agrees to determine applicability of information provided. If this
28 # file has been provided on magnetic media by Unicode, Inc., the sole
29 # remedy for any claim will be exchange of defective media within 90
32 # Recipient is granted the right to make copies in any form for
33 # internal distribution and to freely use the information supplied
34 # in the creation of products supporting Unicode. Unicode, Inc.
35 # specifically excludes the right to re-distribute this file directly
36 # to third parties or other organizations whether for profit or not.
38 # What is enclosed below is the mapping between KS X 1001(KS C 5601-1987
39 # and Unicode 2.0. It's automatically generated from KSC5601.TXT
40 # (at ftp://ftp.unicode.org/Public/MAPPING/EASTASIA/KSC) which is
41 # actually NOT the mapping between KS X 1001(KS C 5601-1992) and Unicode 2.0
42 # BUT the mapping table between UHC (Microsoft Unified Hangul Code)
43 # and Unicode 2.0. Hence, in this pacakge, I renamed it as UHC.TXT
45 # Please, note that there was a change in naming scheme of
46 # Korean standard for information exchange.
47 # What used to be in KS C 5[6-8]xx are now in KS X xxxx.
48 # See http://pantheon.yale.edu/~jshin/faq/qa8.html for more details.
50 # The Unix command used is
51 # egrep '^0x' < KSC5601.TXT | \
52 # egrep -v '^0x([8-9]...|A0..|..[4-9].|..A0)' | perl tab.pl
54 # where tab.pl is as following
58 # local($euck, $ucs4, @rest) = split;
59 # local($u)=hex($ucs4);
60 # local($k)=hex($euck);
61 # printf ("0x%04X 0x%04X %s\n",$k-0x8080, $u,join(' ',@rest));
64 # Column #1 : KS X 1001(KS C 5601-1992 excluding addtional Hangul
65 # syllables defined for Johab encoding in Annex 3)
67 # Column #2 : the Unicode (in hex as 0xXXXX)
68 # Column #3 : the Unicode name (following a comment sign, '#')
69 # The number of characters enumerated in this table is 8824,
70 # as listed in KS X 1001
72 # The entries are in KS X 1001 order
73 # You can use the following algorithms to convert the hex form
74 # of KS X 1001 to other forms
75 # To get EUC Korean(EUC-KR) code points, add 0x8080.
76 # To get row(Hang) and column(Yol) as used in KS X 1001 manual,
77 # first subtract 0x2020. Then
78 # the high and low bytes correspond to the row(Hang) and the column(Yol),