1 AkH-14-a1 acél ; The "AkH" tests are from:
3 AkH-14-a1 csók ; A magyar helyesírás szabályai, 12. kiadás
4 AkH-14-a1 gép ; [The Rules of Hungarian Orthography, 12th edition]
6 AkH-14-a1 kettő ; often referred to as akadémiai helyesírás (AkH.) [academic orthography]
8 AkH-14-a1 nyúl ; http://helyesiras.mta.hu/helyesiras/default/akh12
10 AkH-14-a1 öröm ; Alphabetical ordering described in #14-16.
12 AkH-14-a1 sokáig ; #14-a1: Sort based on first letter.
16 AkH-14-a2 jácint ; #14-a2: If no other difference, lowercase initial precedes uppercase.
24 AkH-14-a3 cudar ; #14-a3: Compound letters (cs, dz, dzs, gy, ly, ny, sz, ty, zs)
25 AkH-14-a3 cukor ; are sorted separately, after their first letter:
26 AkH-14-a3 cuppant ; a b c cs d dz dzs e f g gy h ... l ly m n ny o ... s sz t ty u ... z zs
36 AkH-14-b1 lom ; #14-b1: The first difference matters.
51 AkH-14-b2 kas ; #14-b2: If a compound letter is pronounced long, only the first letter
52 AkH-14-b2 Kasmír ; is duplicated in writing: <cs><cs> becomes ccs, <dzs><dzs> is ddzs etc.
53 AkH-14-b2 Kassák ; (unless it's at the boundary of a compound word where it's written out twice).
54 AkH-14-b2 kastély ; Sort according to the actual tokens, not the shorthand written form.
55 AkH-14-b2 kasza ; <k><a><sz><a>
56 AkH-14-b2 kaszinó ; <k><a><sz><i><n><ó>
57 AkH-14-b2 kassza ; <k><a><sz><sz><a>
58 AkH-14-b2 kaszt ; <k><a><sz><t>
63 AkH-14-b2 meny ; <m><e><ny>
64 AkH-14-b2 Menyhért ; <M><e><ny><h><é><r><t>
65 AkH-14-b2 mennybolt ; <m><e><ny><ny><b><o><l><t>
66 AkH-14-b2 mennyi ; <m><e><ny><ny><i>
67 AkH-14-b2 nagy ; <n><a><gy>
68 AkH-14-b2 naggyá ; <n><a><gy><gy><á>
69 AkH-14-b2 nagygyakorlat ; <n><a><gy><gy><a><k><o><r><l><a><t> (compound word: nagy+gyakorlat)
70 AkH-14-b2 naggyal ; <n><a><gy><gy><a><l>
71 AkH-14-b2 nagyít ; <n><a><gy><í><t>
75 AkH-14-c1 ír ; #14-c1: Vowels collate equally in pairs: a-á, e-é, i-í, o-ó, ö-ő, u-ú, ü-ű.
84 AkH-14-c2 Eger ; #14-c2: Short vowel (unaccented, or with diaeresis) comes first if that's the only difference.
102 AkH-14-d1 kis részben ; #14-d1: Spaces, hyphens are ignored.
105 AkH-14-d1 kis sorozat
106 AkH-14-d1 kissorozat-gyártás
107 AkH-14-d1 kis számban
112 AkH-14-d1 márvány sírkő
113 AkH-14-d1 Márvány-tenger
114 AkH-14-d1 márványtömb
115 AkH-14-d1 Márvány Zsolt
120 AkH-14-d1 Tisza Kálmán
121 AkH-14-d1 Tisza menti
126 AkH-15 cérna ; #15: Foreign accents are ignored, unless they're the only difference,
127 AkH-15 Černý ; in which case they are sorted after the Hungarian ones (in unspecified order).
150 alphabet a ; All the remaining tests were added by glibc.
152 alphabet aa ; a = á unless that's the only difference in which case a < á.
153 alphabet aá ; (Same for e = é, i = í, o = ó, ö = ő, u = ú, ü = ű below.)
154 alphabet áa ; Differences in accents matter from left to right.
161 alphabet cs ; <cs> -- or rarely <c><s>, can't tell for sure, assume <cs>.
162 alphabet csc ; <cs><c>
163 alphabet ccs ; <cs><cs> -- or rarely <c><cs>, can't tell for sure, assume <cs><cs>.
164 alphabet cscs ; <cs><cs> -- Make sure ccs and cscs don't collate as equal, see bug 13547.
165 alphabet ccsa ; <cs><cs><a> -- The order of ccs and cscs is not specified in the rules and is arbitrarily chosen by glibc.
166 alphabet cscsa ; <cs><cs><a>
167 alphabet csd ; <cs><d> -- (These comments also apply to all other compound letters below.)
170 alphabet dzd ; <dz><d>
171 alphabet ddz ; <dz><dz>
172 alphabet dzdz ; <dz><dz>
173 alphabet ddza ; <dz><dz><a>
174 alphabet dzdza ; <dz><dz><a>
175 alphabet dzdzs ; <dz><dzs>
176 alphabet dze ; <dz><e>
177 alphabet dzz ; <dz><z>
179 alphabet dzsdz ; <dzs><dz>
180 alphabet ddzs ; <dzs><dzs>
181 alphabet dzsdzs ; <dzs><dzs>
182 alphabet ddzsa ; <dzs><dzs><a>
183 alphabet dzsdzsa ; <dzs><dzs><a>
184 alphabet dzse ; <dzs><e>
197 alphabet gyg ; <gy><g>
198 alphabet ggy ; <gy><gy>
199 alphabet gygy ; <gy><gy>
200 alphabet ggya ; <gy><gy><a>
201 alphabet gygya ; <gy><gy><a>
202 alphabet gyh ; <gy><h>
217 alphabet lyl ; <ly><l>
218 alphabet lly ; <ly><ly>
219 alphabet lyly ; <ly><ly>
220 alphabet llya ; <ly><ly><a>
221 alphabet lylya ; <ly><ly><a>
222 alphabet lym ; <ly><m>
227 alphabet nyn ; <ny><n>
228 alphabet nny ; <ny><ny>
229 alphabet nyny ; <ny><ny>
230 alphabet nnya ; <ny><ny><a>
231 alphabet nynya ; <ny><ny><a>
232 alphabet nyo ; <ny><o>
241 alphabet ö ; ö = ő (unless that's the only difference), but these come strictly after o and ó.
254 alphabet szs ; <sz><s>
255 alphabet ssz ; <sz><sz>
256 alphabet szsz ; <sz><sz>
257 alphabet ssza ; <sz><sz><a>
258 alphabet szsza ; <sz><sz><a>
259 alphabet szt ; <sz><t>
263 alphabet tyt ; <ty><t>
264 alphabet tty ; <ty><ty>
265 alphabet tyty ; <ty><ty>
266 alphabet ttya ; <ty><ty><a>
267 alphabet tytya ; <ty><ty><a>
268 alphabet tyu ; <ty><u>
277 alphabet ü ; ü = ű (unless that's the only difference), but these come strictly after u and ú.
292 alphabet zsz ; <zs><z>
293 alphabet zzs ; <zs><zs>
294 alphabet zszs ; <zs><zs>
295 alphabet zzsa ; <zs><zs><a>
296 alphabet zszsa ; <zs><zs><a>
297 case a ; #14-a2 specifies that if the same word appears in lowercase as well as with
298 case A ; uppercase initial, the lowercase one is to be sorted first.
299 case á ; Arbitrarily extend this to all other weird combinations of upper- and lowercases in compound letters.
333 case ddzs ; <dzs><dzs>
445 foreign-a1 á ; More thorough tests for foreign accents (#15).
446 foreign-a1 à ; Each test consists of 4 lines. The foreign accent is in the middle two.
447 foreign-a1 àp ; That is, on their own they come after the Hungarian accent, but a
448 foreign-a1 áq ; subsequent difference (p and q) overrides this.
517 foreign-o1 ó ; The rules are not explicit whether foreign accents on top of o or u
518 foreign-o1 ò ; should be sorted among o-ó and u-ú, or among ö-ő and ü-ű, but the
519 foreign-o1 òp ; AkH #15 example with Møsstrand implicitly shows that it's the former.