Public Git Hosting - glibc.git/commit

commit	aceda10bd5131cf716830827d66da9c671dec649
author	Mike FABIAN <mfabian@redhat.com>
	Thu, 1 Jun 2023 15:02:44 +0000 (1 17:02 +0200)
committer	Mike FABIAN <mfabian@redhat.com>
	Thu, 21 Sep 2023 08:34:35 +0000 (21 10:34 +0200)
tree	85fce5090f9abd310d3a9e0f0e9748d729f98ac8	tree \| snapshot (tar.gz zip)
parent	672b91ba1060887aa8897d0b98af83b96d4a52b0	commit \| diff

Adapt collation in th_TH locale to use the iso14651_t1_common file and sync the collation with CLDR

I made it to agree as much as possible with the rules from CLDR (see:
https://github.com/unicode-org/cldr/blob/main/common/collation/th.xml).

It seems to be impossible to follow the CLDR rules

&[before 1]๚<ฯ # should be "variable"

and

&๛<ๆ # should be "variable"

exactly though. These ask for a primary difference in punctuation
characters whose primary weight should be "IGNORE". But using a
secondary differnence instead still sorts the test data correctly and
the previously used collation in th_TH used tertiary differences for
these characters.

There was old localedata/th_TH.in test data in TIS-620 encoding which
was not used (it was not in the localedata/Makefile). I converted this
to UTF-8 and moved it to localedata/th_TH.UTF-8.in and added it to
localedata/Makefile.

Using the existing collation rules in the th_TH locale did not sort that
test file completely correct, I think my new collation rules based on
iso14651_t1 are better.

localedata/Makefile		diff \| blob \| blame \| history
localedata/locales/th_TH		diff \| blob \| blame \| history
localedata/th_TH.UTF-8.in	[new file with mode: 0644]	blob