Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Tolong Siki #732

Merged
merged 13 commits into from
Nov 15, 2024
Merged

Tolong Siki #732

merged 13 commits into from
Nov 15, 2024

Conversation

josh-hadley
Copy link
Collaborator

@josh-hadley josh-hadley commented Mar 10, 2024

See UTC-174-C22: Provisionally assign 54 code points U+11DB0..U+11DE9 in a new Tolong Siki block at U+11DB0..U+11DEF, for 54 Tolong Siki characters as described in L2/23-024 and section 6 of L2/23-012.

https://github.com/unicode-org/sah/issues/225
https://github.com/unicode-org/utc-release-management/issues/21

[UTC-181-C61] Consensus: UTC accepts for encoding in Unicode 17.0 the following 228 characters for new scripts for which code points have previously been provisionally assigned: 50 characters for Beria Erfe script, 40 characters for Chisoi script, 29 characters for Sidetic script, 55 characters for Tai Yo script, and 54 characters for Tolong Siki script.

11DD6;TOLONG SIKI LETTER X;Lo;0;L;;;;;N;;;;;
11DD7;TOLONG SIKI LETTER RR;Lo;0;L;;;;;N;;;;;
11DD8;TOLONG SIKI LETTER RRH;Lo;0;L;;;;;N;;;;;
11DD9;TOLONG SIKI SIGN SELA;Lo;0;L;;;;;N;;;;;
Copy link
Member

@eggrobin eggrobin Mar 11, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is a vowel length mark, see p. 5 and the chart on p. 13 of the proposal L2/23-024.

I think this should be gc=Lm, and in PropList.txt, should have the Diacritic and Extender properties (like U+02D0).

The proposal bundles U+11DDA with it. For that one I don’t know whether it is more appropriate to treat it as a letter that is a glottal stop or as modifier that does mysterious syllable coda things. @Ken-Whistler, what do you think?

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I don't think it (hecakā) is a vowel length mark, as contrasted with selā, which is. The glottal stop seems to be a full segment, although perhaps in restricted distribution. See the discussion of Kurukh and Brahui phonology on Wikipedia, for instance, which shows length distinctions (and in some cases nasalization distinctions) for vowels, which when marked in the orthography can be considered vowel modifiers and/or extenders, etc. But I think it is safest to just leave the glottal stop as Lo and treat it as another letter.

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Does this change also require changes to LineBreak.txt?

josh-hadley and others added 2 commits March 24, 2024 15:37
Based on feedback from @eggrobin and @Ken-Whistler :
- set gc=Lm (UnicodeData.txt)
- set Diacritic and Extender (PropsList.txt)
@eggrobin eggrobin marked this pull request as ready for review November 8, 2024 21:27
@eggrobin eggrobin requested review from markusicu and removed request for markusicu November 14, 2024 22:59
@eggrobin eggrobin merged commit 96e92df into main Nov 15, 2024
23 of 24 checks passed
@eggrobin eggrobin deleted the jh-tolong-siki branch November 15, 2024 00:41
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants