-
-
Notifications
You must be signed in to change notification settings - Fork 42
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Tolong Siki #732
Tolong Siki #732
Conversation
11DD6;TOLONG SIKI LETTER X;Lo;0;L;;;;;N;;;;; | ||
11DD7;TOLONG SIKI LETTER RR;Lo;0;L;;;;;N;;;;; | ||
11DD8;TOLONG SIKI LETTER RRH;Lo;0;L;;;;;N;;;;; | ||
11DD9;TOLONG SIKI SIGN SELA;Lo;0;L;;;;;N;;;;; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This is a vowel length mark, see p. 5 and the chart on p. 13 of the proposal L2/23-024.
I think this should be gc=Lm, and in PropList.txt, should have the Diacritic and Extender properties (like U+02D0).
The proposal bundles U+11DDA with it. For that one I don’t know whether it is more appropriate to treat it as a letter that is a glottal stop or as modifier that does mysterious syllable coda things. @Ken-Whistler, what do you think?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I don't think it (hecakā) is a vowel length mark, as contrasted with selā, which is. The glottal stop seems to be a full segment, although perhaps in restricted distribution. See the discussion of Kurukh and Brahui phonology on Wikipedia, for instance, which shows length distinctions (and in some cases nasalization distinctions) for vowels, which when marked in the orthography can be considered vowel modifiers and/or extenders, etc. But I think it is safest to just leave the glottal stop as Lo and treat it as another letter.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Does this change also require changes to LineBreak.txt?
Based on feedback from @eggrobin and @Ken-Whistler : - set gc=Lm (UnicodeData.txt) - set Diacritic and Extender (PropsList.txt)
See UTC-174-C22: Provisionally assign 54 code points U+11DB0..U+11DE9 in a new Tolong Siki block at U+11DB0..U+11DEF, for 54 Tolong Siki characters as described in L2/23-024 and section 6 of L2/23-012.
https://github.com/unicode-org/sah/issues/225
https://github.com/unicode-org/utc-release-management/issues/21
[UTC-181-C61] Consensus: UTC accepts for encoding in Unicode 17.0 the following 228 characters for new scripts for which code points have previously been provisionally assigned: 50 characters for Beria Erfe script, 40 characters for Chisoi script, 29 characters for Sidetic script, 55 characters for Tai Yo script, and 54 characters for Tolong Siki script.