diff --git a/docs/ldml/tr35.md b/docs/ldml/tr35.md index a4a5cd7b08e..188db553919 100644 --- a/docs/ldml/tr35.md +++ b/docs/ldml/tr35.md @@ -19,11 +19,6 @@ This document describes an XML format (_vocabulary_) for the exchange of structured locale data. This format is used in the [Unicode Common Locale Data Repository](https://www.unicode.org/cldr/). -_Note:_ -Some links may lead to in-development or older -versions of the data files. -See for up-to-date CLDR release data. - ### _Status_ _This is a draft document which may be updated, replaced, or superseded by other documents at any time. @@ -559,7 +554,11 @@ A _Unicode **BCP 47** locale identifier_ (unicode_script_subtag. - a "root" subtag (the "und" unicode_language_subtag is used instead of "root"). -A well-formed _Unicode BCP 47 locale identifier_ is also a well-formed _BCP 47 language tag_. The reverse, however, is not guaranteed; a well-formed _BCP 47 language tag_ might not be a well-formed _Unicode BCP 47 locale identifier_. +A well-formed _Unicode BCP 47 locale identifier_ is always a well-formed _BCP 47 language tag_. +The reverse, however, is not guaranteed; +a _BCP 47 language tag_ that contains an extlang subtag, an irregular subtag, or an initial 'x' subtag would not be a well-formed _Unicode BCP 47 locale identifier_ +— for details see [BCP 47 Conformance]#bcp-47-conformance). +However, any _BCP 47 language tag_ can easily converted to a _Unicode BCP 47 locale identifier_ as specified in [BCP 47 Language Tag Conversion](#bcp-47-language-tag-conversion). A _Unicode **CLDR** locale identifier_ (`unicode_cldr_locale_id`) is a `unicode_locale_id` that meets the following additional constraints: - [ wfc: The EBNF `sep` is restricted to only [_] in unicode_language_id and `unicode_locale_id`.] @@ -4319,7 +4318,7 @@ Updates to [LDML Conformance](#Conformance) including ### Locale Identifiers and Inheritance Modifications - Clarified definitions of _Unicode BCP 47 locale identifier_ and _Unicode CLDR locale identifier_, moving them to [Unicode CLDR locale identifier](#unicode-locale-identifier) -- Clarified useage of [Special Script Codes](#unicode-locale-identifier). +- Clarified usage of [Special Script Codes](#special-script-codes). - Added definition of [Ordered Elements](#definitions), replacing the obsolete definition of _blocking_ elements. - Clarified the usage of the `path` attribute with [aliases](#element-alias). @@ -4328,7 +4327,7 @@ Updates to [LDML Conformance](#Conformance) including Significant updates to [Message Format](tr35-messageFormat.md#Contents) - Removed all of the reserved and private use syntax constructs, simplifying the grammar. - Changed the structure of the .match (selector) to require use of local or input declarations. This is a breaking change for existing messages. -- Added support for bidirectional isolates and marks and clarified whitespace handling to better enable messages that contains right-to-left identifiers and text. +- Added support for bidirectional isolates and marks, and clarified whitespace handling to better enable messages that contains right-to-left identifiers and text. ### Date Modifications - Added a tech preview section on [semantic skeletons](tr35-dates.md#Semantic_Skeletons), allowing for less data and faster performance in formatting dates.