Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

CLDR-17921 merge DDL v47 branch to main #4087

Open
wants to merge 4 commits into
base: main
Choose a base branch
from
Open

CLDR-17921 merge DDL v47 branch to main #4087

wants to merge 4 commits into from

Conversation

srl295
Copy link
Member

@srl295 srl295 commented Sep 27, 2024

To review:

  • please go to the Commits tab and add comments on the individual PRs or tickets.

To update this branch:

$ git fetch upstream
$ git merge -m 'CLDR-17921 merge from main' upstream/main
$ git push

CLDR-17921

  • This PR completes the ticket.

ALLOW_MANY_COMMITS=true
DISABLE_JIRA_ISSUE_MATCH=true

conradarcturus and others added 2 commits September 18, 2024 10:21
If we re-run ConvertLanguageData on unrelated data, it will update the order and values of some other data -- this fixes inconsistencies with the XML outputs to match expectations. The biggest change was updating values in `language_script.tsv` to demote script variations to secondary when they really are not expected. Furthermore I added explicit annotations to `country_language_population.tsv` when the writing system for a country was a variant.

Scripts ran:

 mvn package -DskipTests=true
 java -jar tools/cldr-code/target/cldr-code.jar ConvertLanguageData
 java -jar tools/cldr-code/target/cldr-code.jar GenerateLikelySubtags
@srl295 srl295 added the ddl DDL-SC specific work label Sep 27, 2024
@srl295 srl295 marked this pull request as draft September 27, 2024 17:19
Copy link
Member

@macchiati macchiati left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@macchiati
Copy link
Member

macchiati commented Sep 30, 2024 via email

El Salvador has a number of local languages not yet included in CLDR -- this adds them.

See https://unicode-org.atlassian.net/browse/CLDR-11567 and https://translatorswithoutborders.org/language-data-for-el-salvador

Re-generated the xml with mvn package -DskipTests=true &&  java -jar tools/cldr-code/target/cldr-code.jar ConvertLanguageData &&  java -jar tools/cldr-code/target/cldr-code.jar GenerateLikelySubtags
`zlm` and `apd` are not languages of Togo -- there's a copy-paste error in the Likely Subtags overrides.

This fixes `apd` (Sudanese Arabic) by adding population counts -- I'll note that the ethnologue estimate is 48,000,000 million people, but the last Sudan census we have in the data shows only ~41 million people in the country -- so I want to be cautious. Since I had problems finding a reliable census I just duplicated the Standard Arabic value since likely most people in Sudan that use Arabic speak the Sudanese dialect but write Standard Arabic/ar.

For `zlm` -- Malay (individual language) that's even harder to get a thorough population value so I just fixed the override entry in GenerateLikelySubtags.
@srl295 srl295 changed the title 🚧 CLDR-17921 — preview of DDL v47 branch CLDR-17921 merge DDL v47 branch to main Oct 2, 2024
@srl295 srl295 marked this pull request as ready for review October 2, 2024 21:13
@srl295
Copy link
Member Author

srl295 commented Oct 2, 2024

any concerns with merging? maybe confirm at TC or infra?

Copy link
Contributor

@conradarcturus conradarcturus left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for merging the changes back to main! After you merge it I'll rebase my PRs in progress.

@srl295
Copy link
Member Author

srl295 commented Oct 14, 2024

@macchiati any objection on merging this to main?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
ddl DDL-SC specific work
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants