Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

地域詞彙Regional Lexicon for CN/HK/MO/MY/SG/TW #791

Open
wants to merge 5 commits into
base: master
Choose a base branch
from

Conversation

groverlynn
Copy link
Contributor

@groverlynn groverlynn commented Jul 16, 2023

Region-specific vocabulary on (1) toponym, (2) science and technology, (3) cultural and others
for each of the six regions (Mainland China/CN, Hong Kong/HK, Macau/MO, Malaysia/MY, Singapore/SG, and Taiwan/TW). resolves #506

Regarding variants, zh-MY and zh-SG adopt the same zh-Hans script as zh-CN; while zh-MO adopts the script of zh-HK.
However, zh-MY actually prefers not to merge multiple zh-Hant characters into one zh-Hans character (e.g. 髮 -> 发 but 發 -> 發). This is not yet supported.

new configurations:

  • s2cnp (adopt CN lexicon for any simplified Chinese)

  • s2myp (adopt MY lexicon for any simplified Chinese)

  • s2sgp (adopt SG lexicon for any simplified Chinese)

  • s2hkp (transform simplified Chinese to Traditional Chinese HK variants with HK lexicon)

  • s2mop (transform simplified Chinese to Traditional Chinese HK variants with MO lexicon)

  • s2twp (transform simplified Chinese to Traditional Chinese TW variants with TW lexicon)

  • t2hkp (adopt HK lexicon and HK variants for any traditional Chinese)

  • t2mop (adopt MO lexicon and HK variants for any traditional Chinese)

  • t2twp (adopt TW lexicon and TW variants for any traditional Chinese)

  • t2cnp (transform traditional Chinese to simplified Chinese with CN lexicon)

  • t2myp (transform traditional Chinese to simplified Chinese with MY lexicon)

  • t2sgp (transform traditional Chinese to simplified Chinese with SG lexicon)

  • hk2cnp (transform traditional Chinese HK variants to simplified Chinese with CN lexicon)

  • hk2myp (transform traditional Chinese HK variants to simplified Chinese with MY lexicon)

  • hk2sgp (transform traditional Chinese HK variants to simplified Chinese with SG lexicon)

  • tw2myp (transform traditional Chinese TW variants to simplified Chinese with MY lexicon)

  • tw2cnp (transform traditional Chinese TW variants to simplified Chinese with CN lexicon)

  • tw2sgp (transform traditional Chinese TW variants to simplified Chinese with SG lexicon)

region-specific vocabulary on (1) toponym, (2) science and technology, (3) cultural and others
for each of the six regions (Mainland China/CN, Hong Kong/HK, Macau/MO, Malaysia/MY, Singapore/SG, and Taiwan/TW)
@BYVoid
Copy link
Owner

BYVoid commented Jul 26, 2024

Where do they come from? Are they licensed?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants