-
-
Notifications
You must be signed in to change notification settings - Fork 32
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Is Russian to ASCII correct? #3
Comments
http://translit.cc/ for rescue, this specific case will be "lysaja gora". |
Thanks @kalessil I have merged the suggestions from "translit.ru" and "translit.cc" here: https://github.com/voku/portable-ascii/blob/master/src/voku/helper/data/ascii_by_languages.php#L468 |
There are multiple ways to tranlit and there are many GOSTs (state standards). Using the one for passports would produce "lysaia gora". |
I would use this one, actually. It is GOSTed (i.e. standardized). |
@Hlaford the problem is that there are multiple standards that are not deprecated. |
Do different standards all have the same language code? |
Yes. |
Hmm, I guess we just need to choose a standard. |
No, that won't work well. If it's something like generating slugs for an article title, that doesn't matter much which standard is used. If it's passport names, your users may get in trouble if you aren't using correct standard. If your service if exchanging info with state road police, it matters to use correct standard for road signs and street names. |
So solution would be allowing to choose a standard. |
+1 for "So solution would be allowing to choose a standard." |
@samdark Ok, so what standards do a normal application / user need? "Passport (2013)" + "GOST 7.79-2000(B)"?https://en.m.wikipedia.org/wiki/Romanization_of_Russian#content-collapsible-block-1 |
Yes. Would be enough for many typical cases. |
@samdark I added mappings for "ru__passport_2013" && "ru__gost_2000_b" (copy&past from wikipedia). Can you please take a look at it, here is a test case for the string "лысая гора": a2b1920 Mappings:
|
Looks OK to me. |
Thank you for the review & feedback. 👍 |
Example: Is this correct?
"лысая гора" -> "lysaja gora"
Can a native speaker, please check the character-replacement, thanks.
https://github.com/voku/portable-ascii/blob/master/src/voku/helper/data/ascii_by_languages.php#L468
The text was updated successfully, but these errors were encountered: