-
Notifications
You must be signed in to change notification settings - Fork 131
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Auto-transcribe sentences in Lingua Franca Nova's Latin orthography to Cyrillic, and vice versa #1958
Comments
Hello conlangbecca, thank you for your request. The information you gave me looks like such autotranscription system is quite feasible in Tatoeba. To get this done, we will need your help as described in the wiki. Please follow the instruction there and get back to us with the relevant data about Latin/Cyrillic transcription in LFN. |
Following the link you provided, I was able to write a simple conversion algorithm and check it against a few LFN sentence pairs on Tatoeba that were contributed in Latin and Cyrillic. Among these, I found a few sentences where my algorithm gets a different result than what was contributed on Tatoeba:
|
Based on http://www.elefen.org/vici/gramatica/en/spelling_and_pronunciation Included tests based on existing LFN sentence pairs on Tatoeba picked at random. The failing tests show edge cases that require to refine in the algorithm or to rethink of the conversion rules. Refs #1958.
Cyrillic transcriptions might not always be one-for-one with proper names, because sometimes people opt for a phonetic transcription into actual LFN phonology rather than a strict letter-for-letter transcription. The automatic transcription, though, should do the letter-for-letter transcription, which is completely acceptable. |
Thanks for clarifying, @conlangbecca. On a side note, I am mentioning #770 (and #76) because we have a number of LFN sentences that will become duplicates when this issue is solved as they will only differ in terms of script. |
Lingua Franca Nova has two orthographies, one Latin and one Cyrillic, with a one-to-one correspondence between letters in each orthography. A table can be found here: http://www.elefen.org/vici/gramatica/en/spelling_and_pronunciation
The practice up to now has been to contribute in both orthographies wherever possible, but I feel this is inefficient and ignores that there are already systems in place for auto-transcribing between Chinese Simplified and Chinese Traditional, among other orthographies. Could such a system be added for LFN? Thank you.
The text was updated successfully, but these errors were encountered: