Add Sephardic IPA schema #75

Open
charlesLoder opened this issue Aug 6, 2023 · 5 comments

Comments

@charlesLoder
Owner

It could be cool to have a Sephardic schema, but with IPA characters.

@charlesLoder charlesLoder added this to the v2.5.0 milestone Aug 12, 2023
@charlesLoder charlesLoder modified the milestones: v2.5.0, v2.6.0 Aug 25, 2023
@johnlockejrr

Very cool, but... which Sephardic? Spanish, Italian, Maghrebi, etc.? The Sephardic reading tradition, like the Ashkenazic, is not a unitary tradition.

@charlesLoder
Owner Author

Great question! And the answer is, it depends.

For context, the overall idea is to produce something closer to a transcription (à la the Tiberian schema) so it can be fed through a Text2Speech tool like Google's or Amazon's.

The trick is, however, that I have to find a language model that will accept IPA characters. Amazon has a generic Arabic model which is probably close enough.

I jotted this down a little while ago, but it has fallen by the wayside for the moment.

@charlesLoder charlesLoder removed this from the v2.6.0 milestone Jan 5, 2024
@johnlockejrr

Did you find a model that accepts IPA? I'm thinking of training one for Tiberian.

@charlesLoder
Owner Author

Most of the TTS models accept IPA characters via SSML. The one with the best results (imo) was Azure's AI Speech.

Using SSML, I could give it this, using Tiberian pronunciation:

<speak version="1.0" xmlns="http://www.w3.org/2001/10/synthesis" xmlns:mstts="http://www.w3.org/2001/mstts" xml:lang="he-IL" style="documentary-narration">
    <voice name="he-IL-AvriNeural">
        <prosody rate="slow">
            <phoneme alphabet="ipa" ph="və.ha.ˈa.ʀɛts">
                וְהָאָ֗רֶץ
            </phoneme>
            <break time="250ms"/>
            <phoneme alphabet="ipa" ph="ha.jə.ˈtaː">
                הָיְתָ֥ה
            </phoneme>
            <break time="100ms"/>
            <phoneme alphabet="ipa" ph="ˈto.huː">
                תֹ֙הוּ֙
            </phoneme>
            <break time="100ms"/>
            <phoneme alphabet="ipa" ph="va.ˈvo.huː">
                וָבֹ֔הוּ
            </phoneme>
            <break time="250ms"/>
            <phoneme alphabet="ipa" ph="və.ˈxo.ʃɛx">
                וְחֹ֖שֶׁךְ
            </phoneme>
            <break time="250ms"/>
            <phoneme alphabet="ipa" ph="al.pə.ˈneɪ">
                עַל־פְּנֵ֣י
            </phoneme>
            <break time="100ms"/>
            <phoneme alphabet="ipa" ph="tə.ˈhoːm">
                תְהֹ֑ום
            </phoneme>
            <break time="250ms"/>
            <phoneme alphabet="ipa" ph="və.ˈʀuː.ax">
                וְר֣וּחַ
            </phoneme>
            <break time="100ms"/>
            <phoneme alphabet="ipa" ph="ɛ.lo.ˈhiːm">
                אֱלֹהִ֔ים
            </phoneme>
            <break time="250ms"/>
            <phoneme alphabet="ipa" ph="mə.ʀa.ˈxɛ.fɛt">
                מְרַחֶ֖פֶת
            </phoneme>
            <break time="250ms"/>
            <phoneme alphabet="ipa" ph="al.pə.ˈneɪ">
                עַל־פְּנֵ֥י
            </phoneme>
            <break time="100ms"/>
            <phoneme alphabet="ipa" ph="ha.ˈma.jiːm">
                הַמָּֽיִם׃
            </phoneme>
        </prosody>
    </voice>
</speak>

But the results were never quite good enough.

Khan's book and Kantor's site have audio, but I'm not sure if there is enough to train a model.

But don't let my ignorance stop you!
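
For anyone who wants to experiment with SSML like the sample above without writing it by hand, here is a minimal Python sketch that builds the same `<speak>`/`<voice>`/`<prosody>`/`<phoneme>` structure from a word list. The function name `build_ssml`, the `WORDS` list, and the tuple shape (Hebrew text, IPA, pause in milliseconds) are hypothetical, not part of any library or of this project. The `xmlns` value is the namespace URI from the W3C SSML recommendation. The sketch only produces a string; sending it to a TTS service is left out.

```python
from xml.sax.saxutils import escape, quoteattr

# Hypothetical input shape: (Hebrew text, IPA string, pause in ms after the word, or None).
# The three entries are taken from the SSML sample in this thread.
WORDS = [
    ("וְהָאָ֗רֶץ", "və.ha.ˈa.ʀɛts", 250),
    ("הָיְתָ֥ה", "ha.jə.ˈtaː", 100),
    ("תֹ֙הוּ֙", "ˈto.huː", None),
]


def build_ssml(words, voice="he-IL-AvriNeural", lang="he-IL", rate="slow"):
    """Return an SSML document that wraps each word in a <phoneme> element."""
    lines = [
        '<speak version="1.0" xmlns="http://www.w3.org/2001/10/synthesis" '
        f"xml:lang={quoteattr(lang)}>",
        f"    <voice name={quoteattr(voice)}>",
        f"        <prosody rate={quoteattr(rate)}>",
    ]
    for text, ipa, pause_ms in words:
        # quoteattr/escape keep the output well-formed if the input contains &, < or quotes.
        lines.append(
            f'            <phoneme alphabet="ipa" ph={quoteattr(ipa)}>{escape(text)}</phoneme>'
        )
        if pause_ms is not None:
            lines.append(f'            <break time="{int(pause_ms)}ms"/>')
    lines += ["        </prosody>", "    </voice>", "</speak>"]
    return "\n".join(lines)


if __name__ == "__main__":
    print(build_ssml(WORDS))
```

Keeping the IPA in a plain list like this would make it easier to swap in output from a transliteration schema and compare how different voices handle the same phonemes.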

@charlesLoder
Owner Author

Also, see #87 for some more on this.
