Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Speaker text for Dusha #3

Open
TPODAvia opened this issue Apr 22, 2023 · 2 comments
Open

Speaker text for Dusha #3

TPODAvia opened this issue Apr 22, 2023 · 2 comments

Comments

@TPODAvia
Copy link

Hello. Where can I get text data in the Dusha dataset?

@artsokol
Copy link
Collaborator

Hi! We have transcriptions for the crowd part only for the time being...

@kondrat1997
Copy link
Collaborator

Hi! As Artem correctly stated, we have uploaded the text only for crowd part of the dataset, which can be found in the crowd.zip. During the preparation of the dataset, we asked the first group of people to pronounce these texts, however, the second group of annotators only assessed the emotion of the utterance, without taking into account the correspondence between the spoken text and what was supposed to be pronounced.

We did not transcribe the texts from the podcasts part of the dataset, so we did not upload any texts for it. However, we plan to recognize all utterances through our ASR and share the synthetic annotations

@kondrat1997 kondrat1997 reopened this Apr 28, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants