You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi! As Artem correctly stated, we have uploaded the text only for crowd part of the dataset, which can be found in the crowd.zip. During the preparation of the dataset, we asked the first group of people to pronounce these texts, however, the second group of annotators only assessed the emotion of the utterance, without taking into account the correspondence between the spoken text and what was supposed to be pronounced.
We did not transcribe the texts from the podcasts part of the dataset, so we did not upload any texts for it. However, we plan to recognize all utterances through our ASR and share the synthetic annotations
Hello. Where can I get text data in the Dusha dataset?
The text was updated successfully, but these errors were encountered: