Dusha
Dusha Crowd, Dusha Podcast
AudioTextsCustomIntroduced 2022-12-23
Dusha is a dataset for speech emotion recognition (SER) tasks. The corpus contains approximately 350 hours of data, more than 300 000 audio recordings with Russian speech and their transcripts. It is annotated using a crowd-sourcing platform and includes two subsets: acted and real-life.
Source: Large Raw Emotional Dataset with Aggregation Mechanism
Related Benchmarks
Dusha Crowd/Emotion Recognition/Macro F1Dusha Crowd/Emotion Recognition/UADusha Crowd/Emotion Recognition/WADusha Crowd/Speech Emotion Recognition/Macro F1Dusha Crowd/Speech Emotion Recognition/UADusha Crowd/Speech Emotion Recognition/WADusha Podcast/Emotion Recognition/Macro F1Dusha Podcast/Emotion Recognition/UADusha Podcast/Emotion Recognition/WADusha Podcast/Speech Emotion Recognition/Macro F1Dusha Podcast/Speech Emotion Recognition/UADusha Podcast/Speech Emotion Recognition/WA