Dusha

Dusha Crowd, Dusha Podcast

Modalities: Audio, Texts; License: Custom; Introduced: 2022-12-23

Dusha is a dataset for speech emotion recognition (SER). The corpus contains approximately 350 hours of data in more than 300,000 audio recordings of Russian speech, together with their transcripts. It was annotated on a crowd-sourcing platform and consists of two subsets: acted speech (Dusha Crowd) and real-life speech (Dusha Podcast).

Source: Large Raw Emotional Dataset with Aggregation Mechanism

Related Benchmarks

Dusha Crowd/Emotion Recognition/Macro F1
Dusha Crowd/Emotion Recognition/UA
Dusha Crowd/Emotion Recognition/WA
Dusha Crowd/Speech Emotion Recognition/Macro F1
Dusha Crowd/Speech Emotion Recognition/UA
Dusha Crowd/Speech Emotion Recognition/WA
Dusha Podcast/Emotion Recognition/Macro F1
Dusha Podcast/Emotion Recognition/UA
Dusha Podcast/Emotion Recognition/WA
Dusha Podcast/Speech Emotion Recognition/Macro F1
Dusha Podcast/Speech Emotion Recognition/UA
Dusha Podcast/Speech Emotion Recognition/WA
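
The three metrics named in these benchmarks can be illustrated with a minimal sketch. It assumes the convention common in SER work, which this page does not itself confirm: WA (weighted accuracy) is overall accuracy, UA (unweighted accuracy) is the mean of per-class recalls, and Macro F1 is the unweighted mean of per-class F1 scores. The function name `ser_metrics` and the label strings are hypothetical and chosen only for illustration.

```python
from collections import Counter


def ser_metrics(y_true, y_pred):
    """Compute WA, UA and Macro F1 under the assumed SER conventions."""
    labels = sorted(set(y_true) | set(y_pred))
    tp, fp, fn = Counter(), Counter(), Counter()
    for t, p in zip(y_true, y_pred):
        if t == p:
            tp[t] += 1
        else:
            fp[p] += 1  # predicted class gains a false positive
            fn[t] += 1  # true class gains a false negative

    # WA: fraction of all samples classified correctly.
    wa = sum(tp.values()) / len(y_true)

    # UA: mean recall over classes that occur in the ground truth.
    recalls = [tp[c] / (tp[c] + fn[c]) for c in labels if tp[c] + fn[c] > 0]
    ua = sum(recalls) / len(recalls)

    # Macro F1: unweighted mean of per-class F1 = 2TP / (2TP + FP + FN).
    f1s = []
    for c in labels:
        denom = 2 * tp[c] + fp[c] + fn[c]
        f1s.append(2 * tp[c] / denom if denom else 0.0)
    macro_f1 = sum(f1s) / len(f1s)

    return {"WA": wa, "UA": ua, "Macro F1": macro_f1}


if __name__ == "__main__":
    # Toy labels for illustration only; not taken from the dataset.
    y_true = ["angry", "angry", "sad", "neutral", "neutral", "neutral"]
    y_pred = ["angry", "sad", "sad", "neutral", "neutral", "angry"]
    print(ser_metrics(y_true, y_pred))
```

On class-imbalanced data, WA tends to be dominated by the most frequent class, which is one reason UA and Macro F1 are usually reported alongside it.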