Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

RAVDESS

Ryerson Audio-Visual Database of Emotional Speech and Song

Audio | Speech | Videos | License: Attribution-NonCommercial-ShareAlike 4.0 International | Introduced 2020-12-31

The Ryerson Audio-Visual Database of Emotional Speech and Song (RAVDESS) contains 7,356 files (total size: 24.8 GB) recorded by 24 professional actors (12 female, 12 male), who vocalize two lexically matched statements in a neutral North American accent. Speech includes calm, happy, sad, angry, fearful, surprise, and disgust expressions; song includes calm, happy, sad, angry, and fearful expressions. Each expression is produced at two levels of emotional intensity (normal, strong), with an additional neutral expression. All conditions are available in three modality formats: audio-only (16-bit, 48 kHz, .wav), audio-video (720p H.264, AAC 48 kHz, .mp4), and video-only (no sound). Note that there are no song files for Actor_18.
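The file count can be checked against the conditions listed in the description. The sketch below is only an illustration: the description does not say how many times each trial was recorded, so `REPETITIONS = 2` is an assumption, and all variable and function names are hypothetical.

```python
# Conditions taken from the dataset description.
SPEECH_EMOTIONS = ["calm", "happy", "sad", "angry", "fearful", "surprise", "disgust"]
SONG_EMOTIONS = ["calm", "happy", "sad", "angry", "fearful"]
INTENSITIES = 2   # normal, strong
STATEMENTS = 2    # two lexically matched statements
MODALITIES = 3    # audio-only, audio-video, video-only
ACTORS = 24
ACTORS_WITH_SONG = ACTORS - 1  # no song files for Actor_18

# Assumption (not stated in the description): every trial is recorded twice.
REPETITIONS = 2


def trials_per_actor(emotions):
    """Count trials for one actor in one vocal channel (speech or song)."""
    # Each emotion at each intensity, plus a single neutral expression.
    expressions = len(emotions) * INTENSITIES + 1
    return expressions * STATEMENTS * REPETITIONS


speech_files = ACTORS * trials_per_actor(SPEECH_EMOTIONS) * MODALITIES
song_files = ACTORS_WITH_SONG * trials_per_actor(SONG_EMOTIONS) * MODALITIES
total_files = speech_files + song_files

print(speech_files, song_files, total_files)
```

Under that assumption the script prints `4320 3036 7356`, which agrees with the 7,356 files stated above.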

Paper: The Ryerson Audio-Visual Database of Emotional Speech and Song (RAVDESS): A dynamic, multimodal set of facial and vocal expressions in North American English

Benchmarks

3D / UAR
3D Face Modelling / UAR
3D Face Reconstruction / UAR
Audio Classification / Top-1 Accuracy
Classification / Top-1 Accuracy
Emotion Classification / Top-1 Accuracy
Emotion Recognition / Accuracy
Emotion Recognition / WAR
Emotion Recognition / F1 Score
Emotion Recognition / Precision
Emotion Recognition / Recall
Emotion Recognition / F1
Face Reconstruction / UAR
Facial Expression Recognition (FER) / UAR
Facial Recognition and Modelling / UAR
Speech Emotion Recognition / Accuracy
Speech Emotion Recognition / F1 Score
Speech Emotion Recognition / Precision
Speech Emotion Recognition / Recall
Speech Emotion Recognition / F1
Text Classification / Top-1 Accuracy

Statistics

Papers: 27
Benchmarks: 21

Tasks

3D
3D Face Modelling
3D Face Reconstruction
Audio Classification
Classification
Emotion Classification
Emotion Recognition
Face Reconstruction
Facial Emotion Recognition
Facial Expression Recognition (FER)
Facial Recognition and Modelling
Music Emotion Recognition
Speech Emotion Recognition
Text Classification
Video Emotion Recognition