Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

CREMA-D

Audio

CREMA-D is an emotional multimodal actor data set of 7,442 original clips from 91 actors. The actors, 48 male and 43 female, were between the ages of 20 and 74 and came from a variety of races and ethnicities (African American, Asian, Caucasian, Hispanic, and Unspecified).

Actors spoke from a selection of 12 sentences. Each sentence was presented using one of six emotions (Anger, Disgust, Fear, Happy, Neutral, and Sad) and one of four emotion levels (Low, Medium, High, and Unspecified).
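The label space described above (six emotions, four levels, one actor and one sentence per clip) can be sketched as a small data structure. This is a minimal illustration, not part of the dataset's tooling. The short codes (`ANG`, `HI`, `XX`, and so on) and the `ActorID_Sentence_Emotion_Level` file-name layout are assumptions about how the clips are commonly named and should be checked against the actual release; `ClipLabel` and `parse_clip_name` are hypothetical names.

```python
from dataclasses import dataclass
from enum import Enum


class Emotion(Enum):
    # The six emotions listed in the description; the codes are assumed.
    ANGER = "ANG"
    DISGUST = "DIS"
    FEAR = "FEA"
    HAPPY = "HAP"
    NEUTRAL = "NEU"
    SAD = "SAD"


class Level(Enum):
    # The four emotion levels listed in the description; the codes are assumed.
    LOW = "LO"
    MEDIUM = "MD"
    HIGH = "HI"
    UNSPECIFIED = "XX"


@dataclass(frozen=True)
class ClipLabel:
    actor_id: str
    sentence: str
    emotion: Emotion
    level: Level


def parse_clip_name(name: str) -> ClipLabel:
    """Parse a name such as '1001_IEO_ANG_HI.wav' (assumed layout)."""
    stem = name.rsplit(".", 1)[0]  # drop the file extension, if any
    parts = stem.split("_")
    if len(parts) != 4:
        raise ValueError(f"expected 4 underscore-separated fields, got {name!r}")
    actor_id, sentence, emotion_code, level_code = parts
    # Enum lookup by value raises ValueError for an unknown code.
    return ClipLabel(actor_id, sentence, Emotion(emotion_code), Level(level_code))


if __name__ == "__main__":
    print(parse_clip_name("1001_IEO_ANG_HI.wav"))
```

Using enums rather than raw strings means an unexpected emotion or level code fails loudly at parse time instead of silently creating a seventh class.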

Participants rated the emotion and emotion level of each clip based on the combined audiovisual presentation, the video alone, and the audio alone. Because of the large number of ratings needed, the effort was crowd-sourced: a total of 2,443 participants each rated 90 unique clips (30 audio, 30 visual, and 30 audio-visual). 95% of the clips have more than 7 ratings.
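As a rough sanity check on these figures, the expected number of ratings per clip and modality can be estimated from the counts above. This is a back-of-the-envelope sketch that assumes every participant completed all 90 ratings and that ratings were spread evenly over clips and modalities, neither of which the description states. The variable names are illustrative.

```python
# Figures taken from the dataset description.
participants = 2443
ratings_per_participant = 90   # 30 audio + 30 visual + 30 audio-visual
total_clips = 7442
modalities = 3                 # audio, visual, audio-visual

# Upper bound on collected ratings, assuming full participation.
total_ratings = participants * ratings_per_participant

# Each clip can be rated in each of the three modalities.
rated_items = total_clips * modalities

# Average ratings per (clip, modality) pair under an even spread.
mean_ratings = total_ratings / rated_items

if __name__ == "__main__":
    print(f"total ratings: {total_ratings}")
    print(f"clip-modality items: {rated_items}")
    print(f"mean ratings per item: {mean_ratings:.2f}")
```

Under these assumptions the average comes out a little under 10 ratings per clip and modality, which is consistent with the statement that 95% of the clips have more than 7 ratings.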

Benchmarks

1 Image, 2*2 Stitchi: EmoAcc, FID, LSE-C
10-shot image generation: EmoAcc, FID, LSE-C
3D: UAR, EmoAcc, FID, LSE-C
3D Face Modelling: UAR, EmoAcc, FID, LSE-C
3D Face Reconstruction: UAR, EmoAcc, FID, LSE-C
Audio Classification: Accuracy
Classification: Accuracy
Emotion Recognition: Accuracy, WAR
Face Generation: EmoAcc, FID, LSE-C
Face Reconstruction: UAR, EmoAcc, FID, LSE-C
Facial Expression Recognition (FER): UAR
Facial Recognition and Modelling: UAR, EmoAcc, FID, LSE-C
Few-Shot Learning: Top-1 Accuracy (5-Way-1-Shot)
Image Generation: EmoAcc, FID, LSE-C
Meta-Learning: Top-1 Accuracy (5-Way-1-Shot)
Self-Supervised Learning: Accuracy
Speech Emotion Recognition: Accuracy
Talking Face Generation: EmoAcc, FID, LSE-C

Statistics

Papers: 28
Benchmarks: 44

Tasks

1 Image, 2*2 Stitchi
10-shot image generation
3D
3D Face Modelling
3D Face Reconstruction
Audio Classification
Classification
Emotion Recognition
Face Generation
Face Reconstruction
Facial Expression Recognition (FER)
Facial Recognition and Modelling
Few-Shot Audio Classification
Few-Shot Learning
Image Generation
Meta-Learning
Self-Supervised Learning
Speech Emotion Recognition
Talking Face Generation
Video Emotion Recognition