Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Emotionally Enhanced Talking Face Generation

Sahil Goyal, Shagun Uppal, Sarthak Bhagat, Yi Yu, Yifang Yin, Rajiv Ratn Shah

2023-03-21 | Talking Head Generation, Face Generation, Talking Face Generation

Abstract

Several works have developed end-to-end pipelines for generating lip-synced talking faces with various real-world applications, such as teaching and language translation in videos. However, these prior works fail to create realistic-looking videos since they focus little on people's expressions and emotions. Moreover, these methods' effectiveness largely depends on the faces in the training dataset, which means they may not perform well on unseen faces. To mitigate this, we build a talking face generation framework conditioned on a categorical emotion to generate videos with appropriate expressions, making them more realistic and convincing. With a broad range of six emotions, i.e., happiness, sadness, fear, anger, disgust, and neutral, we show that our model can adapt to arbitrary identities, emotions, and languages. Our proposed framework is equipped with a user-friendly web interface with a real-time experience for talking face generation with emotions. We also conduct a user study for subjective evaluation of our interface's usability, design, and functionality. Project page: https://midas.iiitd.edu.in/emo/

Results

Task | Dataset | Metric | Value | Model
Facial Recognition and Modelling | CREMA-D | EmoAcc | 83.2 | EmoGen
Facial Recognition and Modelling | CREMA-D | FID | 5.29 | EmoGen
Facial Recognition and Modelling | CREMA-D | LSE-C | 6.663 | EmoGen
Image Generation | CREMA-D | EmoAcc | 83.2 | EmoGen
Image Generation | CREMA-D | FID | 5.29 | EmoGen
Image Generation | CREMA-D | LSE-C | 6.663 | EmoGen
Face Generation | CREMA-D | EmoAcc | 83.2 | EmoGen
Face Generation | CREMA-D | FID | 5.29 | EmoGen
Face Generation | CREMA-D | LSE-C | 6.663 | EmoGen
Face Reconstruction | CREMA-D | EmoAcc | 83.2 | EmoGen
Face Reconstruction | CREMA-D | FID | 5.29 | EmoGen
Face Reconstruction | CREMA-D | LSE-C | 6.663 | EmoGen
3D | CREMA-D | EmoAcc | 83.2 | EmoGen
3D | CREMA-D | FID | 5.29 | EmoGen
3D | CREMA-D | LSE-C | 6.663 | EmoGen
3D Face Modelling | CREMA-D | EmoAcc | 83.2 | EmoGen
3D Face Modelling | CREMA-D | FID | 5.29 | EmoGen
3D Face Modelling | CREMA-D | LSE-C | 6.663 | EmoGen
3D Face Reconstruction | CREMA-D | EmoAcc | 83.2 | EmoGen
3D Face Reconstruction | CREMA-D | FID | 5.29 | EmoGen
3D Face Reconstruction | CREMA-D | LSE-C | 6.663 | EmoGen
Talking Face Generation | CREMA-D | EmoAcc | 83.2 | EmoGen
Talking Face Generation | CREMA-D | FID | 5.29 | EmoGen
Talking Face Generation | CREMA-D | LSE-C | 6.663 | EmoGen
10-shot image generation | CREMA-D | EmoAcc | 83.2 | EmoGen
10-shot image generation | CREMA-D | FID | 5.29 | EmoGen
10-shot image generation | CREMA-D | LSE-C | 6.663 | EmoGen
1 Image, 2*2 Stitchi | CREMA-D | EmoAcc | 83.2 | EmoGen
1 Image, 2*2 Stitchi | CREMA-D | FID | 5.29 | EmoGen
1 Image, 2*2 Stitchi | CREMA-D | LSE-C | 6.663 | EmoGen

Related Papers

Non-Adaptive Adversarial Face Generation (2025-07-16)
MEDTalk: Multimodal Controlled 3D Facial Animation with Dynamic Emotions by Disentangled Embedding (2025-07-08)
Advancing Talking Head Generation: A Comprehensive Survey of Multi-Modal Methodologies, Datasets, Evaluation Metrics, and Loss Functions (2025-06-23)
Silence is Golden: Leveraging Adversarial Examples to Nullify Audio Control in LDM-based Talking-Head Generation (2025-06-02)
Guard Me If You Know Me: Protecting Specific Face-Identity from Deepfakes (2025-05-26)
DualTalk: Dual-Speaker Interaction for 3D Talking Head Conversations (2025-05-23)
FaceCrafter: Identity-Conditional Diffusion with Disentangled Control over Facial Pose, Expression, and Emotion (2025-05-21)
FLUXSynID: A Framework for Identity-Controlled Synthetic Face Generation with Document and Live Images (2025-05-12)