TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Representation Learning and Identity Adversarial Training ...

Representation Learning and Identity Adversarial Training for Facial Behavior Understanding

Mang Ning, Albert Ali Salah, Itir Onal Ertugrul

2024-07-15Representation LearningFacial Action Unit DetectionFacial Expression Recognition (FER)
PaperPDFCode(official)

Abstract

Facial Action Unit (AU) detection has gained significant attention as it enables the breakdown of complex facial expressions into individual muscle movements. In this paper, we revisit two fundamental factors in AU detection: diverse and large-scale data and subject identity regularization. Motivated by recent advances in foundation models, we highlight the importance of data and introduce Face9M, a diverse dataset comprising 9 million facial images from multiple public sources. Pretraining a masked autoencoder on Face9M yields strong performance in AU detection and facial expression tasks. More importantly, we emphasize that the Identity Adversarial Training (IAT) has not been well explored in AU tasks. To fill this gap, we first show that subject identity in AU datasets creates shortcut learning for the model and leads to sub-optimal solutions to AU predictions. Secondly, we demonstrate that strong IAT regularization is necessary to learn identity-invariant features. Finally, we elucidate the design space of IAT and empirically show that IAT circumvents the identity-based shortcut learning and results in a better solution. Our proposed methods, Facial Masked Autoencoder (FMAE) and IAT, are simple, generic and effective. Remarkably, the proposed FMAE-IAT approach achieves new state-of-the-art F1 scores on BP4D (67.1\%), BP4D+ (66.8\%), and DISFA (70.1\%) databases, significantly outperforming previous work. We release the code and model at https://github.com/forever208/FMAE-IAT.

Results

TaskDatasetMetricValueModel
Facial Recognition and ModellingRAF-DBOverall Accuracy93.45FMAE
Facial Recognition and ModellingAffectNetAccuracy (8 emotion)64.79FMAE
Facial Recognition and ModellingDISFAAverage F170.1FMAE_IAT
Facial Recognition and ModellingDISFAAverage F168.7FMAE
Facial Recognition and ModellingBP4D+Average F166.8FMAE-IAT
Facial Recognition and ModellingBP4D+Average F166.2FMAE
Facial Recognition and ModellingBP4DAverage F167.1FMAE-IAT
Facial Recognition and ModellingBP4DAverage F166.6FMAE
Face ReconstructionRAF-DBOverall Accuracy93.45FMAE
Face ReconstructionAffectNetAccuracy (8 emotion)64.79FMAE
Face ReconstructionDISFAAverage F170.1FMAE_IAT
Face ReconstructionDISFAAverage F168.7FMAE
Face ReconstructionBP4D+Average F166.8FMAE-IAT
Face ReconstructionBP4D+Average F166.2FMAE
Face ReconstructionBP4DAverage F167.1FMAE-IAT
Face ReconstructionBP4DAverage F166.6FMAE
Facial Expression Recognition (FER)RAF-DBOverall Accuracy93.45FMAE
Facial Expression Recognition (FER)AffectNetAccuracy (8 emotion)64.79FMAE
3DRAF-DBOverall Accuracy93.45FMAE
3DAffectNetAccuracy (8 emotion)64.79FMAE
3DDISFAAverage F170.1FMAE_IAT
3DDISFAAverage F168.7FMAE
3DBP4D+Average F166.8FMAE-IAT
3DBP4D+Average F166.2FMAE
3DBP4DAverage F167.1FMAE-IAT
3DBP4DAverage F166.6FMAE
3D Face ModellingRAF-DBOverall Accuracy93.45FMAE
3D Face ModellingAffectNetAccuracy (8 emotion)64.79FMAE
3D Face ModellingDISFAAverage F170.1FMAE_IAT
3D Face ModellingDISFAAverage F168.7FMAE
3D Face ModellingBP4D+Average F166.8FMAE-IAT
3D Face ModellingBP4D+Average F166.2FMAE
3D Face ModellingBP4DAverage F167.1FMAE-IAT
3D Face ModellingBP4DAverage F166.6FMAE
3D Face ReconstructionRAF-DBOverall Accuracy93.45FMAE
3D Face ReconstructionAffectNetAccuracy (8 emotion)64.79FMAE
3D Face ReconstructionDISFAAverage F170.1FMAE_IAT
3D Face ReconstructionDISFAAverage F168.7FMAE
3D Face ReconstructionBP4D+Average F166.8FMAE-IAT
3D Face ReconstructionBP4D+Average F166.2FMAE
3D Face ReconstructionBP4DAverage F167.1FMAE-IAT
3D Face ReconstructionBP4DAverage F166.6FMAE

Related Papers

Touch in the Wild: Learning Fine-Grained Manipulation with a Portable Visuo-Tactile Gripper2025-07-20Spectral Bellman Method: Unifying Representation and Exploration in RL2025-07-17Boosting Team Modeling through Tempo-Relational Representation Learning2025-07-17Similarity-Guided Diffusion for Contrastive Sequential Recommendation2025-07-16Are encoders able to learn landmarkers for warm-starting of Hyperparameter Optimization?2025-07-16Language-Guided Contrastive Audio-Visual Masked Autoencoder with Automatically Generated Audio-Visual-Text Triplets from Videos2025-07-16A Mixed-Primitive-based Gaussian Splatting Method for Surface Reconstruction2025-07-15Dual Dimensions Geometric Representation Learning Based Document Dewarping2025-07-11