TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Learning Vision Transformer with Squeeze and Excitation fo...

Learning Vision Transformer with Squeeze and Excitation for Facial Expression Recognition

Mouath Aouayeb, Wassim Hamidouche, Catherine Soladie, Kidiyo Kpalma, Renaud Seguier

2021-07-07Facial Expression Recognition (FER)
PaperPDF

Abstract

As various databases of facial expressions have been made accessible over the last few decades, the Facial Expression Recognition (FER) task has gotten a lot of interest. The multiple sources of the available databases raised several challenges for facial recognition task. These challenges are usually addressed by Convolution Neural Network (CNN) architectures. Different from CNN models, a Transformer model based on attention mechanism has been presented recently to address vision tasks. One of the major issue with Transformers is the need of a large data for training, while most FER databases are limited compared to other vision applications. Therefore, we propose in this paper to learn a vision Transformer jointly with a Squeeze and Excitation (SE) block for FER task. The proposed method is evaluated on different publicly available FER databases including CK+, JAFFE,RAF-DB and SFEW. Experiments demonstrate that our model outperforms state-of-the-art methods on CK+ and SFEW and achieves competitive results on JAFFE and RAF-DB.

Results

TaskDatasetMetricValueModel
Facial Recognition and ModellingCK+Accuracy (7 emotion)99.8ViT + SE
Facial Recognition and ModellingRaFDAccuracy87.22ViT + SE
Facial Recognition and ModellingJAFFEAccuracy94.83ViT
Facial Recognition and ModellingSFEWAccuracy54.29ViT + SE
Face ReconstructionCK+Accuracy (7 emotion)99.8ViT + SE
Face ReconstructionRaFDAccuracy87.22ViT + SE
Face ReconstructionJAFFEAccuracy94.83ViT
Face ReconstructionSFEWAccuracy54.29ViT + SE
Facial Expression Recognition (FER)RaFDAccuracy87.22ViT + SE
Facial Expression Recognition (FER)CK+Accuracy (7 emotion)99.8ViT + SE
Facial Expression Recognition (FER)JAFFEAccuracy94.83ViT
Facial Expression Recognition (FER)SFEWAccuracy54.29ViT + SE
3DCK+Accuracy (7 emotion)99.8ViT + SE
3DRaFDAccuracy87.22ViT + SE
3DJAFFEAccuracy94.83ViT
3DSFEWAccuracy54.29ViT + SE
3D Face ModellingRaFDAccuracy87.22ViT + SE
3D Face ModellingCK+Accuracy (7 emotion)99.8ViT + SE
3D Face ModellingJAFFEAccuracy94.83ViT
3D Face ModellingSFEWAccuracy54.29ViT + SE
3D Face ReconstructionCK+Accuracy (7 emotion)99.8ViT + SE
3D Face ReconstructionRaFDAccuracy87.22ViT + SE
3D Face ReconstructionJAFFEAccuracy94.83ViT
3D Face ReconstructionSFEWAccuracy54.29ViT + SE

Related Papers

Multimodal Prompt Alignment for Facial Expression Recognition2025-06-26EfficientFER: EfficientNetv2 Based Deep Learning Approach for Facial Expression Recognition2025-06-02TKFNet: Learning Texture Key Factor Driven Feature for Facial Expression Recognition2025-05-15Achieving 3D Attention via Triplet Squeeze and Excitation Block2025-05-09SDAFE: A Dual-filter Stable Diffusion Data Augmentation Method for Facial Expression Recognition2025-04-06Evaluating Facial Expression Recognition Datasets for Deep Learning: A Benchmark Study with Novel Similarity Metrics2025-03-26Disentangled Source-Free Personalization for Facial Expression Recognition with Neutral Target Data2025-03-26V-NAW: Video-based Noise-aware Adaptive Weighting for Facial Expression Recognition2025-03-20