TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Towards Accurate Facial Landmark Detection via Cascaded Tr...

Towards Accurate Facial Landmark Detection via Cascaded Transformers

Hui Li, Zidong Guo, Seon-Min Rhee, Seungju Han, Jae-Joon Han

2022-08-23CVPR 2022 1Face AlignmentFacial Landmark Detection
PaperPDF

Abstract

Accurate facial landmarks are essential prerequisites for many tasks related to human faces. In this paper, an accurate facial landmark detector is proposed based on cascaded transformers. We formulate facial landmark detection as a coordinate regression task such that the model can be trained end-to-end. With self-attention in transformers, our model can inherently exploit the structured relationships between landmarks, which would benefit landmark detection under challenging conditions such as large pose and occlusion. During cascaded refinement, our model is able to extract the most relevant image features around the target landmark for coordinate prediction, based on deformable attention mechanism, thus bringing more accurate alignment. In addition, we propose a novel decoder that refines image features and landmark positions simultaneously. With few parameter increasing, the detection performance improves further. Our model achieves new state-of-the-art performance on several standard facial landmark detection benchmarks, and shows good generalization ability in cross-dataset evaluation.

Results

TaskDatasetMetricValueModel
Facial Recognition and ModellingAFLW-19NME_diag (%, Full)1.37DTLD+
Facial Recognition and Modelling300WNME_inter-ocular (%, Challenge)4.48DTLD+
Facial Recognition and Modelling300WNME_inter-ocular (%, Common)2.6DTLD+
Facial Recognition and Modelling300WNME_inter-ocular (%, Full)2.96DTLD+
Facial Recognition and ModellingWFLWFR@10 (inter-ocular)2.68DTLD+
Facial Recognition and ModellingWFLWNME (inter-ocular)4.05DTLD+
Facial Recognition and Modelling300W Split 2AUC@7 (box)70.9DTLD-s
Facial Recognition and Modelling300W Split 2NME (box)2.05DTLD-s
Face Reconstruction300WNME_inter-ocular (%, Challenge)4.48DTLD+
Face Reconstruction300WNME_inter-ocular (%, Common)2.6DTLD+
Face Reconstruction300WNME_inter-ocular (%, Full)2.96DTLD+
Face Reconstruction300W Split 2AUC@7 (box)70.9DTLD-s
Face Reconstruction300W Split 2NME (box)2.05DTLD-s
Face ReconstructionAFLW-19NME_diag (%, Full)1.37DTLD+
Face ReconstructionWFLWFR@10 (inter-ocular)2.68DTLD+
Face ReconstructionWFLWNME (inter-ocular)4.05DTLD+
3D300WNME_inter-ocular (%, Challenge)4.48DTLD+
3D300WNME_inter-ocular (%, Common)2.6DTLD+
3D300WNME_inter-ocular (%, Full)2.96DTLD+
3D300W Split 2AUC@7 (box)70.9DTLD-s
3D300W Split 2NME (box)2.05DTLD-s
3DAFLW-19NME_diag (%, Full)1.37DTLD+
3DWFLWFR@10 (inter-ocular)2.68DTLD+
3DWFLWNME (inter-ocular)4.05DTLD+
3D Face ModellingAFLW-19NME_diag (%, Full)1.37DTLD+
3D Face Modelling300WNME_inter-ocular (%, Challenge)4.48DTLD+
3D Face Modelling300WNME_inter-ocular (%, Common)2.6DTLD+
3D Face Modelling300WNME_inter-ocular (%, Full)2.96DTLD+
3D Face ModellingWFLWFR@10 (inter-ocular)2.68DTLD+
3D Face ModellingWFLWNME (inter-ocular)4.05DTLD+
3D Face Modelling300W Split 2AUC@7 (box)70.9DTLD-s
3D Face Modelling300W Split 2NME (box)2.05DTLD-s
3D Face ReconstructionAFLW-19NME_diag (%, Full)1.37DTLD+
3D Face Reconstruction300WNME_inter-ocular (%, Challenge)4.48DTLD+
3D Face Reconstruction300WNME_inter-ocular (%, Common)2.6DTLD+
3D Face Reconstruction300WNME_inter-ocular (%, Full)2.96DTLD+
3D Face ReconstructionWFLWFR@10 (inter-ocular)2.68DTLD+
3D Face ReconstructionWFLWNME (inter-ocular)4.05DTLD+
3D Face Reconstruction300W Split 2AUC@7 (box)70.9DTLD-s
3D Face Reconstruction300W Split 2NME (box)2.05DTLD-s

Related Papers

MOL: Joint Estimation of Micro-Expression, Optical Flow, and Landmark via Transformer-Graph-Style Convolution2025-06-17Towards Large-Scale Pose-Invariant Face Recognition Using Face Defrontalization2025-06-04HonestFace: Towards Honest Face Restoration with One-Step Diffusion Model2025-05-24Multimodal Emotion Coupling via Speech-to-Facial and Bodily Gestures in Dyadic Interaction2025-05-08Semantic Style Transfer for Enhancing Animal Facial Landmark Detection2025-05-08SocioVerse: A World Model for Social Simulation Powered by LLM Agents and A Pool of 10 Million Real-World Users2025-04-14Mitigating Knowledge Discrepancies among Multiple Datasets for Task-agnostic Unified Face Alignment2025-03-28Learning Person-Specific Animatable Face Models from In-the-Wild Images via a Shared Base Model2025-01-01