TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Semantic Alignment: Finding Semantically Consistent Ground...

Semantic Alignment: Finding Semantically Consistent Ground-truth for Facial Landmark Detection

Zhiwei Liu, Xiangyu Zhu, Guosheng Hu, Haiyun Guo, Ming Tang, Zhen Lei, Neil M. Robertson, Jinqiao Wang

2019-03-26CVPR 2019 6Face AlignmentFacial Landmark Detection
PaperPDF

Abstract

Recently, deep learning based facial landmark detection has achieved great success. Despite this, we notice that the semantic ambiguity greatly degrades the detection performance. Specifically, the semantic ambiguity means that some landmarks (e.g. those evenly distributed along the face contour) do not have clear and accurate definition, causing inconsistent annotations by annotators. Accordingly, these inconsistent annotations, which are usually provided by public databases, commonly work as the ground-truth to supervise network training, leading to the degraded accuracy. To our knowledge, little research has investigated this problem. In this paper, we propose a novel probabilistic model which introduces a latent variable, i.e. the 'real' ground-truth which is semantically consistent, to optimize. This framework couples two parts (1) training landmark detection CNN and (2) searching the 'real' ground-truth. These two parts are alternatively optimized: the searched 'real' ground-truth supervises the CNN training; and the trained CNN assists the searching of 'real' ground-truth. In addition, to recover the unconfidently predicted landmarks due to occlusion and low quality, we propose a global heatmap correction unit (GHCU) to correct outliers by considering the global face shape as a constraint. Extensive experiments on both image-based (300W and AFLW) and video-based (300-VW) databases demonstrate that our method effectively improves the landmark detection accuracy and achieves the state of the art performance.

Results

TaskDatasetMetricValueModel
Facial Recognition and Modelling300WNME_inter-pupil (%, Challenge)6.38HGs + SA + Norm + GHCU
Facial Recognition and Modelling300WNME_inter-pupil (%, Common)3.45HGs + SA + Norm + GHCU
Facial Recognition and Modelling300WNME_inter-pupil (%, Full)4.02HGs + SA + Norm + GHCU
Face Reconstruction300WNME_inter-pupil (%, Challenge)6.38HGs + SA + Norm + GHCU
Face Reconstruction300WNME_inter-pupil (%, Common)3.45HGs + SA + Norm + GHCU
Face Reconstruction300WNME_inter-pupil (%, Full)4.02HGs + SA + Norm + GHCU
3D300WNME_inter-pupil (%, Challenge)6.38HGs + SA + Norm + GHCU
3D300WNME_inter-pupil (%, Common)3.45HGs + SA + Norm + GHCU
3D300WNME_inter-pupil (%, Full)4.02HGs + SA + Norm + GHCU
3D Face Modelling300WNME_inter-pupil (%, Challenge)6.38HGs + SA + Norm + GHCU
3D Face Modelling300WNME_inter-pupil (%, Common)3.45HGs + SA + Norm + GHCU
3D Face Modelling300WNME_inter-pupil (%, Full)4.02HGs + SA + Norm + GHCU
3D Face Reconstruction300WNME_inter-pupil (%, Challenge)6.38HGs + SA + Norm + GHCU
3D Face Reconstruction300WNME_inter-pupil (%, Common)3.45HGs + SA + Norm + GHCU
3D Face Reconstruction300WNME_inter-pupil (%, Full)4.02HGs + SA + Norm + GHCU

Related Papers

MOL: Joint Estimation of Micro-Expression, Optical Flow, and Landmark via Transformer-Graph-Style Convolution2025-06-17Towards Large-Scale Pose-Invariant Face Recognition Using Face Defrontalization2025-06-04HonestFace: Towards Honest Face Restoration with One-Step Diffusion Model2025-05-24Multimodal Emotion Coupling via Speech-to-Facial and Bodily Gestures in Dyadic Interaction2025-05-08Semantic Style Transfer for Enhancing Animal Facial Landmark Detection2025-05-08SocioVerse: A World Model for Social Simulation Powered by LLM Agents and A Pool of 10 Million Real-World Users2025-04-14Mitigating Knowledge Discrepancies among Multiple Datasets for Task-agnostic Unified Face Alignment2025-03-28Learning Person-Specific Animatable Face Models from In-the-Wild Images via a Shared Base Model2025-01-01