TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Joint Voxel and Coordinate Regression for Accurate 3D Faci...

Joint Voxel and Coordinate Regression for Accurate 3D Facial Landmark Localization

Hongwen Zhang, Qi Li, Zhenan Sun

2018-01-28Face AlignmentregressionFacial Landmark DetectionDepth Estimation3D Facial Landmark Localization
PaperPDF

Abstract

3D face shape is more expressive and viewpoint-consistent than its 2D counterpart. However, 3D facial landmark localization in a single image is challenging due to the ambiguous nature of landmarks under 3D perspective. Existing approaches typically adopt a suboptimal two-step strategy, performing 2D landmark localization followed by depth estimation. In this paper, we propose the Joint Voxel and Coordinate Regression (JVCR) method for 3D facial landmark localization, addressing it more effectively in an end-to-end fashion. First, a compact volumetric representation is proposed to encode the per-voxel likelihood of positions being the 3D landmarks. The dimensionality of such a representation is fixed regardless of the number of target landmarks, so that the curse of dimensionality could be avoided. Then, a stacked hourglass network is adopted to estimate the volumetric representation from coarse to fine, followed by a 3D convolution network that takes the estimated volume as input and regresses 3D coordinates of the face shape. In this way, the 3D structural constraints between landmarks could be learned by the neural network in a more efficient manner. Moreover, the proposed pipeline enables end-to-end training and improves the robustness and accuracy of 3D facial landmark localization. The effectiveness of our approach is validated on the 3DFAW and AFLW2000-3D datasets. Experimental results show that the proposed method achieves state-of-the-art performance in comparison with existing methods.

Results

TaskDatasetMetricValueModel
Facial Recognition and ModellingAFLW2000-3DGTE7.28JVCR
Facial Recognition and ModellingAFLW2000-3DGTE7.28JVCR
Facial Recognition and Modelling3DFAWCVGTCE3.46JVCR
Facial Recognition and Modelling3DFAWGTE4.35JVCR
Facial Landmark DetectionAFLW2000-3DGTE7.28JVCR
Facial Landmark DetectionAFLW2000-3DGTE7.28JVCR
Facial Landmark Detection3DFAWCVGTCE3.46JVCR
Facial Landmark Detection3DFAWGTE4.35JVCR
Face ReconstructionAFLW2000-3DGTE7.28JVCR
Face ReconstructionAFLW2000-3DGTE7.28JVCR
Face Reconstruction3DFAWCVGTCE3.46JVCR
Face Reconstruction3DFAWGTE4.35JVCR
3DAFLW2000-3DGTE7.28JVCR
3DAFLW2000-3DGTE7.28JVCR
3D3DFAWCVGTCE3.46JVCR
3D3DFAWGTE4.35JVCR
3D Face ModellingAFLW2000-3DGTE7.28JVCR
3D Face ModellingAFLW2000-3DGTE7.28JVCR
3D Face Modelling3DFAWCVGTCE3.46JVCR
3D Face Modelling3DFAWGTE4.35JVCR
3D Face ReconstructionAFLW2000-3DGTE7.28JVCR
3D Face ReconstructionAFLW2000-3DGTE7.28JVCR
3D Face Reconstruction3DFAWCVGTCE3.46JVCR
3D Face Reconstruction3DFAWGTE4.35JVCR

Related Papers

Language Integration in Fine-Tuning Multimodal Large Language Models for Image-Based Regression2025-07-20$S^2M^2$: Scalable Stereo Matching Model for Reliable Depth Estimation2025-07-17$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17Neural Network-Guided Symbolic Regression for Interpretable Descriptor Discovery in Perovskite Catalysts2025-07-16Imbalanced Regression Pipeline Recommendation2025-07-16Second-Order Bounds for [0,1]-Valued Regression via Betting Loss2025-07-16Efficient Calisthenics Skills Classification through Foreground Instance Selection and Depth Estimation2025-07-16Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios2025-07-16