TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Self-Supervised Monocular 3D Face Reconstruction by Occlus...

Self-Supervised Monocular 3D Face Reconstruction by Occlusion-Aware Multi-view Geometry Consistency

Jiaxiang Shang, Tianwei Shen, Shiwei Li, Lei Zhou, Mingmin Zhen, Tian Fang, Long Quan

2020-07-24ECCV 2020 8Face AlignmentSelf-Supervised LearningFace ReconstructionDepth Estimation3D Face Reconstruction
PaperPDFCode(official)

Abstract

Recent learning-based approaches, in which models are trained by single-view images have shown promising results for monocular 3D face reconstruction, but they suffer from the ill-posed face pose and depth ambiguity issue. In contrast to previous works that only enforce 2D feature constraints, we propose a self-supervised training architecture by leveraging the multi-view geometry consistency, which provides reliable constraints on face pose and depth estimation. We first propose an occlusion-aware view synthesis method to apply multi-view geometry consistency to self-supervised learning. Then we design three novel loss functions for multi-view consistency, including the pixel consistency loss, the depth consistency loss, and the facial landmark-based epipolar loss. Our method is accurate and robust, especially under large variations of expressions, poses, and illumination conditions. Comprehensive experiments on the face alignment and 3D face reconstruction benchmarks have demonstrated superiority over state-of-the-art methods. Our code and model are released in https://github.com/jiaxiangshang/MGCNet.

Results

TaskDatasetMetricValueModel
Facial Recognition and ModellingREALYall1.787MGCNet
Facial Recognition and ModellingNoW BenchmarkMean Reconstruction Error (mm)1.87MGCNet
Facial Recognition and ModellingNoW BenchmarkMedian Reconstruction Error1.31MGCNet
Facial Recognition and ModellingNoW BenchmarkStdev Reconstruction Error (mm)2.63MGCNet
Facial Recognition and ModellingREALY (side-view)all1.787MGCNet
Face ReconstructionREALYall1.787MGCNet
Face ReconstructionNoW BenchmarkMean Reconstruction Error (mm)1.87MGCNet
Face ReconstructionNoW BenchmarkMedian Reconstruction Error1.31MGCNet
Face ReconstructionNoW BenchmarkStdev Reconstruction Error (mm)2.63MGCNet
Face ReconstructionREALY (side-view)all1.787MGCNet
3DREALYall1.787MGCNet
3DNoW BenchmarkMean Reconstruction Error (mm)1.87MGCNet
3DNoW BenchmarkMedian Reconstruction Error1.31MGCNet
3DNoW BenchmarkStdev Reconstruction Error (mm)2.63MGCNet
3DREALY (side-view)all1.787MGCNet
3D Face ModellingREALYall1.787MGCNet
3D Face ModellingNoW BenchmarkMean Reconstruction Error (mm)1.87MGCNet
3D Face ModellingNoW BenchmarkMedian Reconstruction Error1.31MGCNet
3D Face ModellingNoW BenchmarkStdev Reconstruction Error (mm)2.63MGCNet
3D Face ModellingREALY (side-view)all1.787MGCNet
3D Face ReconstructionREALYall1.787MGCNet
3D Face ReconstructionNoW BenchmarkMean Reconstruction Error (mm)1.87MGCNet
3D Face ReconstructionNoW BenchmarkMedian Reconstruction Error1.31MGCNet
3D Face ReconstructionNoW BenchmarkStdev Reconstruction Error (mm)2.63MGCNet
3D Face ReconstructionREALY (side-view)all1.787MGCNet

Related Papers

A Semi-Supervised Learning Method for the Identification of Bad Exposures in Large Imaging Surveys2025-07-17$S^2M^2$: Scalable Stereo Matching Model for Reliable Depth Estimation2025-07-17$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17Efficient Calisthenics Skills Classification through Foreground Instance Selection and Depth Estimation2025-07-16Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios2025-07-16MonoMVSNet: Monocular Priors Guided Multi-View Stereo Network2025-07-15Towards Depth Foundation Model: Recent Trends in Vision-Based Depth Estimation2025-07-15Self-supervised Learning on Camera Trap Footage Yields a Strong Universal Face Embedder2025-07-14