TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Towards High Fidelity Monocular Face Reconstruction with R...

Towards High Fidelity Monocular Face Reconstruction with Rich Reflectance using Self-supervised Learning and Ray Tracing

Abdallah Dib, Cedric Thebault, Junghyun Ahn, Philippe-Henri Gosselin, Christian Theobalt, Louis Chevallier

2021-03-29ICCV 2021 10Self-Supervised LearningFace Reconstruction3D Face Reconstruction
PaperPDF

Abstract

Robust face reconstruction from monocular image in general lighting conditions is challenging. Methods combining deep neural network encoders with differentiable rendering have opened up the path for very fast monocular reconstruction of geometry, lighting and reflectance. They can also be trained in self-supervised manner for increased robustness and better generalization. However, their differentiable rasterization based image formation models, as well as underlying scene parameterization, limit them to Lambertian face reflectance and to poor shape details. More recently, ray tracing was introduced for monocular face reconstruction within a classic optimization-based framework and enables state-of-the art results. However optimization-based approaches are inherently slow and lack robustness. In this paper, we build our work on the aforementioned approaches and propose a new method that greatly improves reconstruction quality and robustness in general scenes. We achieve this by combining a CNN encoder with a differentiable ray tracer, which enables us to base the reconstruction on much more advanced personalized diffuse and specular albedos, a more sophisticated illumination model and a plausible representation of self-shadows. This enables to take a big leap forward in reconstruction quality of shape, appearance and lighting even in scenes with difficult illumination. With consistent face attributes reconstruction, our method leads to practical applications such as relighting and self-shadows removal. Compared to state-of-the-art methods, our results show improved accuracy and validity of the approach.

Results

TaskDatasetMetricValueModel
Facial Recognition and ModellingNoW BenchmarkMean Reconstruction Error (mm)1.57Dib et al. 2021
Facial Recognition and ModellingNoW BenchmarkMedian Reconstruction Error1.26Dib et al. 2021
Facial Recognition and ModellingNoW BenchmarkStdev Reconstruction Error (mm)1.31Dib et al. 2021
Face ReconstructionNoW BenchmarkMean Reconstruction Error (mm)1.57Dib et al. 2021
Face ReconstructionNoW BenchmarkMedian Reconstruction Error1.26Dib et al. 2021
Face ReconstructionNoW BenchmarkStdev Reconstruction Error (mm)1.31Dib et al. 2021
3DNoW BenchmarkMean Reconstruction Error (mm)1.57Dib et al. 2021
3DNoW BenchmarkMedian Reconstruction Error1.26Dib et al. 2021
3DNoW BenchmarkStdev Reconstruction Error (mm)1.31Dib et al. 2021
3D Face ModellingNoW BenchmarkMean Reconstruction Error (mm)1.57Dib et al. 2021
3D Face ModellingNoW BenchmarkMedian Reconstruction Error1.26Dib et al. 2021
3D Face ModellingNoW BenchmarkStdev Reconstruction Error (mm)1.31Dib et al. 2021
3D Face ReconstructionNoW BenchmarkMean Reconstruction Error (mm)1.57Dib et al. 2021
3D Face ReconstructionNoW BenchmarkMedian Reconstruction Error1.26Dib et al. 2021
3D Face ReconstructionNoW BenchmarkStdev Reconstruction Error (mm)1.31Dib et al. 2021

Related Papers

A Semi-Supervised Learning Method for the Identification of Bad Exposures in Large Imaging Surveys2025-07-17Self-supervised Learning on Camera Trap Footage Yields a Strong Universal Face Embedder2025-07-14Speech Quality Assessment Model Based on Mixture of Experts: System-Level Performance Enhancement and Utterance-Level Challenge Analysis2025-07-08World4Drive: End-to-End Autonomous Driving via Intention-aware Physical Latent World Model2025-07-01ShapeEmbed: a self-supervised learning framework for 2D contour quantification2025-07-01RetFiner: A Vision-Language Refinement Scheme for Retinal Foundation Models2025-06-27Boosting Generative Adversarial Transferability with Self-supervised Vision Transformer Features2025-06-26Hybrid Deep Learning and Signal Processing for Arabic Dialect Recognition in Low-Resource Settings2025-06-26