TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Towards Metrical Reconstruction of Human Faces

Towards Metrical Reconstruction of Human Faces

Wojciech Zielonka, Timo Bolkart, Justus Thies

2022-04-13Face RecognitionFace Reconstruction3D Face Reconstruction2k
PaperPDFCode(official)

Abstract

Face reconstruction and tracking is a building block of numerous applications in AR/VR, human-machine interaction, as well as medical applications. Most of these applications rely on a metrically correct prediction of the shape, especially, when the reconstructed subject is put into a metrical context (i.e., when there is a reference object of known size). A metrical reconstruction is also needed for any application that measures distances and dimensions of the subject (e.g., to virtually fit a glasses frame). State-of-the-art methods for face reconstruction from a single image are trained on large 2D image datasets in a self-supervised fashion. However, due to the nature of a perspective projection they are not able to reconstruct the actual face dimensions, and even predicting the average human face outperforms some of these methods in a metrical sense. To learn the actual shape of a face, we argue for a supervised training scheme. Since there exists no large-scale 3D dataset for this task, we annotated and unified small- and medium-scale databases. The resulting unified dataset is still a medium-scale dataset with more than 2k identities and training purely on it would lead to overfitting. To this end, we take advantage of a face recognition network pretrained on a large-scale 2D image dataset, which provides distinct features for different faces and is robust to expression, illumination, and camera changes. Using these features, we train our face shape estimator in a supervised fashion, inheriting the robustness and generalization of the face recognition network. Our method, which we call MICA (MetrIC fAce), outperforms the state-of-the-art reconstruction methods by a large margin, both on current non-metric benchmarks as well as on our metric benchmarks (15% and 24% lower average error on NoW, respectively).

Results

TaskDatasetMetricValueModel
Facial Recognition and ModellingREALYall2.134MICA
Facial Recognition and ModellingNoW BenchmarkMean Reconstruction Error (mm)1.11MICA
Facial Recognition and ModellingNoW BenchmarkMedian Reconstruction Error0.9MICA
Facial Recognition and ModellingNoW BenchmarkStdev Reconstruction Error (mm)0.92MICA
Facial Recognition and ModellingREALY (side-view)all2.145MICA
Face ReconstructionREALYall2.134MICA
Face ReconstructionNoW BenchmarkMean Reconstruction Error (mm)1.11MICA
Face ReconstructionNoW BenchmarkMedian Reconstruction Error0.9MICA
Face ReconstructionNoW BenchmarkStdev Reconstruction Error (mm)0.92MICA
Face ReconstructionREALY (side-view)all2.145MICA
3DREALYall2.134MICA
3DNoW BenchmarkMean Reconstruction Error (mm)1.11MICA
3DNoW BenchmarkMedian Reconstruction Error0.9MICA
3DNoW BenchmarkStdev Reconstruction Error (mm)0.92MICA
3DREALY (side-view)all2.145MICA
3D Face ModellingREALYall2.134MICA
3D Face ModellingNoW BenchmarkMean Reconstruction Error (mm)1.11MICA
3D Face ModellingNoW BenchmarkMedian Reconstruction Error0.9MICA
3D Face ModellingNoW BenchmarkStdev Reconstruction Error (mm)0.92MICA
3D Face ModellingREALY (side-view)all2.145MICA
3D Face ReconstructionREALYall2.134MICA
3D Face ReconstructionNoW BenchmarkMean Reconstruction Error (mm)1.11MICA
3D Face ReconstructionNoW BenchmarkMedian Reconstruction Error0.9MICA
3D Face ReconstructionNoW BenchmarkStdev Reconstruction Error (mm)0.92MICA
3D Face ReconstructionREALY (side-view)all2.145MICA

Related Papers

ProxyFusion: Face Feature Aggregation Through Sparse Experts2025-09-24Non-Adaptive Adversarial Face Generation2025-07-16Attributes Shape the Embedding Space of Face Recognition Models2025-07-15MGVQ: Could VQ-VAE Beat VAE? A Generalizable Tokenizer with Multi-group Quantization2025-07-14MGVQ: Could VQ-VAE Beat VAE? A Generalizable Tokenizer with Multi-group Quantization2025-07-10Understanding and Improving Length Generalization in Recurrent Models2025-07-03Face mask detection project report.2025-07-02On the Burstiness of Faces in Set2025-06-25