TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/DigiFace-1M: 1 Million Digital Face Images for Face Recogn...

DigiFace-1M: 1 Million Digital Face Images for Face Recognition

Gwangbin Bae, Martin de La Gorce, Tadas Baltrusaitis, Charlie Hewitt, Dong Chen, Julien Valentin, Roberto Cipolla, Jingjing Shen

2022-10-05Face RecognitionSynthetic Data GenerationAttributeSynthetic Face Recognition
PaperPDFCode(official)

Abstract

State-of-the-art face recognition models show impressive accuracy, achieving over 99.8% on Labeled Faces in the Wild (LFW) dataset. Such models are trained on large-scale datasets that contain millions of real human face images collected from the internet. Web-crawled face images are severely biased (in terms of race, lighting, make-up, etc) and often contain label noise. More importantly, the face images are collected without explicit consent, raising ethical concerns. To avoid such problems, we introduce a large-scale synthetic dataset for face recognition, obtained by rendering digital faces using a computer graphics pipeline. We first demonstrate that aggressive data augmentation can significantly reduce the synthetic-to-real domain gap. Having full control over the rendering pipeline, we also study how each attribute (e.g., variation in facial pose, accessories and textures) affects the accuracy. Compared to SynFace, a recent method trained on GAN-generated synthetic faces, we reduce the error rate on LFW by 52.5% (accuracy from 91.93% to 96.17%). By fine-tuning the network on a smaller number of real face images that could reasonably be obtained with consent, we achieve accuracy that is comparable to the methods trained on millions of real face images.

Results

TaskDatasetMetricValueModel
Facial Recognition and ModellingCPLFWAccuracy0.8223DigiFace-1M
Facial Recognition and ModellingLFWAccuracy0.9617DigiFace-1M
Facial Recognition and ModellingCALFWAccuracy0.8255DigiFace-1M
Facial Recognition and ModellingAgeDB-30Accuracy0.811DigiFace-1M
Facial Recognition and ModellingCFP-FPAccuracy0.8981DigiFace-1M
Face ReconstructionCPLFWAccuracy0.8223DigiFace-1M
Face ReconstructionLFWAccuracy0.9617DigiFace-1M
Face ReconstructionCALFWAccuracy0.8255DigiFace-1M
Face ReconstructionAgeDB-30Accuracy0.811DigiFace-1M
Face ReconstructionCFP-FPAccuracy0.8981DigiFace-1M
Face RecognitionCPLFWAccuracy0.8223DigiFace-1M
Face RecognitionLFWAccuracy0.9617DigiFace-1M
Face RecognitionCALFWAccuracy0.8255DigiFace-1M
Face RecognitionAgeDB-30Accuracy0.811DigiFace-1M
Face RecognitionCFP-FPAccuracy0.8981DigiFace-1M
3DCPLFWAccuracy0.8223DigiFace-1M
3DLFWAccuracy0.9617DigiFace-1M
3DCALFWAccuracy0.8255DigiFace-1M
3DAgeDB-30Accuracy0.811DigiFace-1M
3DCFP-FPAccuracy0.8981DigiFace-1M
3D Face ModellingCPLFWAccuracy0.8223DigiFace-1M
3D Face ModellingLFWAccuracy0.9617DigiFace-1M
3D Face ModellingCALFWAccuracy0.8255DigiFace-1M
3D Face ModellingAgeDB-30Accuracy0.811DigiFace-1M
3D Face ModellingCFP-FPAccuracy0.8981DigiFace-1M
3D Face ReconstructionCPLFWAccuracy0.8223DigiFace-1M
3D Face ReconstructionLFWAccuracy0.9617DigiFace-1M
3D Face ReconstructionCALFWAccuracy0.8255DigiFace-1M
3D Face ReconstructionAgeDB-30Accuracy0.811DigiFace-1M
3D Face ReconstructionCFP-FPAccuracy0.8981DigiFace-1M

Related Papers

ProxyFusion: Face Feature Aggregation Through Sparse Experts2025-09-24Non-Adaptive Adversarial Face Generation2025-07-16MGFFD-VLM: Multi-Granularity Prompt Learning for Face Forgery Detection with VLM2025-07-16Attributes Shape the Embedding Space of Face Recognition Models2025-07-15COLIBRI Fuzzy Model: Color Linguistic-Based Representation and Interpretation2025-07-15Ref-Long: Benchmarking the Long-context Referencing Capability of Long-context Language Models2025-07-13Lightweight Safety Guardrails via Synthetic Data and RL-guided Adversarial Training2025-07-11Model Parallelism With Subnetwork Data Parallelism2025-07-11