TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/FaceNet: A Unified Embedding for Face Recognition and Clus...

FaceNet: A Unified Embedding for Face Recognition and Clustering

Florian Schroff, Dmitry Kalenichenko, James Philbin

2015-03-12CVPR 2015 6Face RecognitionFace VerificationClusteringFace IdentificationDisguised Face Verification
PaperPDFCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCode

Abstract

Despite significant recent advances in the field of face recognition, implementing face verification and recognition efficiently at scale presents serious challenges to current approaches. In this paper we present a system, called FaceNet, that directly learns a mapping from face images to a compact Euclidean space where distances directly correspond to a measure of face similarity. Once this space has been produced, tasks such as face recognition, verification and clustering can be easily implemented using standard techniques with FaceNet embeddings as feature vectors. Our method uses a deep convolutional network trained to directly optimize the embedding itself, rather than an intermediate bottleneck layer as in previous deep learning approaches. To train, we use triplets of roughly aligned matching / non-matching face patches generated using a novel online triplet mining method. The benefit of our approach is much greater representational efficiency: we achieve state-of-the-art face recognition performance using only 128-bytes per face. On the widely used Labeled Faces in the Wild (LFW) dataset, our system achieves a new record accuracy of 99.63%. On YouTube Faces DB it achieves 95.12%. Our system cuts the error rate in comparison to the best published result by 30% on both datasets. We also introduce the concept of harmonic embeddings, and a harmonic triplet loss, which describe different versions of face embeddings (produced by different networks) that are compatible to each other and allow for direct comparison between each other.

Results

TaskDatasetMetricValueModel
Facial Recognition and ModellingCelebA+masksAccuracy90.96FaceNet
Facial Recognition and ModellingCASIA-WebFace+masksAccuracy84.21FaceNet
Facial Recognition and ModellingMegaFaceAccuracy86.47FaceNet
Face VerificationMegaFaceAccuracy86.47FaceNet
Face ReconstructionCelebA+masksAccuracy90.96FaceNet
Face ReconstructionCASIA-WebFace+masksAccuracy84.21FaceNet
Face ReconstructionMegaFaceAccuracy86.47FaceNet
Face RecognitionCelebA+masksAccuracy90.96FaceNet
Face RecognitionCASIA-WebFace+masksAccuracy84.21FaceNet
3DCelebA+masksAccuracy90.96FaceNet
3DCASIA-WebFace+masksAccuracy84.21FaceNet
3DMegaFaceAccuracy86.47FaceNet
3D Face ModellingCelebA+masksAccuracy90.96FaceNet
3D Face ModellingCASIA-WebFace+masksAccuracy84.21FaceNet
3D Face ModellingMegaFaceAccuracy86.47FaceNet
3D Face ReconstructionCelebA+masksAccuracy90.96FaceNet
3D Face ReconstructionCASIA-WebFace+masksAccuracy84.21FaceNet
3D Face ReconstructionMegaFaceAccuracy86.47FaceNet

Related Papers

ProxyFusion: Face Feature Aggregation Through Sparse Experts2025-09-24Tri-Learn Graph Fusion Network for Attributed Graph Clustering2025-07-18DiffClean: Diffusion-based Makeup Removal for Accurate Age Estimation2025-07-17Non-Adaptive Adversarial Face Generation2025-07-16Ranking Vectors Clustering: Theory and Applications2025-07-16Attributes Shape the Embedding Space of Face Recognition Models2025-07-15Car Object Counting and Position Estimation via Extension of the CLIP-EBC Framework2025-07-11GNN-ViTCap: GNN-Enhanced Multiple Instance Learning with Vision Transformers for Whole Slide Image Classification and Captioning2025-07-09