ArcFace

Additive Angular Margin Loss

General, introduced 2018, 94 papers

Description

ArcFace, or Additive Angular Margin Loss, is a loss function used in face recognition tasks. Softmax loss is traditionally used in these tasks. However, softmax loss does not explicitly optimise the feature embedding to enforce higher similarity for intra-class samples and diversity for inter-class samples, which results in a performance gap for deep face recognition under large intra-class appearance variations.

ArcFace starts from the logit written as $W_{j}^{T}x_{i} = \|W_{j}\|\,\|x_{i}\|\cos\theta_{j}$, where $\theta_{j}$ is the angle between the weight $W_{j}$ and the feature $x_{i}$. Each weight norm is fixed to $\|W_{j}\| = 1$ by $l_{2}$ normalisation. The feature norm $\|x_{i}\|$ is likewise fixed by $l_{2}$ normalisation and then re-scaled to $s$. Normalising both features and weights makes the predictions depend only on the angle between the feature and the weight. The learned embedding features are thus distributed on a hypersphere with a radius of $s$. Finally, an additive angular margin penalty $m$ is added to the angle $\theta_{y_{i}}$ between $x_{i}$ and its ground-truth class weight $W_{y_{i}}$ to simultaneously enhance the intra-class compactness and inter-class discrepancy. Since the proposed additive angular margin penalty is equal to the geodesic distance margin penalty on the normalised hypersphere, the method is named ArcFace:

$L_{3} = -\frac{1}{N}\sum_{i=1}^{N}\log\frac{e^{s\cos\left(\theta_{y_{i}} + m\right)}}{e^{s\cos\left(\theta_{y_{i}} + m\right)} + \sum_{j=1, j \neq y_{i}}^{n}e^{s\cos\theta_{j}}}$
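As a rough illustration, the formula can be sketched in plain Python. This is a minimal sketch under stated assumptions, not the authors' implementation: the function names (`l2_normalize`, `arcface_logits`, `arcface_loss`) are made up for this example, the defaults `s=64.0` and `m=0.5` are assumed values that the text above does not specify, and the sketch does not handle the case where $\theta_{y_{i}} + m$ exceeds $\pi$.

```python
import math


def l2_normalize(v):
    """Scale a non-zero vector to unit l2 norm."""
    norm = math.sqrt(sum(c * c for c in v))
    return [c / norm for c in v]


def arcface_logits(x, W, y, s=64.0, m=0.5):
    """Scaled logits for one feature x, class weights W and label y.

    s and m are assumed defaults, chosen for illustration only.
    """
    x_hat = l2_normalize(x)
    logits = []
    for j, w in enumerate(W):
        w_hat = l2_normalize(w)
        # With unit-norm vectors the dot product is cos(theta_j).
        cos_theta = sum(a * b for a, b in zip(x_hat, w_hat))
        # Clamp against rounding error before taking acos.
        cos_theta = max(-1.0, min(1.0, cos_theta))
        theta = math.acos(cos_theta)
        if j == y:
            # Additive angular margin on the ground-truth class only.
            theta += m
        logits.append(s * math.cos(theta))
    return logits


def arcface_loss(X, W, Y, s=64.0, m=0.5):
    """Mean cross-entropy over the margin-adjusted, scaled logits."""
    total = 0.0
    for x, y in zip(X, Y):
        logits = arcface_logits(x, W, y, s, m)
        # Log-sum-exp with the maximum subtracted for numerical stability.
        mx = max(logits)
        log_sum = mx + math.log(sum(math.exp(z - mx) for z in logits))
        total += log_sum - logits[y]
    return total / len(X)


# Hypothetical toy data: two 2-D features and two class weight vectors.
X = [[1.0, 0.0], [0.0, 1.0]]
W = [[1.0, 0.1], [0.1, 1.0]]
Y = [0, 1]
print(arcface_loss(X, W, Y, s=10.0, m=0.5))
print(arcface_loss(X, W, Y, s=10.0, m=0.0))
```

With `m=0.0` the function reduces to a softmax cross-entropy on scaled cosine logits, so comparing the two printed values on the same inputs shows the extra penalty that the margin adds.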

The authors select face images from 8 different identities with enough samples (around 1,500 images per class) to train 2-D feature embedding networks with the softmax and ArcFace losses, respectively. As the corresponding figure shows, the softmax loss provides a roughly separable feature embedding but produces noticeable ambiguity in the decision boundaries, while the ArcFace loss enforces a clearer gap between the nearest classes.
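One way to see where this gap comes from is to compare decision boundaries in a two-class case. The following is a short worked sketch, assuming normalised weights and features, angles in $[0, \pi]$ and $\theta_{1} + m \le \pi$; the labels "class 1" and "class 2" are introduced here for illustration. Without a margin, a sample is assigned to class 1 when

$s\cos\theta_{1} > s\cos\theta_{2} \iff \theta_{1} < \theta_{2}$

With the additive angular margin, a class-1 training sample is only treated as correctly classified when

$s\cos\left(\theta_{1} + m\right) > s\cos\theta_{2} \iff \theta_{1} + m < \theta_{2}$

Both equivalences hold because cosine is strictly decreasing on $[0, \pi]$. Training therefore encourages each sample to sit at least $m$ radians closer to its own class weight than to any other, which leaves an angular band of width $m$ between neighbouring classes.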

Other alternatives to enforce intra-class compactness and inter-class distance include Supervised Contrastive Learning.

Papers Using This Method

- Enhancing Few-shot Keyword Spotting Performance through Pre-Trained Self-supervised Speech Models (2025-06-21)
- Towards Large-Scale Pose-Invariant Face Recognition Using Face Defrontalization (2025-06-04)
- Accuracy and Fairness of Facial Recognition Technology in Low-Quality Police Images: An Experiment With Synthetic Faces (2025-05-20)
- LaPIG: Cross-Modal Generation of Paired Thermal and Visible Facial Images (2025-03-20)
- Universal Embedding Function for Traffic Classification via QUIC Domain Recognition Pretraining: A Transfer Learning Success (2025-02-18)
- Omni-ID: Holistic Identity Representation Designed for Generative Tasks (2024-12-12)
- Multispecies Animal Re-ID Using a Large Community-Curated Dataset (2024-12-07)
- Pairwise Discernment of AffectNet Expressions with ArcFace (2024-12-01)
- Hypersphere Secure Sketch Revisited: Probabilistic Linear Regression Attack on IronMask in Multiple Usage (2024-09-19)
- HyperSpaceX: Radial and Angular Exploration of HyperSpherical Dimensions (2024-08-05)
- Analyzing the Feature Extractor Networks for Face Image Synthesis (2024-06-04)
- Deep Privacy Funnel Model: From a Discriminative to a Generative Approach with an Application to Face Recognition (2024-04-03)
- Arc2Face: A Foundation Model for ID-Consistent Human Faces (2024-03-18)
- VIGFace: Virtual Identity Generation for Privacy-Free Face Recognition (2024-03-13)
- Mitigating the Impact of Attribute Editing on Face Recognition (2024-03-12)
- X2-Softmax: Margin Adaptive Loss Function for Face Recognition (2023-12-08)
- Improved Face Representation via Joint Label Classification and Supervised Contrastive Clustering (2023-12-07)
- A Universal Anti-Spoofing Approach for Contactless Fingerprint Biometric Systems (2023-10-23)
- An Empirical Study of Self-supervised Learning with Wasserstein Distance (2023-10-16)
- Trading-off Mutual Information on Feature Aggregation for Face Recognition (2023-09-22)
