Dissecting the impact of different loss functions with gradient surgery

Hong Xuan, Robert Pless

2022-01-27Metric Learning Retrieval Image Retrieval

Abstract

Pair-wise loss is an approach to metric learning that learns a semantic embedding by optimizing a loss function that encourages images from the same semantic class to be mapped closer than images from different classes. The literature reports a large and growing set of variations of the pair-wise loss strategies. Here we decompose the gradient of these loss functions into components that relate to how they push the relative feature positions of the anchor-positive and anchor-negative pairs. This decomposition allows the unification of a large collection of current pair-wise loss functions. Additionally, explicitly constructing pair-wise gradient updates to separate out these effects gives insights into which have the biggest impact, and leads to a simple algorithm that beats the state of the art for image retrieval on the CAR, CUB and Stanford Online products datasets.

Results

Task	Dataset	Metric	Value	Model
Metric Learning	CARS196	R@1	86.5	Gradient Surgery
Metric Learning	CUB-200-2011	R@1	63.8	Gradient Surgery
Metric Learning	In-Shop	R@1	92.21	Gradient Surgery
Metric Learning	Stanford Online Products	R@1	82.3	Gradient Surgery

Related Papers

Unsupervised Ground Metric Learning2025-07-17 From Roots to Rewards: Dynamic Tree Reasoning with RL2025-07-17 HapticCap: A Multimodal Dataset and Task for Understanding User Experience of Vibration Haptic Signals2025-07-17 A Survey of Context Engineering for Large Language Models2025-07-17 MCoT-RE: Multi-Faceted Chain-of-Thought and Re-Ranking for Training-Free Zero-Shot Composed Image Retrieval2025-07-17 FAR-Net: Multi-Stage Fusion Network with Enhanced Semantic Alignment and Adaptive Reconciliation for Composed Image Retrieval2025-07-17 Are encoders able to learn landmarkers for warm-starting of Hyperparameter Optimization?2025-07-16 Developing Visual Augmented Q&A System using Scalable Vision Embedding Retrieval & Late Interaction Re-ranker2025-07-16