TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Sampling Matters in Deep Embedding Learning

Sampling Matters in Deep Embedding Learning

Chao-yuan Wu, R. Manmatha, Alexander J. Smola, Philipp Krähenbühl

2017-06-23ICCV 2017 10Face VerificationMetric LearningClusteringRetrievalZero-Shot LearningImage Retrieval
PaperPDFCodeCodeCodeCodeCodeCode

Abstract

Deep embeddings answer one simple question: How similar are two images? Learning these embeddings is the bedrock of verification, zero-shot learning, and visual search. The most prominent approaches optimize a deep convolutional network with a suitable loss function, such as contrastive loss or triplet loss. While a rich line of work focuses solely on the loss functions, we show in this paper that selecting training examples plays an equally important role. We propose distance weighted sampling, which selects more informative and stable examples than traditional approaches. In addition, we show that a simple margin based loss is sufficient to outperform all other loss functions. We evaluate our approach on the Stanford Online Products, CAR196, and the CUB200-2011 datasets for image retrieval and clustering, and on the LFW dataset for face verification. Our method achieves state-of-the-art performance on all of them.

Results

TaskDatasetMetricValueModel
Image RetrievalCARS196R@186.9Margin
Metric LearningCARS196R@179.6ResNet-50 + Margin
Metric Learning CUB-200-2011R@163.6ResNet-50 + Margin

Related Papers

ProxyFusion: Face Feature Aggregation Through Sparse Experts2025-09-24Tri-Learn Graph Fusion Network for Attributed Graph Clustering2025-07-18DiffClean: Diffusion-based Makeup Removal for Accurate Age Estimation2025-07-17Unsupervised Ground Metric Learning2025-07-17From Roots to Rewards: Dynamic Tree Reasoning with RL2025-07-17HapticCap: A Multimodal Dataset and Task for Understanding User Experience of Vibration Haptic Signals2025-07-17A Survey of Context Engineering for Large Language Models2025-07-17MCoT-RE: Multi-Faceted Chain-of-Thought and Re-Ranking for Training-Free Zero-Shot Composed Image Retrieval2025-07-17