TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Integrating Language Guidance into Vision-based Deep Metri...

Integrating Language Guidance into Vision-based Deep Metric Learning

Karsten Roth, Oriol Vinyals, Zeynep Akata

2022-03-16CVPR 2022 1Metric Learning
PaperPDFCode(official)

Abstract

Deep Metric Learning (DML) proposes to learn metric spaces which encode semantic similarities as embedding space distances. These spaces should be transferable to classes beyond those seen during training. Commonly, DML methods task networks to solve contrastive ranking tasks defined over binary class assignments. However, such approaches ignore higher-level semantic relations between the actual classes. This causes learned embedding spaces to encode incomplete semantic context and misrepresent the semantic relation between classes, impacting the generalizability of the learned metric space. To tackle this issue, we propose a language guidance objective for visual similarity learning. Leveraging language embeddings of expert- and pseudo-classnames, we contextualize and realign visual representation spaces corresponding to meaningful language semantics for better semantic consistency. Extensive experiments and ablations provide a strong motivation for our proposed approach and show language guidance offering significant, model-agnostic improvements for DML, achieving competitive and state-of-the-art results on all benchmarks. Code available at https://github.com/ExplainableML/LanguageGuidance_for_DML.

Results

TaskDatasetMetricValueModel
Metric LearningCARS196R@190.2ResNet50 + Language
Metric Learning CUB-200-2011R@171.4ResNet50 + Language
Metric LearningStanford Online ProductsR@181.3ResNet50 + Language

Related Papers

Unsupervised Ground Metric Learning2025-07-17Are encoders able to learn landmarkers for warm-starting of Hyperparameter Optimization?2025-07-16$\texttt{Droid}$: A Resource Suite for AI-Generated Code Detection2025-07-11Grid-Reg: Grid-Based SAR and Optical Image Registration Across Platforms2025-07-06Dare to Plagiarize? Plagiarized Painting Recognition and Retrieval2025-06-29Multimodal Information Retrieval for Open World with Edit Distance Weak Supervision2025-06-25Offline Goal-Conditioned Reinforcement Learning with Projective Quasimetric Planning2025-06-23AbRank: A Benchmark Dataset and Metric-Learning Framework for Antibody-Antigen Affinity Ranking2025-06-21