Hierarchical Average Precision Training for Pertinent Image Retrieval

Elias Ramzi, Nicolas Audebert, Nicolas Thome, Clément Rambour, Xavier Bitot

2022-07-05Metric Learning Image Retrieval

Abstract

Image Retrieval is commonly evaluated with Average Precision (AP) or Recall@k. Yet, those metrics, are limited to binary labels and do not take into account errors' severity. This paper introduces a new hierarchical AP training method for pertinent image retrieval (HAP-PIER). HAPPIER is based on a new H-AP metric, which leverages a concept hierarchy to refine AP by integrating errors' importance and better evaluate rankings. To train deep models with H-AP, we carefully study the problem's structure and design a smooth lower bound surrogate combined with a clustering loss that ensures consistent ordering. Extensive experiments on 6 datasets show that HAPPIER significantly outperforms state-of-the-art methods for hierarchical retrieval, while being on par with the latest approaches when evaluating fine-grained ranking performances. Finally, we show that HAPPIER leads to better organization of the embedding space, and prevents most severe failure cases of non-hierarchical methods. Our code is publicly available at: https://github.com/elias-ramzi/HAPPIER.

Results

Task	Dataset	Metric	Value	Model
Image Retrieval	iNaturalist	R@1	71	HAPPIER_F (ResNet-50)
Image Retrieval	iNaturalist	R@1	70.7	HAPPIER (ResNet-50)
Metric Learning	DyML-Vehicle	Average-mAP	37	HAPPIER
Metric Learning	Stanford Online Products	R@1	81.8	HAPPIER_F
Metric Learning	Stanford Online Products	R@1	81	HAPPIER
Metric Learning	DyML-Animal	Average-mAP	43.8	HAPPIER
Metric Learning	DyML-Product	Average-mAP	38	HAPPIER

Related Papers

Unsupervised Ground Metric Learning2025-07-17 FAR-Net: Multi-Stage Fusion Network with Enhanced Semantic Alignment and Adaptive Reconciliation for Composed Image Retrieval2025-07-17 MCoT-RE: Multi-Faceted Chain-of-Thought and Re-Ranking for Training-Free Zero-Shot Composed Image Retrieval2025-07-17 Are encoders able to learn landmarkers for warm-starting of Hyperparameter Optimization?2025-07-16 $\texttt{Droid}$: A Resource Suite for AI-Generated Code Detection2025-07-11 RadiomicsRetrieval: A Customizable Framework for Medical Image Retrieval Using Radiomics Features2025-07-11 MS-DPPs: Multi-Source Determinantal Point Processes for Contextual Diversity Refinement of Composite Attributes in Text to Image Retrieval2025-07-09 Orchestrator-Agent Trust: A Modular Agentic AI Visual Classification System with Trust-Aware Orchestration and RAG-Based Reasoning2025-07-09