Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Global Proxy-based Hard Mining for Visual Place Recognition

Amar Ali-bey, Brahim Chaib-Draa, Philippe Giguère

2023-02-28
Tasks: Image Similarity Search, Metric Learning, Visual Place Recognition, Image Retrieval

Abstract

Learning deep representations for visual place recognition is commonly performed using pairwise or triplet loss functions that depend heavily on the hardness of the examples sampled at each training iteration. Existing techniques address this with offline hard mining, which is expensive in both computation and memory and consists of identifying, at each iteration, the hardest samples from the training set. In this paper we introduce a new technique that performs global hard mini-batch sampling based on proxies. To do so, we add a new end-to-end trainable branch to the network, which generates efficient place descriptors (one proxy for each place). These proxy representations are then used to construct a global index that encompasses the similarities between all places in the dataset, allowing for highly informative mini-batch sampling at each training iteration. Our method can be used in combination with all existing pairwise and triplet loss functions with negligible additional memory and computation cost. We run extensive ablation studies and show that our technique brings new state-of-the-art performance on multiple large-scale benchmarks such as Pittsburgh, Mapillary-SLS and SPED. In particular, our method provides more than 100% relative improvement on the challenging Nordland dataset. Our code is available at https://github.com/amaralibey/GPM
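
The sampling idea in the abstract (one proxy per place, a global similarity index over all proxies, mini-batches built from mutually similar places) can be illustrated with a small sketch. This is not the authors' implementation: the function name `sample_hard_batches`, the greedy nearest-neighbour grouping, and the use of cosine similarity on a dense matrix are all assumptions made for illustration, and the random proxies stand in for the descriptors that the trainable branch would produce.

```python
import numpy as np


def sample_hard_batches(proxies, places_per_batch, rng):
    """Greedily group places into mini-batches of mutually similar places.

    proxies: (num_places, dim) array, one proxy descriptor per place
    (hypothetical stand-in for the output of the proxy branch).
    """
    # L2-normalise so that the dot product equals cosine similarity.
    normed = proxies / np.linalg.norm(proxies, axis=1, keepdims=True)
    sim = normed @ normed.T
    np.fill_diagonal(sim, -np.inf)  # a place is never its own neighbour

    remaining = set(range(len(proxies)))
    batches = []
    while len(remaining) >= places_per_batch:
        # Pick a random unused place as the seed of the next batch.
        seed = int(rng.choice(sorted(remaining)))
        remaining.remove(seed)
        # Rank the unused places by similarity to the seed, most similar first.
        candidates = sorted(remaining, key=lambda j: sim[seed, j], reverse=True)
        neighbours = candidates[: places_per_batch - 1]
        remaining.difference_update(neighbours)
        batches.append([seed] + neighbours)
    return batches


# Usage with random proxies (assumed shapes: 12 places, 8-dimensional proxies).
rng = np.random.default_rng(0)
demo_proxies = rng.normal(size=(12, 8))
demo_batches = sample_hard_batches(demo_proxies, places_per_batch=4, rng=rng)
print(demo_batches)
```

Each batch then contains places whose proxies are close to one another, so a pairwise or triplet loss computed inside the batch sees hard negatives. With a large number of places, the dense similarity matrix would typically be replaced by a nearest-neighbour index.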

Results

| Task | Dataset | Metric | Value | Model |
| Visual Place Recognition | Nordland | Recall@1 | 44.9 | NetVLAD (with GPM) |
| Visual Place Recognition | Nordland | Recall@5 | 50.2 | NetVLAD (with GPM) |
| Visual Place Recognition | Pittsburgh-250k-test | Recall@1 | 91.5 | NetVLAD (with GPM) |
| Visual Place Recognition | Pittsburgh-250k-test | Recall@5 | 97.2 | NetVLAD (with GPM) |
| Visual Place Recognition | Pittsburgh-250k-test | Recall@10 | 98.1 | NetVLAD (with GPM) |
| Visual Place Recognition | Mapillary val | Recall@1 | 82 | NetVLAD (with GPM) |
| Visual Place Recognition | Mapillary val | Recall@5 | 90.4 | NetVLAD (with GPM) |
| Visual Place Recognition | Mapillary val | Recall@10 | 91.4 | NetVLAD (with GPM) |
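
For readers unfamiliar with the metric, Recall@K in retrieval-style evaluation is usually computed as the percentage of queries for which at least one of the top K retrieved items is a correct match. The sketch below assumes that definition; the function name `recall_at_k` and the toy inputs are hypothetical, and the exact matching criterion behind the numbers above (for example a distance threshold per dataset) is not stated on this page.

```python
def recall_at_k(ranked_ids, positives, ks):
    """Percentage of queries with at least one correct match in the top k.

    ranked_ids: per query, retrieved database ids ordered best first.
    positives:  per query, the set of database ids that count as correct.
    ks:         the cut-offs to report, e.g. (1, 5, 10).
    """
    hits = {k: 0 for k in ks}
    for retrieved, truth in zip(ranked_ids, positives):
        for k in ks:
            if any(r in truth for r in retrieved[:k]):
                hits[k] += 1
    return {k: 100.0 * hits[k] / len(ranked_ids) for k in ks}


# Toy usage: two queries, three retrieved ids each.
toy_ranked = [[3, 1, 2], [5, 4, 0]]
toy_positives = [{1}, {9}]
print(recall_at_k(toy_ranked, toy_positives, ks=(1, 2)))
```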

Related Papers

Visual Place Recognition for Large-Scale UAV Applications (2025-07-20)
Unsupervised Ground Metric Learning (2025-07-17)
FAR-Net: Multi-Stage Fusion Network with Enhanced Semantic Alignment and Adaptive Reconciliation for Composed Image Retrieval (2025-07-17)
MCoT-RE: Multi-Faceted Chain-of-Thought and Re-Ranking for Training-Free Zero-Shot Composed Image Retrieval (2025-07-17)
Are encoders able to learn landmarkers for warm-starting of Hyperparameter Optimization? (2025-07-16)
$\texttt{Droid}$: A Resource Suite for AI-Generated Code Detection (2025-07-11)
RadiomicsRetrieval: A Customizable Framework for Medical Image Retrieval Using Radiomics Features (2025-07-11)
MS-DPPs: Multi-Source Determinantal Point Processes for Contextual Diversity Refinement of Composite Attributes in Text to Image Retrieval (2025-07-09)