
Query-Based Adaptive Aggregation for Multi-Dataset Joint Training Toward Universal Visual Place Recognition

Jiuhong Xiao, Yang Zhou, Giuseppe Loianno

2025-07-04 | Visual Place Recognition

Abstract

Deep learning methods for Visual Place Recognition (VPR) have advanced significantly, largely driven by large-scale datasets. However, most existing approaches are trained on a single dataset, which can introduce dataset-specific inductive biases and limit model generalization. While multi-dataset joint training offers a promising solution for developing universal VPR models, divergences among training datasets can saturate limited information capacity in feature aggregation layers, leading to suboptimal performance. To address these challenges, we propose Query-based Adaptive Aggregation (QAA), a novel feature aggregation technique that leverages learned queries as reference codebooks to effectively enhance information capacity without significant computational or parameter complexity. We show that computing the Cross-query Similarity (CS) between query-level image features and reference codebooks provides a simple yet effective way to generate robust descriptors. Our results demonstrate that QAA outperforms state-of-the-art models, achieving balanced generalization across diverse datasets while maintaining peak performance comparable to dataset-specific models. Ablation studies further explore QAA's mechanisms and scalability. Visualizations reveal that the learned queries exhibit diverse attention patterns across datasets. Code will be publicly released.
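The aggregation idea in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the single-head softmax attention used to pool patch tokens into query-level features, the fixed random reference codebook, the cosine similarity, and all shapes and names (`query_features`, `cross_query_similarity`, `feature_queries`, `reference_codebook`) are assumptions made for illustration. Only the overall pattern comes from the abstract: compare query-level image features against a reference codebook and use the resulting similarity matrix as the descriptor.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def l2_normalize(x, axis=-1, eps=1e-12):
    return x / (np.linalg.norm(x, axis=axis, keepdims=True) + eps)

def query_features(patch_tokens, queries):
    # Assumed pooling step: each query attends over all patch tokens.
    # patch_tokens: (N, D), queries: (Nq, D) -> returns (Nq, D)
    d = patch_tokens.shape[-1]
    attn = softmax(queries @ patch_tokens.T / np.sqrt(d), axis=-1)
    return attn @ patch_tokens

def cross_query_similarity(patch_tokens, feature_queries, reference_codebook):
    # Cosine similarity between query-level features and the codebook,
    # flattened into one L2-normalized descriptor of length Nq * Nr.
    feats = l2_normalize(query_features(patch_tokens, feature_queries))
    refs = l2_normalize(reference_codebook)
    sim = feats @ refs.T  # (Nq, Nr)
    return l2_normalize(sim.reshape(-1))

# Toy inputs; in a trained model the queries and codebook would be learned.
rng = np.random.default_rng(0)
patch_tokens = rng.standard_normal((196, 32))        # hypothetical backbone output
feature_queries = rng.standard_normal((8, 32))       # hypothetical learned queries
reference_codebook = rng.standard_normal((16, 32))   # hypothetical reference codebook

descriptor = cross_query_similarity(patch_tokens, feature_queries, reference_codebook)
print(descriptor.shape)  # (128,)
```

In this sketch the descriptor length is the product of the number of queries and the number of codebook entries, so it can be changed without altering the backbone feature dimension.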

Results

| Task | Dataset | Metric | Value | Model |
| --- | --- | --- | --- | --- |
| Visual Place Recognition | SVOX-Snow | Recall@1 | 99.1 | QAA-DINOv2-B-8192 |
| Visual Place Recognition | AmsterTime | Recall@1 | 63.7 | QAA-DINOv2-B-8192 |
| Visual Place Recognition | Nordland | Recall@1 | 96.7 | QAA-DINOv2-B-8192 |
| Visual Place Recognition | SF-XL test v1 | Recall@1 | 94.4 | QAA-DINOv2-B-8192 |
| Visual Place Recognition | SVOX-Night | Recall@1 | 97.2 | QAA-DINOv2-B-8192 |
| Visual Place Recognition | Pittsburgh-250k-test | Recall@1 | 96.6 | QAA-DINOv2-B-8192 |
| Visual Place Recognition | SPED | Recall@1 | 91.8 | QAA-DINOv2-B-8192 |
| Visual Place Recognition | Pittsburgh-30k-test | Recall@1 | 94.4 | QAA-DINOv2-B-8192 |
| Visual Place Recognition | Tokyo247 | Recall@1 | 98.4 | QAA-DINOv2-B-8192 |
| Visual Place Recognition | SF-XL test v2 | Recall@1 | 94.6 | QAA-DINOv2-B-8192 |
| Visual Place Recognition | Mapillary val | Recall@1 | 97.6 | QAA-DINOv2-B-8192 |
| Visual Place Recognition | SVOX-Rain | Recall@1 | 98.4 | QAA-DINOv2-B-8192 |
| Visual Place Recognition | Mapillary test | Recall@1 | 85.7 | QAA-DINOv2-B-8192 |
| Visual Place Recognition | Eynsham | Recall@1 | 92.9 | QAA-DINOv2-B-8192 |
| Visual Place Recognition | SVOX-Overcast | Recall@1 | 98.4 | QAA-DINOv2-B-8192 |
| Visual Place Recognition | SVOX-Sun | Recall@1 | 97.3 | QAA-DINOv2-B-8192 |
| Visual Place Recognition | Nordland* (2760 queries) | Recall@1 | 91.8 | QAA-DINOv2-B-8192 |
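For readers unfamiliar with the metric, the sketch below shows how Recall@K is commonly computed in place recognition: a query counts as correct if at least one of its top-K retrieved database images is a ground-truth match. The function name `recall_at_k`, the toy descriptors, and the use of dot-product similarity on unit-length descriptors are assumptions for illustration; the page does not describe the evaluation protocol used for the values above.

```python
import numpy as np

def recall_at_k(query_desc, db_desc, positives, k=1):
    # query_desc: (Q, D), db_desc: (M, D), both assumed L2-normalized.
    # positives[i]: database indices that are true matches for query i.
    sims = query_desc @ db_desc.T                 # (Q, M) similarity matrix
    topk = np.argsort(-sims, axis=1)[:, :k]       # indices of the k best matches
    hits = [np.isin(topk[i], positives[i]).any() for i in range(len(positives))]
    return 100.0 * float(np.mean(hits))           # percentage of correct queries

# Toy example with three database images and three queries.
db = np.array([[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0]])
queries = np.array([[0.8, 0.6], [0.6, 0.8], [0.8, -0.6]])
positives = [[0], [1], [2]]  # the third query's true match is ranked last

r1 = recall_at_k(queries, db, positives, k=1)
r3 = recall_at_k(queries, db, positives, k=3)
print(round(r1, 2), round(r3, 2))  # 66.67 100.0
```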

Related Papers

Visual Place Recognition for Large-Scale UAV Applications (2025-07-20)
Adversarial Attacks and Detection in Visual Place Recognition for Safer Robot Navigation (2025-06-19)
Astra: Toward General-Purpose Mobile Robots via Hierarchical Multimodal Learning (2025-06-06)
HypeVPR: Exploring Hyperbolic Space for Perspective to Equirectangular Visual Place Recognition (2025-06-05)
TAT-VPR: Ternary Adaptive Transformer for Dynamic and Efficient Visual Place Recognition (2025-05-22)
Place Recognition: A Comprehensive Review, Current Challenges and Future Directions (2025-05-20)
MMS-VPR: Multimodal Street-Level Visual Place Recognition Dataset and Benchmark (2025-05-18)
Geolocating Earth Imagery from ISS: Integrating Machine Learning with Astronaut Photography for Enhanced Geographic Mapping (2025-04-29)