TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Focus on Local: Finding Reliable Discriminative Regions fo...

Focus on Local: Finding Reliable Discriminative Regions for Visual Place Recognition

Changwei Wang, Shunpeng Chen, Yukun Song, Rongtao Xu, Zherui Zhang, Jiguang Zhang, Haoran Yang, Yu Zhang, Kexue Fu, Shide Du, Zhiwei Xu, Longxiang Gao, Li Guo, Shibiao Xu

2025-04-14Visual Place RecognitionRe-RankingRetrievalImage Retrieval
PaperPDFCode(official)

Abstract

Visual Place Recognition (VPR) is aimed at predicting the location of a query image by referencing a database of geotagged images. For VPR task, often fewer discriminative local regions in an image produce important effects while mundane background regions do not contribute or even cause perceptual aliasing because of easy overlap. However, existing methods lack precisely modeling and full exploitation of these discriminative regions. In this paper, we propose the Focus on Local (FoL) approach to stimulate the performance of image retrieval and re-ranking in VPR simultaneously by mining and exploiting reliable discriminative local regions in images and introducing pseudo-correlation supervision. First, we design two losses, Extraction-Aggregation Spatial Alignment Loss (SAL) and Foreground-Background Contrast Enhancement Loss (CEL), to explicitly model reliable discriminative local regions and use them to guide the generation of global representations and efficient re-ranking. Second, we introduce a weakly-supervised local feature training strategy based on pseudo-correspondences obtained from aggregating global features to alleviate the lack of local correspondences ground truth for the VPR task. Third, we suggest an efficient re-ranking pipeline that is efficiently and precisely based on discriminative region guidance. Finally, experimental results show that our FoL achieves the state-of-the-art on multiple VPR benchmarks in both image retrieval and re-ranking stages and also significantly outperforms existing two-stage VPR methods in terms of computational efficiency. Code and models are available at https://github.com/chenshunpeng/FoL

Results

TaskDatasetMetricValueModel
Visual Place RecognitionSVOX-SnowRecall@199.3FoL
Visual Place RecognitionSVOX-SnowRecall@1099.9FoL
Visual Place RecognitionSVOX-SnowRecall@599.8FoL
Visual Place RecognitionSVOX-SnowRecall@199.1FoL-global
Visual Place RecognitionSVOX-SnowRecall@1099.8FoL-global
Visual Place RecognitionSVOX-SnowRecall@599.7FoL-global
Visual Place RecognitionAmsterTimeRecall@170.1FoL
Visual Place RecognitionAmsterTimeRecall@1090FoL
Visual Place RecognitionAmsterTimeRecall@591.8FoL
Visual Place RecognitionAmsterTimeRecall@164.6FoL-global
Visual Place RecognitionAmsterTimeRecall@1084.3FoL-global
Visual Place RecognitionAmsterTimeRecall@588.2FoL-global
Visual Place RecognitionNordlandRecall@192.6FoL
Visual Place RecognitionNordlandRecall@1098FoL
Visual Place RecognitionNordlandRecall@596.9FoL
Visual Place RecognitionNordlandRecall@187.8FoL-global
Visual Place RecognitionNordlandRecall@1096.4FoL-global
Visual Place RecognitionNordlandRecall@594.5FoL-global
Visual Place RecognitionSVOX-NightRecall@198.8FoL
Visual Place RecognitionSVOX-NightRecall@1099.9FoL
Visual Place RecognitionSVOX-NightRecall@599.8FoL
Visual Place RecognitionSVOX-NightRecall@198.3FoL-global
Visual Place RecognitionSVOX-NightRecall@1099.6FoL-global
Visual Place RecognitionSVOX-NightRecall@599.6FoL-global
Visual Place RecognitionSF-XL OcclusionRecall@161.8FoL
Visual Place RecognitionSF-XL OcclusionRecall@1077.6FoL
Visual Place RecognitionSF-XL OcclusionRecall@577.6FoL
Visual Place RecognitionSF-XL OcclusionRecall@151.3FoL-global
Visual Place RecognitionSF-XL OcclusionRecall@10737FoL-global
Visual Place RecognitionSF-XL OcclusionRecall@565.8FoL-global
Visual Place RecognitionSF-XL NightRecall@160.5FoL
Visual Place RecognitionSF-XL NightRecall@1075.8FoL
Visual Place RecognitionSF-XL NightRecall@572.8FoL
Visual Place RecognitionSF-XL NightRecall@153.4FoL-global
Visual Place RecognitionSF-XL NightRecall@1071.7FoL-global
Visual Place RecognitionSF-XL NightRecall@565.9FoL-global
Visual Place RecognitionSt LuciaRecall@199.9FoL-global
Visual Place RecognitionSt LuciaRecall@10100FoL-global
Visual Place RecognitionSt LuciaRecall@5100FoL-global
Visual Place RecognitionSt LuciaRecall@199.9FoL
Visual Place RecognitionSt LuciaRecall@10100FoL
Visual Place RecognitionSt LuciaRecall@5100FoL
Visual Place RecognitionPittsburgh-250k-testRecall@197FoL
Visual Place RecognitionPittsburgh-250k-testRecall@1099.2FoL
Visual Place RecognitionPittsburgh-250k-testRecall@599.5FoL
Visual Place RecognitionPittsburgh-250k-testRecall@196.5FoL-global
Visual Place RecognitionPittsburgh-250k-testRecall@1099.1FoL-global
Visual Place RecognitionPittsburgh-250k-testRecall@599.5FoL-global
Visual Place RecognitionSVOXRecall@198.9FoL
Visual Place RecognitionSVOXRecall@1099.7FoL
Visual Place RecognitionSVOXRecall@599.6FoL
Visual Place RecognitionSVOXRecall@198.4FoL-global
Visual Place RecognitionSVOXRecall@1099.6FoL-global
Visual Place RecognitionSVOXRecall@599.4FoL-global
Visual Place RecognitionSPEDRecall@192.1FoL-global
Visual Place RecognitionSPEDRecall@1098FoL-global
Visual Place RecognitionSPEDRecall@596.5FoL-global
Visual Place RecognitionSPEDRecall@191.8FoL
Visual Place RecognitionSPEDRecall@1097.4FoL
Visual Place RecognitionSPEDRecall@596.5FoL
Visual Place RecognitionPittsburgh-30k-testRecall@194.5FoL
Visual Place RecognitionPittsburgh-30k-testRecall@1098.2FoL
Visual Place RecognitionPittsburgh-30k-testRecall@597.4FoL
Visual Place RecognitionPittsburgh-30k-testRecall@193.9FoL-global
Visual Place RecognitionPittsburgh-30k-testRecall@1098.1FoL-global
Visual Place RecognitionPittsburgh-30k-testRecall@597.2FoL-global
Visual Place RecognitionTokyo247Recall@198.4FoL
Visual Place RecognitionTokyo247Recall@1099.4FoL
Visual Place RecognitionTokyo247Recall@599.1FoL
Visual Place RecognitionTokyo247Recall@196.2FoL-global
Visual Place RecognitionTokyo247Recall@1098.7FoL-global
Visual Place RecognitionTokyo247Recall@598.7FoL-global
Visual Place RecognitionMapillary valRecall@193.5FoL
Visual Place RecognitionMapillary valRecall@1097.6FoL
Visual Place RecognitionMapillary valRecall@596.9FoL
Visual Place RecognitionMapillary valRecall@193.1FoL-global
Visual Place RecognitionMapillary valRecall@1097.4FoL-global
Visual Place RecognitionMapillary valRecall@596.9FoL-global
Visual Place RecognitionSVOX-RainRecall@198.2FoL
Visual Place RecognitionSVOX-RainRecall@196.5FoL-global
Visual Place RecognitionMapillary testRecall@180FoL
Visual Place RecognitionMapillary testRecall@1093FoL
Visual Place RecognitionMapillary testRecall@590.9FoL
Visual Place RecognitionMapillary testRecall@178.7FoL-global
Visual Place RecognitionMapillary testRecall@1093FoL-global
Visual Place RecognitionMapillary testRecall@590.8FoL-global
Visual Place RecognitionEynshamRecall@192.4FoL
Visual Place RecognitionEynshamRecall@1095.8FoL
Visual Place RecognitionEynshamRecall@596.6FoL
Visual Place RecognitionEynshamRecall@191.7FoL-global
Visual Place RecognitionEynshamRecall@1095.3FoL-global
Visual Place RecognitionEynshamRecall@596.2FoL-global
Visual Place RecognitionSVOX-OvercastRecall@198.2FoL
Visual Place RecognitionSVOX-OvercastRecall@1099.7FoL
Visual Place RecognitionSVOX-OvercastRecall@599.3FoL
Visual Place RecognitionSVOX-OvercastRecall@197.9FoL-global
Visual Place RecognitionSVOX-OvercastRecall@1099.3FoL-global
Visual Place RecognitionSVOX-OvercastRecall@599.2FoL-global
Visual Place RecognitionSVOX-SunRecall@198.8FoL
Visual Place RecognitionSVOX-SunRecall@1099.9FoL
Visual Place RecognitionSVOX-SunRecall@599.8FoL
Visual Place RecognitionSVOX-SunRecall@198.1FoL- global
Visual Place RecognitionSVOX-SunRecall@1099.5FoL- global
Visual Place RecognitionSVOX-SunRecall@599.4FoL- global
Visual Place RecognitionNordland* (2760 queries)Recall@185.5FoL
Visual Place RecognitionNordland* (2760 queries)Recall@1096.5FoL
Visual Place RecognitionNordland* (2760 queries)Recall@594.6FoL
Visual Place RecognitionNordland* (2760 queries)Recall@178.3FoL-global
Visual Place RecognitionNordland* (2760 queries)Recall@1094FoL-global
Visual Place RecognitionNordland* (2760 queries)Recall@590.8FoL-global

Related Papers

Visual Place Recognition for Large-Scale UAV Applications2025-07-20Overview of the TalentCLEF 2025: Skill and Job Title Intelligence for Human Capital Management2025-07-17MCoT-RE: Multi-Faceted Chain-of-Thought and Re-Ranking for Training-Free Zero-Shot Composed Image Retrieval2025-07-17From Roots to Rewards: Dynamic Tree Reasoning with RL2025-07-17HapticCap: A Multimodal Dataset and Task for Understanding User Experience of Vibration Haptic Signals2025-07-17A Survey of Context Engineering for Large Language Models2025-07-17FAR-Net: Multi-Stage Fusion Network with Enhanced Semantic Alignment and Adaptive Reconciliation for Composed Image Retrieval2025-07-17Developing Visual Augmented Q&A System using Scalable Vision Embedding Retrieval & Late Interaction Re-ranker2025-07-16