TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/CV-Cities: Advancing Cross-View Geo-Localization in Global...

CV-Cities: Advancing Cross-View Geo-Localization in Global Cities

Gaoshuang Huang, Yang Zhou, Luying Zhao, Wenjian Gan

2024-11-19geo-localizationVisual Place RecognitionDrone-view target localizationImage-Based LocalizationCross-View Geo-Localisation
PaperPDFCode(official)

Abstract

Cross-view geo-localization (CVGL), which involves matching and retrieving satellite images to determine the geographic location of a ground image, is crucial in GNSS-constrained scenarios. However, this task faces significant challenges due to substantial viewpoint discrepancies, the complexity of localization scenarios, and the need for global localization. To address these issues, we propose a novel CVGL framework that integrates the vision foundational model DINOv2 with an advanced feature mixer. Our framework introduces the symmetric InfoNCE loss and incorporates near-neighbor sampling and dynamic similarity sampling strategies, significantly enhancing localization accuracy. Experimental results show that our framework surpasses existing methods across multiple public and self-built datasets. To further improve globalscale performance, we have developed CV-Cities, a novel dataset for global CVGL. CV-Cities includes 223,736 ground-satellite image pairs with geolocation data, spanning sixteen cities across six continents and covering a wide range of complex scenarios, providing a challenging benchmark for CVGL. The framework trained with CV-Cities demonstrates high localization accuracy in various test cities, highlighting its strong globalization and generalization capabilities. Our datasets and codes are available at https://github.com/GaoShuang98/CVCities.

Results

TaskDatasetMetricValueModel
Object LocalizationcvusaRecall@199.19CV-Cities
Object LocalizationcvusaRecall@1099.85CV-Cities
Object LocalizationcvusaRecall@599.8CV-Cities
Object LocalizationcvusaRecall@top1%99.92CV-Cities
Object LocalizationcvactRecall@192.59CV-Cities
Object LocalizationcvactRecall@1 (%)98.72CV-Cities
Object LocalizationcvactRecall@1097.82CV-Cities
Object LocalizationcvactRecall@597.16CV-Cities
Object LocalizationVIGOR Cross AreaHit Rate75.97CV-Cities
Object LocalizationVIGOR Cross AreaRecall@164.61CV-Cities
Object LocalizationVIGOR Cross AreaRecall@1%98.63CV-Cities
Object LocalizationVIGOR Cross AreaRecall@1091.2CV-Cities
Object LocalizationVIGOR Cross AreaRecall@587.48CV-Cities
Object LocalizationVIGOR Same AreaHit Rate90.76CV-Cities
Object LocalizationVIGOR Same AreaRecall@178.27CV-Cities
Object LocalizationVIGOR Same AreaRecall@1%99.67CV-Cities
Object LocalizationVIGOR Same AreaRecall@1097.52CV-Cities
Object LocalizationVIGOR Same AreaRecall@596.1CV-Cities
Image RetrievalUniversity-1652AP95.01CV-Cities
Image RetrievalUniversity-1652Recall@197.43CV-Cities
Visual Place RecognitionCV-CitiesRecall@182.91CV-Cities
Visual Place RecognitionCV-CitiesRecall@590.14CV-Cities
Content-Based Image RetrievalUniversity-1652AP95.01CV-Cities
Content-Based Image RetrievalUniversity-1652Recall@197.43CV-Cities

Related Papers

Visual Place Recognition for Large-Scale UAV Applications2025-07-20Grid-Reg: Grid-Based SAR and Optical Image Registration Across Platforms2025-07-06Query-Based Adaptive Aggregation for Multi-Dataset Joint Training Toward Universal Visual Place Recognition2025-07-04Adversarial Attacks and Detection in Visual Place Recognition for Safer Robot Navigation2025-06-19Recognition through Reasoning: Reinforcing Image Geo-localization with Large Vision-Language Models2025-06-17A Two-stage Optimization Method for Wide-range Single-electron Quantum Magnetic Sensing2025-06-16Astra: Toward General-Purpose Mobile Robots via Hierarchical Multimodal Learning2025-06-06HypeVPR: Exploring Hyperbolic Space for Perspective to Equirectangular Visual Place Recognition2025-06-05