TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Sample4Geo: Hard Negative Sampling For Cross-View Geo-Loca...

Sample4Geo: Hard Negative Sampling For Cross-View Geo-Localisation

Fabian Deuser, Konrad Habel, Norbert Oswald

2023-03-21ICCV 2023 1Visual Place RecognitionDrone-view target localizationImage-Based LocalizationContrastive LearningCross-View Geo-Localisation
PaperPDFCode(official)

Abstract

Cross-View Geo-Localisation is still a challenging task where additional modules, specific pre-processing or zooming strategies are necessary to determine accurate positions of images. Since different views have different geometries, pre-processing like polar transformation helps to merge them. However, this results in distorted images which then have to be rectified. Adding hard negatives to the training batch could improve the overall performance but with the default loss functions in geo-localisation it is difficult to include them. In this article, we present a simplified but effective architecture based on contrastive learning with symmetric InfoNCE loss that outperforms current state-of-the-art results. Our framework consists of a narrow training pipeline that eliminates the need of using aggregation modules, avoids further pre-processing steps and even increases the generalisation capability of the model to unknown regions. We introduce two types of sampling strategies for hard negatives. The first explicitly exploits geographically neighboring locations to provide a good starting point. The second leverages the visual similarity between the image embeddings in order to mine hard negative samples. Our work shows excellent performance on common cross-view datasets like CVUSA, CVACT, University-1652 and VIGOR. A comparison between cross-area and same-area settings demonstrate the good generalisation capability of our model.

Results

TaskDatasetMetricValueModel
Object LocalizationcvusaRecall@198.68Sample4Geo
Object LocalizationcvusaRecall@1099.78Sample4Geo
Object LocalizationcvusaRecall@599.68Sample4Geo
Object LocalizationcvusaRecall@top1%99.87Sample4Geo
Object LocalizationcvactRecall@190.81Sample4Geo
Object LocalizationcvactRecall@1 (%)98.77Sample4Geo
Object LocalizationcvactRecall@1097.48Sample4Geo
Object LocalizationcvactRecall@596.74Sample4Geo
Object LocalizationVIGOR Cross AreaHit Rate69.87Sample4Geo
Object LocalizationVIGOR Cross AreaRecall@161.7Sample4Geo
Object LocalizationVIGOR Cross AreaRecall@1%98.17Sample4Geo
Object LocalizationVIGOR Cross AreaRecall@1088Sample4Geo
Object LocalizationVIGOR Cross AreaRecall@583.5Sample4Geo
Object LocalizationVIGOR Same AreaHit Rate89.82Sample4Geo
Object LocalizationVIGOR Same AreaRecall@177.86Sample4Geo
Object LocalizationVIGOR Same AreaRecall@1%99.61Sample4Geo
Object LocalizationVIGOR Same AreaRecall@1097.21Sample4Geo
Object LocalizationVIGOR Same AreaRecall@595.66Sample4Geo
Camera LocalizationSpaGBOLTop-150.8Sample4Geo
Camera LocalizationSpaGBOLTop-150.8Sample4Geo
Image RetrievalUniversity-1652AP93.81Sample4Geo
Image RetrievalUniversity-1652Recall@192.65Sample4Geo
Visual Place RecognitionCV-CitiesRecall@174.49Sample4Geo
Visual Place RecognitionCV-CitiesRecall@584.07Sample4Geo
Content-Based Image RetrievalUniversity-1652AP93.81Sample4Geo
Content-Based Image RetrievalUniversity-1652Recall@192.65Sample4Geo

Related Papers

Visual Place Recognition for Large-Scale UAV Applications2025-07-20SemCSE: Semantic Contrastive Sentence Embeddings Using LLM-Generated Summaries For Scientific Abstracts2025-07-17HapticCap: A Multimodal Dataset and Task for Understanding User Experience of Vibration Haptic Signals2025-07-17Overview of the TalentCLEF 2025: Skill and Job Title Intelligence for Human Capital Management2025-07-17SGCL: Unifying Self-Supervised and Supervised Learning for Graph Recommendation2025-07-17Similarity-Guided Diffusion for Contrastive Sequential Recommendation2025-07-16LLM-Driven Dual-Level Multi-Interest Modeling for Recommendation2025-07-15Latent Space Consistency for Sparse-View CT Reconstruction2025-07-15