TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Each Part Matters: Local Patterns Facilitate Cross-view Ge...

Each Part Matters: Local Patterns Facilitate Cross-view Geo-localization

Tingyu Wang, Zhedong Zheng, Chenggang Yan, Jiyong Zhang, Yaoqi Sun, Bolun Zheng, Yi Yang

2020-08-26Representation Learninggeo-localizationDrone navigationDrone-view target localizationImage-Based Localization
PaperPDFCode(official)

Abstract

Cross-view geo-localization is to spot images of the same geographic target from different platforms, e.g., drone-view cameras and satellites. It is challenging in the large visual appearance changes caused by extreme viewpoint variations. Existing methods usually concentrate on mining the fine-grained feature of the geographic target in the image center, but underestimate the contextual information in neighbor areas. In this work, we argue that neighbor areas can be leveraged as auxiliary information, enriching discriminative clues for geolocalization. Specifically, we introduce a simple and effective deep neural network, called Local Pattern Network (LPN), to take advantage of contextual information in an end-to-end manner. Without using extra part estimators, LPN adopts a square-ring feature partition strategy, which provides the attention according to the distance to the image center. It eases the part matching and enables the part-wise representation learning. Owing to the square-ring partition design, the proposed LPN has good scalability to rotation variations and achieves competitive results on three prevailing benchmarks, i.e., University-1652, CVUSA and CVACT. Besides, we also show the proposed LPN can be easily embedded into other frameworks to further boost performance.

Results

TaskDatasetMetricValueModel
Object LocalizationcvusaRecall@185.79LPN
Object LocalizationcvusaRecall@1096.98LPN
Object LocalizationcvusaRecall@595.38LPN
Object LocalizationcvusaRecall@top1%99.41LPN
Object LocalizationcvactRecall@179.99LPN
Object LocalizationcvactRecall@1 (%)97.03LPN
Object LocalizationcvactRecall@1092.56LPN
Object LocalizationcvactRecall@590.63LPN
Image RetrievalUniversity-1652AP74.79LPN
Image RetrievalUniversity-1652Recall@186.45LPN
Image RetrievalUniversity-1652AP79.14LPN
Image RetrievalUniversity-1652Recall@175.93LPN
Content-Based Image RetrievalUniversity-1652AP74.79LPN
Content-Based Image RetrievalUniversity-1652Recall@186.45LPN
Content-Based Image RetrievalUniversity-1652AP79.14LPN
Content-Based Image RetrievalUniversity-1652Recall@175.93LPN

Related Papers

Touch in the Wild: Learning Fine-Grained Manipulation with a Portable Visuo-Tactile Gripper2025-07-20Spectral Bellman Method: Unifying Representation and Exploration in RL2025-07-17Boosting Team Modeling through Tempo-Relational Representation Learning2025-07-17Similarity-Guided Diffusion for Contrastive Sequential Recommendation2025-07-16Are encoders able to learn landmarkers for warm-starting of Hyperparameter Optimization?2025-07-16Language-Guided Contrastive Audio-Visual Masked Autoencoder with Automatically Generated Audio-Visual-Text Triplets from Videos2025-07-16A Mixed-Primitive-based Gaussian Splatting Method for Surface Reconstruction2025-07-15Dual Dimensions Geometric Representation Learning Based Document Dewarping2025-07-11