TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/SpaGBOL: Spatial-Graph-Based Orientated Localisation

SpaGBOL: Spatial-Graph-Based Orientated Localisation

Tavis Shore, Oscar Mendez, Simon Hadfield

2024-09-23Visual Localizationgeo-localizationNavigateImage-Based LocalizationOutdoor LocalizationCamera LocalizationRetrievalCross-View Geo-LocalisationImage Retrieval
PaperPDFCode(official)

Abstract

Cross-View Geo-Localisation within urban regions is challenging in part due to the lack of geo-spatial structuring within current datasets and techniques. We propose utilising graph representations to model sequences of local observations and the connectivity of the target location. Modelling as a graph enables generating previously unseen sequences by sampling with new parameter configurations. To leverage this newly available information, we propose a GNN-based architecture, producing spatially strong embeddings and improving discriminability over isolated image embeddings. We outline SpaGBOL, introducing three novel contributions. 1) The first graph-structured dataset for Cross-View Geo-Localisation, containing multiple streetview images per node to improve generalisation. 2) Introducing GNNs to the problem, we develop the first system that exploits the correlation between node proximity and feature similarity. 3) Leveraging the unique properties of the graph representation - we demonstrate a novel retrieval filtering approach based on neighbourhood bearings. SpaGBOL achieves state-of-the-art accuracies on the unseen test graph - with relative Top-1 retrieval improvements on previous techniques of 11%, and 50% when filtering with Bearing Vector Matching on the SpaGBOL dataset.

Results

TaskDatasetMetricValueModel
Camera LocalizationSpaGBOLTop-156.48SpaGBOL
Camera LocalizationSpaGBOLTop-1%87.24SpaGBOL
Camera LocalizationSpaGBOLTop-1083.85SpaGBOL
Camera LocalizationSpaGBOLTop-577.47SpaGBOL
Camera LocalizationSpaGBOLTop-1%82.32Sample4Geo
Camera LocalizationSpaGBOLTop-1079.96Sample4Geo
Camera LocalizationSpaGBOLTop-574.22Sample4Geo
Camera LocalizationSpaGBOLTop-125.65SAIG-D
Camera LocalizationSpaGBOLTop-1%68.22SAIG-D
Camera LocalizationSpaGBOLTop-1062.29SAIG-D
Camera LocalizationSpaGBOLTop-551.44SAIG-D
Camera LocalizationSpaGBOLTop-117.49GeoDTR+
Camera LocalizationSpaGBOLTop-1%59.41GeoDTR+
Camera LocalizationSpaGBOLTop-1052.01GeoDTR+
Camera LocalizationSpaGBOLTop-540.27GeoDTR+
Camera LocalizationSpaGBOLTop-111.23L2LTR
Camera LocalizationSpaGBOLTop-1%49.52L2LTR
Camera LocalizationSpaGBOLTop-1042.5L2LTR
Camera LocalizationSpaGBOLTop-531.27L2LTR
Camera LocalizationSpaGBOLTop-15.82DSM
Camera LocalizationSpaGBOLTop-1%18.62DSM
Camera LocalizationSpaGBOLTop-1014.13DSM
Camera LocalizationSpaGBOLTop-510.21DSM
Camera LocalizationSpaGBOLTop-14.02CVFT
Camera LocalizationSpaGBOLTop-1%27.19CVFT
Camera LocalizationSpaGBOLTop-1020.29CVFT
Camera LocalizationSpaGBOLTop-12.87CVM-Net
Camera LocalizationSpaGBOLTop-1%28.33CVM-Net
Camera LocalizationSpaGBOLTop-1021.51CVM-Net
Camera LocalizationSpaGBOLTop-513.02CVM-Net
Camera LocalizationVIGOR-GraphAccuracy (Top-1)31.88SpaGBOL
Camera LocalizationSpaGBOL 180°Top-140.88SpaGBOL
Camera LocalizationSpaGBOL 180°Top-563.79SpaGBOL
Camera LocalizationSpaGBOL 90°Top-118.63SpaGBOL
Camera LocalizationSpaGBOL 90°Top-543.2SpaGBOL

Related Papers

From Roots to Rewards: Dynamic Tree Reasoning with RL2025-07-17HapticCap: A Multimodal Dataset and Task for Understanding User Experience of Vibration Haptic Signals2025-07-17A Survey of Context Engineering for Large Language Models2025-07-17MCoT-RE: Multi-Faceted Chain-of-Thought and Re-Ranking for Training-Free Zero-Shot Composed Image Retrieval2025-07-17FAR-Net: Multi-Stage Fusion Network with Enhanced Semantic Alignment and Adaptive Reconciliation for Composed Image Retrieval2025-07-17Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios2025-07-16Developing Visual Augmented Q&A System using Scalable Vision Embedding Retrieval & Late Interaction Re-ranker2025-07-16Language-Guided Contrastive Audio-Visual Masked Autoencoder with Automatically Generated Audio-Visual-Text Triplets from Videos2025-07-16