TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/SPIN Road Mapper: Extracting Roads from Aerial Images via ...

SPIN Road Mapper: Extracting Roads from Aerial Images via Spatial and Interaction Space Graph Reasoning for Autonomous Driving

Wele Gedara Chaminda Bandara, Jeya Maria Jose Valanarasu, Vishal M. Patel

2021-09-16Road SegmentationAutonomous DrivingAutonomous Navigation
PaperPDFCode(official)

Abstract

Road extraction is an essential step in building autonomous navigation systems. Detecting road segments is challenging as they are of varying widths, bifurcated throughout the image, and are often occluded by terrain, cloud, or other weather conditions. Using just convolution neural networks (ConvNets) for this problem is not effective as it is inefficient at capturing distant dependencies between road segments in the image which is essential to extract road connectivity. To this end, we propose a Spatial and Interaction Space Graph Reasoning (SPIN) module which when plugged into a ConvNet performs reasoning over graphs constructed on spatial and interaction spaces projected from the feature maps. Reasoning over spatial space extracts dependencies between different spatial regions and other contextual information. Reasoning over a projected interaction space helps in appropriate delineation of roads from other topographies present in the image. Thus, SPIN extracts long-range dependencies between road segments and effectively delineates roads from other semantics. We also introduce a SPIN pyramid which performs SPIN graph reasoning across multiple scales to extract multi-scale features. We propose a network based on stacked hourglass modules and SPIN pyramid for road segmentation which achieves better performance compared to existing methods. Moreover, our method is computationally efficient and significantly boosts the convergence speed during training, making it feasible for applying on large-scale high-resolution aerial images. Code available at: https://github.com/wgcban/SPIN_RoadMapper.git.

Results

TaskDatasetMetricValueModel
Semantic SegmentationMassachusetts Roads DatasetAPLS72.49SPIN Road Mapper (ours)
Semantic SegmentationMassachusetts Roads DatasetIoU65.24SPIN Road Mapper (ours)
Semantic SegmentationDeepGlobeAPLS0.7414SPIN Road Mapper (ours)
Semantic SegmentationDeepGlobeIoU0.6702SPIN Road Mapper (ours)
10-shot image generationMassachusetts Roads DatasetAPLS72.49SPIN Road Mapper (ours)
10-shot image generationMassachusetts Roads DatasetIoU65.24SPIN Road Mapper (ours)
10-shot image generationDeepGlobeAPLS0.7414SPIN Road Mapper (ours)
10-shot image generationDeepGlobeIoU0.6702SPIN Road Mapper (ours)
Road SegmentationMassachusetts Roads DatasetAPLS72.49SPIN Road Mapper (ours)
Road SegmentationMassachusetts Roads DatasetIoU65.24SPIN Road Mapper (ours)
Road SegmentationDeepGlobeAPLS0.7414SPIN Road Mapper (ours)
Road SegmentationDeepGlobeIoU0.6702SPIN Road Mapper (ours)

Related Papers

GEMINUS: Dual-aware Global and Scene-Adaptive Mixture-of-Experts for End-to-End Autonomous Driving2025-07-19AGENTS-LLM: Augmentative GENeration of Challenging Traffic Scenarios with an Agentic LLM Framework2025-07-18World Model-Based End-to-End Scene Generation for Accident Anticipation in Autonomous Driving2025-07-17Orbis: Overcoming Challenges of Long-Horizon Prediction in Driving World Models2025-07-17Channel-wise Motion Features for Efficient Motion Segmentation2025-07-17LaViPlan : Language-Guided Visual Path Planning with RLVR2025-07-17Evaluating Reinforcement Learning Algorithms for Navigation in Simulated Robotic Quadrupeds: A Comparative Study Inspired by Guide Dog Behaviour2025-07-17Safeguarding Federated Learning-based Road Condition Classification2025-07-16