LWGANet: A Lightweight Group Attention Backbone for Remote Sensing Visual Tasks

Wei Lu, Si-Bao Chen, Chris H. Q. Ding, Jin Tang, Bin Luo

2025-01-17Scene Classification Image Classification Object Detection In Aerial Images Semantic Segmentation Oriented Object Detection Change Detection object-detection Object Detection

Paper PDF Code(official)

Abstract

Remote sensing (RS) visual tasks have gained significant academic and practical importance. However, they encounter numerous challenges that hinder effective feature extraction, including the detection and recognition of multiple objects exhibiting substantial variations in scale within a single image. While prior dual-branch or multi-branch architectural strategies have been effective in managing these object variances, they have concurrently resulted in considerable increases in computational demands and parameter counts. Consequently, these architectures are rendered less viable for deployment on resource-constrained devices. Contemporary lightweight backbone networks, designed primarily for natural images, frequently encounter difficulties in effectively extracting features from multi-scale objects, which compromises their efficacy in RS visual tasks. This article introduces LWGANet, a specialized lightweight backbone network tailored for RS visual tasks, incorporating a novel lightweight group attention (LWGA) module designed to address these specific challenges. LWGA module, tailored for RS imagery, adeptly harnesses redundant features to extract a wide range of spatial information, from local to global scales, without introducing additional complexity or computational overhead. This facilitates precise feature extraction across multiple scales within an efficient framework.LWGANet was rigorously evaluated across twelve datasets, which span four crucial RS visual tasks: scene classification, oriented object detection, semantic segmentation, and change detection. The results confirm LWGANet's widespread applicability and its ability to maintain an optimal balance between high performance and low complexity, achieving SOTA results across diverse datasets. LWGANet emerged as a novel solution for resource-limited scenarios requiring robust RS image processing capabilities.

Results

Task	Dataset	Metric	Value	Model
Semantic Segmentation	LoveDA	Category mIoU	53.6	LWGANet L2
Semantic Segmentation	UAVid	Mean IoU	69.1	LWGANet L2
Object Detection	DOTA	mAP	78.64	LWGANet L2
Object Detection	DIOR-R	mAP	68.53	LWGANet L2
Image Classification	RESISC45	Top 1 Accuracy	96.17	LWGANet L2
Image Classification	RESISC45	Top 1 Accuracy	95.7	LWGANet L1
Image Classification	RESISC45	Top 1 Accuracy	95.49	LWGANet L0
3D	DOTA	mAP	78.64	LWGANet L2
3D	DIOR-R	mAP	68.53	LWGANet L2
2D Classification	DOTA	mAP	78.64	LWGANet L2
2D Classification	DIOR-R	mAP	68.53	LWGANet L2
Change Detection	WHU-CD	F1	95.24	CLAFA-LWGANet L2
Change Detection	WHU-CD	IoU	90.92	CLAFA-LWGANet L2
Change Detection	WHU-CD	Precision	96.51	CLAFA-LWGANet L2
Change Detection	LEVIR-CD	F1	92.42	CLAFA-LWGANet L2
Change Detection	LEVIR-CD	F1-score	92.42	CLAFA-LWGANet L2
Change Detection	LEVIR-CD	IoU	85.9	CLAFA-LWGANet L2
Change Detection	LEVIR-CD	Precision	93.25	CLAFA-LWGANet L2
2D Object Detection	DOTA	mAP	78.64	LWGANet L2
2D Object Detection	DIOR-R	mAP	68.53	LWGANet L2
10-shot image generation	LoveDA	Category mIoU	53.6	LWGANet L2
10-shot image generation	UAVid	Mean IoU	69.1	LWGANet L2
16k	DOTA	mAP	78.64	LWGANet L2
16k	DIOR-R	mAP	68.53	LWGANet L2

LWGANet: A Lightweight Group Attention Backbone for Remote Sensing Visual Tasks

Abstract

Results

Related Papers

LWGANet: A Lightweight Group Attention Backbone for Remote Sensing Visual Tasks

Abstract

Results

Related Papers