TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/LWGANet: A Lightweight Group Attention Backbone for Remote...

LWGANet: A Lightweight Group Attention Backbone for Remote Sensing Visual Tasks

Wei Lu, Si-Bao Chen, Chris H. Q. Ding, Jin Tang, Bin Luo

2025-01-17Scene ClassificationImage ClassificationObject Detection In Aerial ImagesSemantic SegmentationOriented Object DetectionChange Detectionobject-detectionObject Detection
PaperPDFCode(official)

Abstract

Remote sensing (RS) visual tasks have gained significant academic and practical importance. However, they encounter numerous challenges that hinder effective feature extraction, including the detection and recognition of multiple objects exhibiting substantial variations in scale within a single image. While prior dual-branch or multi-branch architectural strategies have been effective in managing these object variances, they have concurrently resulted in considerable increases in computational demands and parameter counts. Consequently, these architectures are rendered less viable for deployment on resource-constrained devices. Contemporary lightweight backbone networks, designed primarily for natural images, frequently encounter difficulties in effectively extracting features from multi-scale objects, which compromises their efficacy in RS visual tasks. This article introduces LWGANet, a specialized lightweight backbone network tailored for RS visual tasks, incorporating a novel lightweight group attention (LWGA) module designed to address these specific challenges. LWGA module, tailored for RS imagery, adeptly harnesses redundant features to extract a wide range of spatial information, from local to global scales, without introducing additional complexity or computational overhead. This facilitates precise feature extraction across multiple scales within an efficient framework.LWGANet was rigorously evaluated across twelve datasets, which span four crucial RS visual tasks: scene classification, oriented object detection, semantic segmentation, and change detection. The results confirm LWGANet's widespread applicability and its ability to maintain an optimal balance between high performance and low complexity, achieving SOTA results across diverse datasets. LWGANet emerged as a novel solution for resource-limited scenarios requiring robust RS image processing capabilities.

Results

TaskDatasetMetricValueModel
Semantic SegmentationLoveDACategory mIoU53.6LWGANet L2
Semantic SegmentationUAVidMean IoU69.1LWGANet L2
Object DetectionDOTAmAP78.64LWGANet L2
Object DetectionDIOR-RmAP68.53LWGANet L2
Image ClassificationRESISC45Top 1 Accuracy96.17LWGANet L2
Image ClassificationRESISC45Top 1 Accuracy95.7LWGANet L1
Image ClassificationRESISC45Top 1 Accuracy95.49LWGANet L0
3DDOTAmAP78.64LWGANet L2
3DDIOR-RmAP68.53LWGANet L2
2D ClassificationDOTAmAP78.64LWGANet L2
2D ClassificationDIOR-RmAP68.53LWGANet L2
Change DetectionWHU-CDF195.24CLAFA-LWGANet L2
Change DetectionWHU-CDIoU90.92CLAFA-LWGANet L2
Change DetectionWHU-CDPrecision96.51CLAFA-LWGANet L2
Change DetectionLEVIR-CDF192.42CLAFA-LWGANet L2
Change DetectionLEVIR-CDF1-score92.42CLAFA-LWGANet L2
Change DetectionLEVIR-CDIoU85.9CLAFA-LWGANet L2
Change DetectionLEVIR-CDPrecision93.25CLAFA-LWGANet L2
2D Object DetectionDOTAmAP78.64LWGANet L2
2D Object DetectionDIOR-RmAP68.53LWGANet L2
10-shot image generationLoveDACategory mIoU53.6LWGANet L2
10-shot image generationUAVidMean IoU69.1LWGANet L2
16kDOTAmAP78.64LWGANet L2
16kDIOR-RmAP68.53LWGANet L2

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21Automatic Classification and Segmentation of Tunnel Cracks Based on Deep Learning and Visual Explanations2025-07-18Adversarial attacks to image classification systems using evolutionary algorithms2025-07-17Efficient Adaptation of Pre-trained Vision Transformer underpinned by Approximately Orthogonal Fine-Tuning Strategy2025-07-17Federated Learning for Commercial Image Sources2025-07-17MUPAX: Multidimensional Problem Agnostic eXplainable AI2025-07-17DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17