TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/S$^2$-FPN: Scale-ware Strip Attention Guided Feature Pyram...

S$^2$-FPN: Scale-ware Strip Attention Guided Feature Pyramid Network for Real-time Semantic Segmentation

Mohammed A. M. Elhassan, Chenhui Yang, Chenxi Huang, Tewodros Legesse Munea, Xin Hong, Abuzar B. M. Adam, Amina Benabid

2022-06-152D Semantic SegmentationReal-Time Semantic SegmentationSegmentationSemantic Segmentation
PaperPDFCode(official)

Abstract

Modern high-performance semantic segmentation methods employ a heavy backbone and dilated convolution to extract the relevant feature. Although extracting features with both contextual and semantic information is critical for the segmentation tasks, it brings a memory footprint and high computation cost for real-time applications. This paper presents a new model to achieve a trade-off between accuracy/speed for real-time road scene semantic segmentation. Specifically, we proposed a lightweight model named Scale-aware Strip Attention Guided Feature Pyramid Network (S$^2$-FPN). Our network consists of three main modules: Attention Pyramid Fusion (APF) module, Scale-aware Strip Attention Module (SSAM), and Global Feature Upsample (GFU) module. APF adopts an attention mechanisms to learn discriminative multi-scale features and help close the semantic gap between different levels. APF uses the scale-aware attention to encode global context with vertical stripping operation and models the long-range dependencies, which helps relate pixels with similar semantic label. In addition, APF employs channel-wise reweighting block (CRB) to emphasize the channel features. Finally, the decoder of S$^2$-FPN then adopts GFU, which is used to fuse features from APF and the encoder. Extensive experiments have been conducted on two challenging semantic segmentation benchmarks, which demonstrate that our approach achieves better accuracy/speed trade-off with different model settings. The proposed models have achieved a results of 76.2\%mIoU/87.3FPS, 77.4\%mIoU/67FPS, and 77.8\%mIoU/30.5FPS on Cityscapes dataset, and 69.6\%mIoU,71.0\% mIoU, and 74.2\% mIoU on Camvid dataset. The code for this work will be made available at \url{https://github.com/mohamedac29/S2-FPN

Results

TaskDatasetMetricValueModel
Semantic SegmentationCityscapesmIoU77.8SPFNet34M
Semantic SegmentationCamVidmIoU74.2S^2-FPN34M
Semantic SegmentationCamVidFrame (fps)107.2S^2-FPN34
Semantic SegmentationCamVidmIoU71S^2-FPN34
Semantic SegmentationCamVidFrame (fps)124.2S^2-FPN18
Semantic SegmentationCamVidmIoU69.5S^2-FPN18
Semantic SegmentationCityscapesmIoU77.4S^2-FPN34
Semantic SegmentationCityscapesmIoU76.2S^2-FPN18
10-shot image generationCityscapesmIoU77.8SPFNet34M
10-shot image generationCamVidmIoU74.2S^2-FPN34M
10-shot image generationCamVidFrame (fps)107.2S^2-FPN34
10-shot image generationCamVidmIoU71S^2-FPN34
10-shot image generationCamVidFrame (fps)124.2S^2-FPN18
10-shot image generationCamVidmIoU69.5S^2-FPN18
10-shot image generationCityscapesmIoU77.4S^2-FPN34
10-shot image generationCityscapesmIoU76.2S^2-FPN18

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21Deep Learning-Based Fetal Lung Segmentation from Diffusion-weighted MRI Images and Lung Maturity Evaluation for Fetal Growth Restriction2025-07-17DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17From Variability To Accuracy: Conditional Bernoulli Diffusion Models with Consensus-Driven Correction for Thin Structure Segmentation2025-07-17Unleashing Vision Foundation Models for Coronary Artery Segmentation: Parallel ViT-CNN Encoding and Variational Fusion2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique2025-07-17