TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/SERNet-Former: Semantic Segmentation by Efficient Residual...

SERNet-Former: Semantic Segmentation by Efficient Residual Network with Attention-Boosting Gates and Attention-Fusion Networks

Serdar Erisen

2024-01-282D Semantic SegmentationSemantic Segmentation
PaperPDFCode(official)Code

Abstract

Improving the efficiency of state-of-the-art methods in semantic segmentation requires overcoming the increasing computational cost as well as issues such as fusing semantic information from global and local contexts. Based on the recent success and problems that convolutional neural networks (CNNs) encounter in semantic segmentation, this research proposes an encoder-decoder architecture with a unique efficient residual network, Efficient-ResNet. Attention-boosting gates (AbGs) and attention-boosting modules (AbMs) are deployed by aiming to fuse the equivariant and feature-based semantic information with the equivalent sizes of the output of global context of the efficient residual network in the encoder. Respectively, the decoder network is developed with the additional attention-fusion networks (AfNs) inspired by AbM. AfNs are designed to improve the efficiency in the one-to-one conversion of the semantic information by deploying additional convolution layers in the decoder part. Our network is tested on the challenging CamVid and Cityscapes datasets, and the proposed methods reveal significant improvements on the residual networks. To the best of our knowledge, the developed network, SERNet-Former, achieves state-of-the-art results (84.62 % mean IoU) on CamVid dataset and challenging results (87.35 % mean IoU) on Cityscapes validation dataset.

Results

TaskDatasetMetricValueModel
Semantic SegmentationCityscapes testMean IoU (class)84.83SERNet-Former
Semantic SegmentationCamVidMean IoU84.62SERNet-Former
Semantic SegmentationCityscapes valValidation mIoU87.35SERNet-Former
Semantic SegmentationCityscapes valmIoU87.35SERNet-Former
Semantic SegmentationBDD100K valmIoU67.42SERNet-Former_v2
Semantic SegmentationADE20K valmIoU59.35SERNet-Former_v2
Semantic SegmentationADE20KValidation mIoU59.35SERNet-Former
2D Semantic SegmentationCamVidmIoU84.62SERNet-Former
2D Semantic SegmentationCityscapes valmIoU87.35SERNet-Former
10-shot image generationCityscapes testMean IoU (class)84.83SERNet-Former
10-shot image generationCamVidMean IoU84.62SERNet-Former
10-shot image generationCityscapes valValidation mIoU87.35SERNet-Former
10-shot image generationCityscapes valmIoU87.35SERNet-Former
10-shot image generationBDD100K valmIoU67.42SERNet-Former_v2
10-shot image generationADE20K valmIoU59.35SERNet-Former_v2
10-shot image generationADE20KValidation mIoU59.35SERNet-Former

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique2025-07-17SAMST: A Transformer framework based on SAM pseudo label filtering for remote sensing semi-supervised semantic segmentation2025-07-16Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping2025-07-15U-RWKV: Lightweight medical image segmentation with direction-adaptive RWKV2025-07-15