TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Encoder-Decoder with Atrous Separable Convolution for Sema...

Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation

Liang-Chieh Chen, Yukun Zhu, George Papandreou, Florian Schroff, Hartwig Adam

2018-02-07ECCV 2018 9Image ClassificationLesion SegmentationSemantic SegmentationImage Segmentation
PaperPDFCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCode(official)CodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCode

Abstract

Spatial pyramid pooling module or encode-decoder structure are used in deep neural networks for semantic segmentation task. The former networks are able to encode multi-scale contextual information by probing the incoming features with filters or pooling operations at multiple rates and multiple effective fields-of-view, while the latter networks can capture sharper object boundaries by gradually recovering the spatial information. In this work, we propose to combine the advantages from both methods. Specifically, our proposed model, DeepLabv3+, extends DeepLabv3 by adding a simple yet effective decoder module to refine the segmentation results especially along object boundaries. We further explore the Xception model and apply the depthwise separable convolution to both Atrous Spatial Pyramid Pooling and decoder modules, resulting in a faster and stronger encoder-decoder network. We demonstrate the effectiveness of the proposed model on PASCAL VOC 2012 and Cityscapes datasets, achieving the test set performance of 89.0\% and 82.1\% without any post-processing. Our paper is accompanied with a publicly available reference implementation of the proposed models in Tensorflow at \url{https://github.com/tensorflow/models/tree/master/research/deeplab}.

Results

TaskDatasetMetricValueModel
Medical Image SegmentationAnatomical Tracings of Lesions After Stroke (ATLAS)Dice0.4609DeepLab v3+
Medical Image SegmentationAnatomical Tracings of Lesions After Stroke (ATLAS)IoU0.3458DeepLab v3+
Medical Image SegmentationAnatomical Tracings of Lesions After Stroke (ATLAS)Precision0.5831DeepLab v3+
Semantic SegmentationUS3DmIoU74.42DeepLabV3+
Semantic SegmentationFine-Grained Grass Segmentation DatasetmIoU47.95DeepLabv3+
Semantic SegmentationPotsdammIoU83.67DeepLabV3+
Semantic SegmentationCityscapes valmIoU79.6DeepLabv3+ (Dilated-Xception-71)
Semantic SegmentationBDD100K valmIoU63.6Deeplabv3+
Semantic SegmentationUrbanLFmIoU (Real)76.27DeepLabV3+ (ResNet-101)
Semantic SegmentationSkyScapes-DenseMean IoU38.2DeepLabv3+
Semantic SegmentationAI-TODDice43.52DeepLabV3+(ResNet-50)
Semantic SegmentationPASCAL VOC 2012 valmIoU (Syn)75.39DeepLabV3+ (ResNet-101)
Semantic SegmentationEventScapemIoU53.65DeepLabV3+
Semantic SegmentationVaihingenmIoU72.9DeepLabV3+
Semantic SegmentationBJRoadIoU50.81DeepLabv3+
Semantic SegmentationTrans10KGFLOPs37.98DeepLabV3+
Semantic SegmentationDADA-segmIoU26.8DeepLabV3+ (ACDC)
10-shot image generationUS3DmIoU74.42DeepLabV3+
10-shot image generationFine-Grained Grass Segmentation DatasetmIoU47.95DeepLabv3+
10-shot image generationPotsdammIoU83.67DeepLabV3+
10-shot image generationCityscapes valmIoU79.6DeepLabv3+ (Dilated-Xception-71)
10-shot image generationBDD100K valmIoU63.6Deeplabv3+
10-shot image generationUrbanLFmIoU (Real)76.27DeepLabV3+ (ResNet-101)
10-shot image generationSkyScapes-DenseMean IoU38.2DeepLabv3+
10-shot image generationAI-TODDice43.52DeepLabV3+(ResNet-50)
10-shot image generationPASCAL VOC 2012 valmIoU (Syn)75.39DeepLabV3+ (ResNet-101)
10-shot image generationEventScapemIoU53.65DeepLabV3+
10-shot image generationVaihingenmIoU72.9DeepLabV3+
10-shot image generationBJRoadIoU50.81DeepLabv3+
10-shot image generationTrans10KGFLOPs37.98DeepLabV3+
10-shot image generationDADA-segmIoU26.8DeepLabV3+ (ACDC)

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21Automatic Classification and Segmentation of Tunnel Cracks Based on Deep Learning and Visual Explanations2025-07-18Adversarial attacks to image classification systems using evolutionary algorithms2025-07-17Efficient Adaptation of Pre-trained Vision Transformer underpinned by Approximately Orthogonal Fine-Tuning Strategy2025-07-17Federated Learning for Commercial Image Sources2025-07-17MUPAX: Multidimensional Problem Agnostic eXplainable AI2025-07-17DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17