TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Rethinking Atrous Convolution for Semantic Image Segmentat...

Rethinking Atrous Convolution for Semantic Image Segmentation

Liang-Chieh Chen, George Papandreou, Florian Schroff, Hartwig Adam

2017-06-172D Semantic SegmentationThermal Image SegmentationDichotomous Image SegmentationSegmentationSemantic SegmentationImage Segmentation
PaperPDFCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCode

Abstract

In this work, we revisit atrous convolution, a powerful tool to explicitly adjust filter's field-of-view as well as control the resolution of feature responses computed by Deep Convolutional Neural Networks, in the application of semantic image segmentation. To handle the problem of segmenting objects at multiple scales, we design modules which employ atrous convolution in cascade or in parallel to capture multi-scale context by adopting multiple atrous rates. Furthermore, we propose to augment our previously proposed Atrous Spatial Pyramid Pooling module, which probes convolutional features at multiple scales, with image-level features encoding global context and further boost performance. We also elaborate on implementation details and share our experience on training our system. The proposed `DeepLabv3' system significantly improves over our previous DeepLab versions without DenseCRF post-processing and attains comparable performance with other state-of-art models on the PASCAL VOC 2012 semantic image segmentation benchmark.

Results

TaskDatasetMetricValueModel
Semantic SegmentationSELMAmIoU70.7DeepLabV3
Object DetectionDIS-TE4E-measure0.82DeeplabV3+
Object DetectionDIS-TE4HCE3709DeeplabV3+
Object DetectionDIS-TE4MAE0.111DeeplabV3+
Object DetectionDIS-TE4S-Measure0.744DeeplabV3+
Object DetectionDIS-TE4max F-Measure0.715DeeplabV3+
Object DetectionDIS-TE4weighted F-measure0.621DeeplabV3+
Object DetectionDIS-VDE-measure0.796DeeplabV3+
Object DetectionDIS-VDHCE1520DeeplabV3+
Object DetectionDIS-VDMAE0.114DeeplabV3+
Object DetectionDIS-VDS-Measure0.716DeeplabV3+
Object DetectionDIS-VDmax F-Measure0.66DeeplabV3+
Object DetectionDIS-VDweighted F-measure0.568DeeplabV3+
Object DetectionDIS-TE2E-measure0.813DeeplabV3+
Object DetectionDIS-TE2HCE516DeeplabV3+
Object DetectionDIS-TE2MAE0.105DeeplabV3+
Object DetectionDIS-TE2S-Measure0.729DeeplabV3+
Object DetectionDIS-TE2max F-Measure0.681DeeplabV3+
Object DetectionDIS-TE2weighted F-measure0.587DeeplabV3+
Object DetectionDIS-TE1E-measure0.772DeeplabV3+
Object DetectionDIS-TE1HCE234DeeplabV3+
Object DetectionDIS-TE1MAE0.102DeeplabV3+
Object DetectionDIS-TE1S-Measure0.694DeeplabV3+
Object DetectionDIS-TE1max F-Measure0.601DeeplabV3+
Object DetectionDIS-TE1weighted F-measure0.506DeeplabV3+
Object DetectionDIS-TE3E-measure0.833DeeplabV3+
Object DetectionDIS-TE3HCE999DeeplabV3+
Object DetectionDIS-TE3MAE0.102DeeplabV3+
Object DetectionDIS-TE3S-Measure0.749DeeplabV3+
Object DetectionDIS-TE3max F-Measure0.717DeeplabV3+
Object DetectionDIS-TE3weighted F-measure0.623DeeplabV3+
3DDIS-TE4E-measure0.82DeeplabV3+
3DDIS-TE4HCE3709DeeplabV3+
3DDIS-TE4MAE0.111DeeplabV3+
3DDIS-TE4S-Measure0.744DeeplabV3+
3DDIS-TE4max F-Measure0.715DeeplabV3+
3DDIS-TE4weighted F-measure0.621DeeplabV3+
3DDIS-VDE-measure0.796DeeplabV3+
3DDIS-VDHCE1520DeeplabV3+
3DDIS-VDMAE0.114DeeplabV3+
3DDIS-VDS-Measure0.716DeeplabV3+
3DDIS-VDmax F-Measure0.66DeeplabV3+
3DDIS-VDweighted F-measure0.568DeeplabV3+
3DDIS-TE2E-measure0.813DeeplabV3+
3DDIS-TE2HCE516DeeplabV3+
3DDIS-TE2MAE0.105DeeplabV3+
3DDIS-TE2S-Measure0.729DeeplabV3+
3DDIS-TE2max F-Measure0.681DeeplabV3+
3DDIS-TE2weighted F-measure0.587DeeplabV3+
3DDIS-TE1E-measure0.772DeeplabV3+
3DDIS-TE1HCE234DeeplabV3+
3DDIS-TE1MAE0.102DeeplabV3+
3DDIS-TE1S-Measure0.694DeeplabV3+
3DDIS-TE1max F-Measure0.601DeeplabV3+
3DDIS-TE1weighted F-measure0.506DeeplabV3+
3DDIS-TE3E-measure0.833DeeplabV3+
3DDIS-TE3HCE999DeeplabV3+
3DDIS-TE3MAE0.102DeeplabV3+
3DDIS-TE3S-Measure0.749DeeplabV3+
3DDIS-TE3max F-Measure0.717DeeplabV3+
3DDIS-TE3weighted F-measure0.623DeeplabV3+
RGB Salient Object DetectionDIS-TE4E-measure0.82DeeplabV3+
RGB Salient Object DetectionDIS-TE4HCE3709DeeplabV3+
RGB Salient Object DetectionDIS-TE4MAE0.111DeeplabV3+
RGB Salient Object DetectionDIS-TE4S-Measure0.744DeeplabV3+
RGB Salient Object DetectionDIS-TE4max F-Measure0.715DeeplabV3+
RGB Salient Object DetectionDIS-TE4weighted F-measure0.621DeeplabV3+
RGB Salient Object DetectionDIS-VDE-measure0.796DeeplabV3+
RGB Salient Object DetectionDIS-VDHCE1520DeeplabV3+
RGB Salient Object DetectionDIS-VDMAE0.114DeeplabV3+
RGB Salient Object DetectionDIS-VDS-Measure0.716DeeplabV3+
RGB Salient Object DetectionDIS-VDmax F-Measure0.66DeeplabV3+
RGB Salient Object DetectionDIS-VDweighted F-measure0.568DeeplabV3+
RGB Salient Object DetectionDIS-TE2E-measure0.813DeeplabV3+
RGB Salient Object DetectionDIS-TE2HCE516DeeplabV3+
RGB Salient Object DetectionDIS-TE2MAE0.105DeeplabV3+
RGB Salient Object DetectionDIS-TE2S-Measure0.729DeeplabV3+
RGB Salient Object DetectionDIS-TE2max F-Measure0.681DeeplabV3+
RGB Salient Object DetectionDIS-TE2weighted F-measure0.587DeeplabV3+
RGB Salient Object DetectionDIS-TE1E-measure0.772DeeplabV3+
RGB Salient Object DetectionDIS-TE1HCE234DeeplabV3+
RGB Salient Object DetectionDIS-TE1MAE0.102DeeplabV3+
RGB Salient Object DetectionDIS-TE1S-Measure0.694DeeplabV3+
RGB Salient Object DetectionDIS-TE1max F-Measure0.601DeeplabV3+
RGB Salient Object DetectionDIS-TE1weighted F-measure0.506DeeplabV3+
RGB Salient Object DetectionDIS-TE3E-measure0.833DeeplabV3+
RGB Salient Object DetectionDIS-TE3HCE999DeeplabV3+
RGB Salient Object DetectionDIS-TE3MAE0.102DeeplabV3+
RGB Salient Object DetectionDIS-TE3S-Measure0.749DeeplabV3+
RGB Salient Object DetectionDIS-TE3max F-Measure0.717DeeplabV3+
RGB Salient Object DetectionDIS-TE3weighted F-measure0.623DeeplabV3+
2D Semantic SegmentationWildScenesmIoU43.37DeepLabv3 (ResNet-50)
2D Semantic SegmentationWildScenesmIoU (Env DA)36.12DeepLabv3 (ResNet-50)
2D Semantic SegmentationWildScenesmIoU (Temporal DA) 43.95DeepLabv3 (ResNet-50)
2D ClassificationDIS-TE4E-measure0.82DeeplabV3+
2D ClassificationDIS-TE4HCE3709DeeplabV3+
2D ClassificationDIS-TE4MAE0.111DeeplabV3+
2D ClassificationDIS-TE4S-Measure0.744DeeplabV3+
2D ClassificationDIS-TE4max F-Measure0.715DeeplabV3+
2D ClassificationDIS-TE4weighted F-measure0.621DeeplabV3+
2D ClassificationDIS-VDE-measure0.796DeeplabV3+
2D ClassificationDIS-VDHCE1520DeeplabV3+
2D ClassificationDIS-VDMAE0.114DeeplabV3+
2D ClassificationDIS-VDS-Measure0.716DeeplabV3+
2D ClassificationDIS-VDmax F-Measure0.66DeeplabV3+
2D ClassificationDIS-VDweighted F-measure0.568DeeplabV3+
2D ClassificationDIS-TE2E-measure0.813DeeplabV3+
2D ClassificationDIS-TE2HCE516DeeplabV3+
2D ClassificationDIS-TE2MAE0.105DeeplabV3+
2D ClassificationDIS-TE2S-Measure0.729DeeplabV3+
2D ClassificationDIS-TE2max F-Measure0.681DeeplabV3+
2D ClassificationDIS-TE2weighted F-measure0.587DeeplabV3+
2D ClassificationDIS-TE1E-measure0.772DeeplabV3+
2D ClassificationDIS-TE1HCE234DeeplabV3+
2D ClassificationDIS-TE1MAE0.102DeeplabV3+
2D ClassificationDIS-TE1S-Measure0.694DeeplabV3+
2D ClassificationDIS-TE1max F-Measure0.601DeeplabV3+
2D ClassificationDIS-TE1weighted F-measure0.506DeeplabV3+
2D ClassificationDIS-TE3E-measure0.833DeeplabV3+
2D ClassificationDIS-TE3HCE999DeeplabV3+
2D ClassificationDIS-TE3MAE0.102DeeplabV3+
2D ClassificationDIS-TE3S-Measure0.749DeeplabV3+
2D ClassificationDIS-TE3max F-Measure0.717DeeplabV3+
2D ClassificationDIS-TE3weighted F-measure0.623DeeplabV3+
2D Object DetectionDIS-TE4E-measure0.82DeeplabV3+
2D Object DetectionDIS-TE4HCE3709DeeplabV3+
2D Object DetectionDIS-TE4MAE0.111DeeplabV3+
2D Object DetectionDIS-TE4S-Measure0.744DeeplabV3+
2D Object DetectionDIS-TE4max F-Measure0.715DeeplabV3+
2D Object DetectionDIS-TE4weighted F-measure0.621DeeplabV3+
2D Object DetectionDIS-VDE-measure0.796DeeplabV3+
2D Object DetectionDIS-VDHCE1520DeeplabV3+
2D Object DetectionDIS-VDMAE0.114DeeplabV3+
2D Object DetectionDIS-VDS-Measure0.716DeeplabV3+
2D Object DetectionDIS-VDmax F-Measure0.66DeeplabV3+
2D Object DetectionDIS-VDweighted F-measure0.568DeeplabV3+
2D Object DetectionDIS-TE2E-measure0.813DeeplabV3+
2D Object DetectionDIS-TE2HCE516DeeplabV3+
2D Object DetectionDIS-TE2MAE0.105DeeplabV3+
2D Object DetectionDIS-TE2S-Measure0.729DeeplabV3+
2D Object DetectionDIS-TE2max F-Measure0.681DeeplabV3+
2D Object DetectionDIS-TE2weighted F-measure0.587DeeplabV3+
2D Object DetectionDIS-TE1E-measure0.772DeeplabV3+
2D Object DetectionDIS-TE1HCE234DeeplabV3+
2D Object DetectionDIS-TE1MAE0.102DeeplabV3+
2D Object DetectionDIS-TE1S-Measure0.694DeeplabV3+
2D Object DetectionDIS-TE1max F-Measure0.601DeeplabV3+
2D Object DetectionDIS-TE1weighted F-measure0.506DeeplabV3+
2D Object DetectionDIS-TE3E-measure0.833DeeplabV3+
2D Object DetectionDIS-TE3HCE999DeeplabV3+
2D Object DetectionDIS-TE3MAE0.102DeeplabV3+
2D Object DetectionDIS-TE3S-Measure0.749DeeplabV3+
2D Object DetectionDIS-TE3max F-Measure0.717DeeplabV3+
2D Object DetectionDIS-TE3weighted F-measure0.623DeeplabV3+
10-shot image generationSELMAmIoU70.7DeepLabV3
16kDIS-TE4E-measure0.82DeeplabV3+
16kDIS-TE4HCE3709DeeplabV3+
16kDIS-TE4MAE0.111DeeplabV3+
16kDIS-TE4S-Measure0.744DeeplabV3+
16kDIS-TE4max F-Measure0.715DeeplabV3+
16kDIS-TE4weighted F-measure0.621DeeplabV3+
16kDIS-VDE-measure0.796DeeplabV3+
16kDIS-VDHCE1520DeeplabV3+
16kDIS-VDMAE0.114DeeplabV3+
16kDIS-VDS-Measure0.716DeeplabV3+
16kDIS-VDmax F-Measure0.66DeeplabV3+
16kDIS-VDweighted F-measure0.568DeeplabV3+
16kDIS-TE2E-measure0.813DeeplabV3+
16kDIS-TE2HCE516DeeplabV3+
16kDIS-TE2MAE0.105DeeplabV3+
16kDIS-TE2S-Measure0.729DeeplabV3+
16kDIS-TE2max F-Measure0.681DeeplabV3+
16kDIS-TE2weighted F-measure0.587DeeplabV3+
16kDIS-TE1E-measure0.772DeeplabV3+
16kDIS-TE1HCE234DeeplabV3+
16kDIS-TE1MAE0.102DeeplabV3+
16kDIS-TE1S-Measure0.694DeeplabV3+
16kDIS-TE1max F-Measure0.601DeeplabV3+
16kDIS-TE1weighted F-measure0.506DeeplabV3+
16kDIS-TE3E-measure0.833DeeplabV3+
16kDIS-TE3HCE999DeeplabV3+
16kDIS-TE3MAE0.102DeeplabV3+
16kDIS-TE3S-Measure0.749DeeplabV3+
16kDIS-TE3max F-Measure0.717DeeplabV3+
16kDIS-TE3weighted F-measure0.623DeeplabV3+

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21Deep Learning-Based Fetal Lung Segmentation from Diffusion-weighted MRI Images and Lung Maturity Evaluation for Fetal Growth Restriction2025-07-17DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17From Variability To Accuracy: Conditional Bernoulli Diffusion Models with Consensus-Driven Correction for Thin Structure Segmentation2025-07-17Unleashing Vision Foundation Models for Coronary Artery Segmentation: Parallel ViT-CNN Encoding and Variational Fusion2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique2025-07-17