
Learning to Adapt Structured Output Space for Semantic Segmentation

Yi-Hsuan Tsai, Wei-Chih Hung, Samuel Schulter, Kihyuk Sohn, Ming-Hsuan Yang, Manmohan Chandraker

Date: 2018-02-28
Venue: CVPR 2018 (June)
Tasks: Segmentation, Semantic Segmentation, Synthetic-to-Real Translation, Image-to-Image Translation, Domain Adaptation

Abstract

Convolutional neural network-based approaches for semantic segmentation rely on supervision with pixel-level ground truth, but may not generalize well to unseen image domains. As the labeling process is tedious and labor intensive, developing algorithms that can adapt source ground truth labels to the target domain is of great interest. In this paper, we propose an adversarial learning method for domain adaptation in the context of semantic segmentation. Considering semantic segmentations as structured outputs that contain spatial similarities between the source and target domains, we adopt adversarial learning in the output space. To further enhance the adapted model, we construct a multi-level adversarial network to effectively perform output space domain adaptation at different feature levels. Extensive experiments and ablation study are conducted under various domain adaptation settings, including synthetic-to-real and cross-city scenarios. We show that the proposed method performs favorably against the state-of-the-art methods in terms of accuracy and visual quality.
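The idea of adversarial learning in the output space can be illustrated with a small, dependency-free sketch. It is not the authors' implementation: the logistic `toy_discriminator`, the binary cross-entropy formulation, the `SOURCE`/`TARGET` label convention, and the per-level weights are all assumptions made for illustration. The sketch only shows the general shape of the objective: a discriminator looks at per-pixel softmax outputs rather than features, and the segmentation network is penalized when its target-domain outputs are recognizable as target.

```python
import math

# Assumed label convention for the domain discriminator (hypothetical).
SOURCE, TARGET = 1, 0

def softmax(logits):
    """Numerically stable softmax over one pixel's class scores."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def bce(prob, label, eps=1e-12):
    """Binary cross-entropy for a single probability and a 0/1 label."""
    return -(label * math.log(prob + eps)
             + (1 - label) * math.log(1 - prob + eps))

def toy_discriminator(class_probs, weights, bias):
    """Hypothetical per-pixel discriminator: logistic regression on the
    softmax output. Returns the estimated P(pixel comes from source)."""
    z = sum(w * p for w, p in zip(weights, class_probs)) + bias
    return 1.0 / (1.0 + math.exp(-z))

def mean_domain_loss(pixel_logits, label, weights, bias):
    """Mean BCE of the discriminator over all pixels for a given label."""
    losses = [bce(toy_discriminator(softmax(px), weights, bias), label)
              for px in pixel_logits]
    return sum(losses) / len(losses)

def discriminator_loss(source_logits, target_logits, weights, bias):
    """Discriminator objective: tell source outputs from target outputs."""
    return (mean_domain_loss(source_logits, SOURCE, weights, bias)
            + mean_domain_loss(target_logits, TARGET, weights, bias))

def adversarial_loss(target_logits, weights, bias):
    """Segmentation-network objective on target images: make target
    outputs look like source outputs to the discriminator."""
    return mean_domain_loss(target_logits, SOURCE, weights, bias)

def multi_level_adversarial_loss(levels):
    """Weighted sum of adversarial losses, one term per output level.
    `levels` is a list of (weight, target_logits, disc_weights, disc_bias)."""
    return sum(lam * adversarial_loss(t, w, b) for lam, t, w, b in levels)

# Two pixels, three classes; an untrained discriminator outputs 0.5.
target = [[2.0, 0.5, -1.0], [0.1, 0.2, 0.3]]
loss = multi_level_adversarial_loss([
    (0.5, target, [0.0, 0.0, 0.0], 0.0),   # arbitrary illustrative weights
    (0.25, target, [0.0, 0.0, 0.0], 0.0),
])
```

In a real training loop the discriminator and the segmentation network would be updated alternately, and the adversarial term would be added to the supervised segmentation loss computed on labeled source images.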

Results

Task | Dataset | Metric | Value | Model
--- | --- | --- | --- | ---
Image-to-Image Translation | SYNTHIA-to-Cityscapes | mIoU (13 classes) | 46.7 | Multi-level Adaptation
Image-to-Image Translation | SYNTHIA-to-Cityscapes | mIoU (13 classes) | 45.9 | Single-level Adaptation
Image-to-Image Translation | GTAV-to-Cityscapes Labels | mIoU | 42.4 | AdaptSegNet(multi-level)
Image-to-Image Translation | SYNTHIA-to-Cityscapes | MIoU (13 classes) | 46.7 | AdaptSegNet(Multi-level)
Domain Adaptation | Synscapes-to-Cityscapes | mIoU | 52.7 | AdaptSegNet
Image Generation | SYNTHIA-to-Cityscapes | mIoU (13 classes) | 46.7 | Multi-level Adaptation
Image Generation | SYNTHIA-to-Cityscapes | mIoU (13 classes) | 45.9 | Single-level Adaptation
Image Generation | GTAV-to-Cityscapes Labels | mIoU | 42.4 | AdaptSegNet(multi-level)
Image Generation | SYNTHIA-to-Cityscapes | MIoU (13 classes) | 46.7 | AdaptSegNet(Multi-level)
1 Image, 2*2 Stitching | SYNTHIA-to-Cityscapes | mIoU (13 classes) | 46.7 | Multi-level Adaptation
1 Image, 2*2 Stitching | SYNTHIA-to-Cityscapes | mIoU (13 classes) | 45.9 | Single-level Adaptation
1 Image, 2*2 Stitching | GTAV-to-Cityscapes Labels | mIoU | 42.4 | AdaptSegNet(multi-level)
1 Image, 2*2 Stitching | SYNTHIA-to-Cityscapes | MIoU (13 classes) | 46.7 | AdaptSegNet(Multi-level)
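The results above are reported as mIoU (mean intersection over union), shown in percent. As a rough sketch of how such a score is commonly computed from flattened label maps, the function below averages per-class IoU over the classes that appear in either the prediction or the ground truth. The function name, the `ignore_label=255` default, and the choice to skip absent classes are assumptions for illustration; benchmark toolkits typically accumulate these counts over the whole evaluation set and may handle absent classes differently.

```python
def mean_iou(pred, truth, num_classes, ignore_label=255):
    """Mean IoU over classes present in `pred` or `truth`.

    `pred` and `truth` are equal-length sequences of integer class ids
    (flattened label maps). Pixels whose ground truth equals
    `ignore_label` (an assumed convention) are skipped.
    """
    intersection = [0] * num_classes
    union = [0] * num_classes
    for p, t in zip(pred, truth):
        if t == ignore_label:
            continue
        if p == t:
            # True positive: counts once toward both intersection and union.
            intersection[t] += 1
            union[t] += 1
        else:
            # False negative for the true class, false positive for the
            # predicted class; each enlarges that class's union.
            union[t] += 1
            if 0 <= p < num_classes:
                union[p] += 1
    ious = [i / u for i, u in zip(intersection, union) if u > 0]
    return sum(ious) / len(ious) if ious else 0.0

# Six pixels, three classes: per-class IoU is 1/3, 2/3, and 1/2.
score = mean_iou([0, 0, 1, 1, 2, 2], [0, 1, 1, 1, 2, 0], num_classes=3)
print(f"mIoU = {100 * score:.1f}%")  # prints "mIoU = 50.0%"
```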

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction (2025-07-21)
Deep Learning-Based Fetal Lung Segmentation from Diffusion-weighted MRI Images and Lung Maturity Evaluation for Fetal Growth Restriction (2025-07-17)
DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model (2025-07-17)
From Variability To Accuracy: Conditional Bernoulli Diffusion Models with Consensus-Driven Correction for Thin Structure Segmentation (2025-07-17)
Unleashing Vision Foundation Models for Coronary Artery Segmentation: Parallel ViT-CNN Encoding and Variational Fusion (2025-07-17)
SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation (2025-07-17)
Unified Medical Image Segmentation with State Space Modeling Snake (2025-07-17)
A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique (2025-07-17)