Learning to Adapt Structured Output Space for Semantic Segmentation

Yi-Hsuan Tsai, Wei-Chih Hung, Samuel Schulter, Kihyuk Sohn, Ming-Hsuan Yang, Manmohan Chandraker

2018-02-28CVPR 2018 6Segmentation Semantic Segmentation Synthetic-to-Real Translation Image-to-Image Translation Domain Adaptation

Paper PDF Code Code Code Code Code(official)Code Code Code Code Code Code Code

Abstract

Convolutional neural network-based approaches for semantic segmentation rely on supervision with pixel-level ground truth, but may not generalize well to unseen image domains. As the labeling process is tedious and labor intensive, developing algorithms that can adapt source ground truth labels to the target domain is of great interest. In this paper, we propose an adversarial learning method for domain adaptation in the context of semantic segmentation. Considering semantic segmentations as structured outputs that contain spatial similarities between the source and target domains, we adopt adversarial learning in the output space. To further enhance the adapted model, we construct a multi-level adversarial network to effectively perform output space domain adaptation at different feature levels. Extensive experiments and ablation study are conducted under various domain adaptation settings, including synthetic-to-real and cross-city scenarios. We show that the proposed method performs favorably against the state-of-the-art methods in terms of accuracy and visual quality.

Results

Task	Dataset	Metric	Value	Model
Image-to-Image Translation	SYNTHIA-to-Cityscapes	mIoU (13 classes)	46.7	Multi-level Adaptation
Image-to-Image Translation	SYNTHIA-to-Cityscapes	mIoU (13 classes)	45.9	Single-level Adaptation
Image-to-Image Translation	GTAV-to-Cityscapes Labels	mIoU	42.4	AdaptSegNet(multi-level)
Image-to-Image Translation	SYNTHIA-to-Cityscapes	MIoU (13 classes)	46.7	AdaptSegNet(Multi-level)
Domain Adaptation	Synscapes-to-Cityscapes	mIoU	52.7	AdaptSegNet
Image Generation	SYNTHIA-to-Cityscapes	mIoU (13 classes)	46.7	Multi-level Adaptation
Image Generation	SYNTHIA-to-Cityscapes	mIoU (13 classes)	45.9	Single-level Adaptation
Image Generation	GTAV-to-Cityscapes Labels	mIoU	42.4	AdaptSegNet(multi-level)
Image Generation	SYNTHIA-to-Cityscapes	MIoU (13 classes)	46.7	AdaptSegNet(Multi-level)
1 Image, 2*2 Stitching	SYNTHIA-to-Cityscapes	mIoU (13 classes)	46.7	Multi-level Adaptation
1 Image, 2*2 Stitching	SYNTHIA-to-Cityscapes	mIoU (13 classes)	45.9	Single-level Adaptation
1 Image, 2*2 Stitching	GTAV-to-Cityscapes Labels	mIoU	42.4	AdaptSegNet(multi-level)
1 Image, 2*2 Stitching	SYNTHIA-to-Cityscapes	MIoU (13 classes)	46.7	AdaptSegNet(Multi-level)

Learning to Adapt Structured Output Space for Semantic Segmentation

Abstract

Results

Related Papers

Learning to Adapt Structured Output Space for Semantic Segmentation

Abstract

Results

Related Papers