Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.


Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Exploring High-quality Target Domain Information for Unsupervised Domain Adaptive Semantic Segmentation

Junjie Li, Zilei Wang, Yuan Gao, Xiaoming Hu

2022-08-12 · Semantic Segmentation · Synthetic-to-Real Translation · Contrastive Learning · Image-to-Image Translation · Domain Adaptation

Paper · PDF · Code (official)

Abstract

In unsupervised domain adaptive (UDA) semantic segmentation, distillation-based methods currently dominate in performance. However, the distillation technique requires a complicated multi-stage process and many training tricks. In this paper, we propose a simple yet effective method that achieves performance competitive with advanced distillation methods. Our core idea is to fully exploit target-domain information from the views of boundaries and features. First, we propose a novel mix-up strategy to generate high-quality target-domain boundaries with ground-truth labels. Unlike the source-domain boundaries used in previous works, we select high-confidence target-domain areas and paste them onto source-domain images. This strategy generates object boundaries in the target domain (the edges of target-domain object areas) with correct labels, so the boundary information of the target domain can be effectively captured by learning on the mixed-up samples. Second, we design a multi-level contrastive loss, comprising pixel-level and prototype-level contrastive learning, to improve the representation of target-domain data. By combining the two proposed methods, more discriminative features can be extracted and hard object boundaries can be better addressed for the target domain. Experimental results on two commonly adopted benchmarks (i.e., GTA5 → Cityscapes and SYNTHIA → Cityscapes) show that our method achieves performance competitive with complicated distillation methods. Notably, for the SYNTHIA → Cityscapes scenario, our method achieves state-of-the-art performance with 57.8% mIoU on 16 classes and 64.6% mIoU on 13 classes. Code is available at https://github.com/ljjcoder/EHTDI.
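The mix-up strategy described above can be sketched in a few lines: high-confidence target-domain pixels (as judged by the model's pseudo-label confidence) are pasted onto a source image, so the boundaries of the pasted regions are genuine target-domain object edges with (pseudo-)correct labels. This is a minimal NumPy sketch under assumed conventions, not the paper's exact implementation; the confidence threshold `thresh` is an assumed hyperparameter.

```python
import numpy as np

def target_to_source_mix(src_img, src_lbl, tgt_img, tgt_pseudo, tgt_conf, thresh=0.9):
    """Paste high-confidence target-domain regions onto a source image.

    src_img, tgt_img : (H, W, 3) images
    src_lbl          : (H, W) source ground-truth labels
    tgt_pseudo       : (H, W) target pseudo-labels
    tgt_conf         : (H, W) per-pixel pseudo-label confidence in [0, 1]
    thresh           : assumed confidence cutoff (hypothetical value)
    """
    mask = tgt_conf > thresh                          # boolean paste mask
    mixed_img = np.where(mask[..., None], tgt_img, src_img)  # broadcast over channels
    mixed_lbl = np.where(mask, tgt_pseudo, src_lbl)
    return mixed_img, mixed_lbl
```

Training on `(mixed_img, mixed_lbl)` then exposes the network to target-domain object boundaries paired with reliable labels, which source-paste mix-up (e.g. ClassMix-style copying from source to target) does not provide.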
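The prototype-level half of the multi-level contrastive loss can be illustrated with a standard InfoNCE-style formulation: each pixel feature is pulled toward its class prototype and pushed away from the others. This is a generic sketch of that idea in NumPy, not the paper's exact loss; the temperature `tau` is an assumed hyperparameter.

```python
import numpy as np

def prototype_contrastive_loss(feats, labels, prototypes, tau=0.1):
    """InfoNCE-style prototype contrastive loss (illustrative sketch).

    feats      : (N, D) pixel features
    labels     : (N,)   class index per pixel
    prototypes : (C, D) per-class prototype features
    tau        : assumed temperature hyperparameter
    """
    feats = feats / np.linalg.norm(feats, axis=1, keepdims=True)          # L2-normalise
    protos = prototypes / np.linalg.norm(prototypes, axis=1, keepdims=True)
    logits = feats @ protos.T / tau                  # (N, C) cosine similarities / tau
    logits -= logits.max(axis=1, keepdims=True)      # numerical stability
    log_probs = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    return -log_probs[np.arange(len(labels)), labels].mean()              # cross-entropy
```

Pixel-level contrastive learning follows the same pattern with individual pixel features of the same class as positives instead of prototypes; prototypes would typically be running class-wise means of (pseudo-)labelled features.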

Results

Task | Dataset | Metric | Value | Model
Image-to-Image Translation | GTAV-to-Cityscapes Labels | mIoU | 62 | EHTDI*
Image-to-Image Translation | GTAV-to-Cityscapes Labels | mIoU | 58.8 | EHTDI (ResNet-101)
Image-to-Image Translation | SYNTHIA-to-Cityscapes | mIoU (13 classes) | 69.2 | EHTDI*
Image-to-Image Translation | SYNTHIA-to-Cityscapes | mIoU (16 classes) | 61.3 | EHTDI*
Image-to-Image Translation | SYNTHIA-to-Cityscapes | mIoU (13 classes) | 64.6 | EHTDI
Image-to-Image Translation | SYNTHIA-to-Cityscapes | mIoU (16 classes) | 57.8 | EHTDI
Domain Adaptation | GTA5 to Cityscapes | mIoU | 62 | EHTDI*

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction (2025-07-21)
DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model (2025-07-17)
SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation (2025-07-17)
Unified Medical Image Segmentation with State Space Modeling Snake (2025-07-17)
A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique (2025-07-17)
SemCSE: Semantic Contrastive Sentence Embeddings Using LLM-Generated Summaries For Scientific Abstracts (2025-07-17)
HapticCap: A Multimodal Dataset and Task for Understanding User Experience of Vibration Haptic Signals (2025-07-17)
Overview of the TalentCLEF 2025: Skill and Job Title Intelligence for Human Capital Management (2025-07-17)