Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.


Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.


PiPa: Pixel- and Patch-wise Self-supervised Learning for Domain Adaptative Semantic Segmentation

Mu Chen, Zhedong Zheng, Yi Yang, Tat-Seng Chua

2022-11-14 · Self-Supervised Learning · Semantic Segmentation · Synthetic-to-Real Translation · Unsupervised Domain Adaptation · Image-to-Image Translation · Domain Adaptation

Paper · PDF · Code (official)

Abstract

Unsupervised Domain Adaptation (UDA) aims to enhance the generalization of a learned model to other domains. Domain-invariant knowledge is transferred from a model trained on a labeled source domain, e.g., video-game imagery, to unlabeled target domains, e.g., real-world scenarios, saving annotation expenses. Existing UDA methods for semantic segmentation usually focus on minimizing the inter-domain discrepancy at various levels, e.g., pixels, features, and predictions, to extract domain-invariant knowledge. However, primary intra-domain knowledge, such as context correlation inside an image, remains underexplored. To fill this gap, we propose a unified pixel- and patch-wise self-supervised learning framework, called PiPa, for domain adaptive semantic segmentation that facilitates intra-image pixel-wise correlations and patch-wise semantic consistency against different contexts. The proposed framework exploits the inherent structure of intra-domain images: it (1) explicitly encourages learning discriminative pixel-wise features with intra-class compactness and inter-class separability, and (2) motivates robust feature learning of the identical patch against different contexts or fluctuations. Extensive experiments verify the effectiveness of the proposed method, which obtains competitive accuracy on the two widely used UDA benchmarks: 75.6 mIoU on GTA-to-Cityscapes and 68.2 mIoU on SYNTHIA-to-Cityscapes. Moreover, our method is compatible with other UDA approaches, further improving performance without introducing extra parameters.
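The two objectives the abstract describes lend themselves to a compact sketch. Below is a minimal PyTorch illustration of (1) a pixel-wise supervised contrastive loss that pulls same-class pixel embeddings together and pushes different-class embeddings apart, and (2) a patch-wise consistency loss that compares features of the same region seen inside two overlapping crops, i.e., against different contexts. The function names, the temperature, and the cropping scheme are illustrative assumptions, not the authors' exact implementation.

```python
# Illustrative sketch only; hyperparameters and sampling strategy are assumptions.
import torch
import torch.nn.functional as F


def pixel_contrast_loss(feats: torch.Tensor, labels: torch.Tensor,
                        temperature: float = 0.1) -> torch.Tensor:
    """Pixel-wise contrastive objective: intra-class compactness,
    inter-class separability.

    feats:  (N, D) pixel embeddings sampled from a feature map.
    labels: (N,)   class index of each sampled pixel.
    """
    feats = F.normalize(feats, dim=1)
    sim = feats @ feats.t() / temperature                      # (N, N) similarities
    sim = sim - sim.max(dim=1, keepdim=True).values.detach()   # numerical stability
    n = feats.size(0)
    self_mask = torch.eye(n, dtype=torch.bool, device=feats.device)
    pos = (labels[:, None] == labels[None, :]) & ~self_mask    # same-class pairs
    # Softmax denominator excludes the self-similarity term.
    exp_sim = sim.exp().masked_fill(self_mask, 0.0)
    log_prob = sim - exp_sim.sum(dim=1, keepdim=True).log()
    pos_counts = pos.sum(dim=1)
    valid = pos_counts > 0                                     # anchors with a positive
    loss = -(pos.float() * log_prob).sum(dim=1)[valid] / pos_counts[valid]
    return loss.mean()


def patch_consistency_loss(feat_a: torch.Tensor, feat_b: torch.Tensor,
                           box_a: tuple, box_b: tuple) -> torch.Tensor:
    """Patch-wise consistency: the shared region of two overlapping crops
    should yield the same features despite the different surrounding context.

    feat_a, feat_b: (1, D, H, W) feature maps of the two crops.
    box_a, box_b:   (top, left, height, width) of the shared region, in each
                    crop's own feature-map coordinates.
    """
    def region(feat, box):
        t, l, h, w = box
        return feat[..., t:t + h, l:l + w]

    ra, rb = region(feat_a, box_a), region(feat_b, box_b)
    # Crops may differ in scale, so resample to a common grid before comparing.
    rb = F.interpolate(rb, size=ra.shape[-2:], mode="bilinear", align_corners=False)
    ra, rb = F.normalize(ra, dim=1), F.normalize(rb, dim=1)
    return (1.0 - (ra * rb).sum(dim=1)).mean()                 # 1 - cosine similarity
```

In a training loop, these two terms would be added, with weighting coefficients, to the usual segmentation/self-training loss; the weighting is likewise an assumption here.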

Results

Leaderboards: Semantic Segmentation, Domain Adaptation, Unsupervised Domain Adaptation, Image-to-Image Translation

Dataset | Metric | Value | Model
GTAV-to-Cityscapes Labels | mIoU | 75.6 | HRDA + PiPa
GTAV-to-Cityscapes Labels | mIoU | 71.7 | DAFormer + PiPa
SYNTHIA-to-Cityscapes | mIoU (13 classes) | 74.8 | HRDA + PiPa
SYNTHIA-to-Cityscapes | mIoU (16 classes) | 68.2 | HRDA + PiPa
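All values above are mean Intersection-over-Union. As a reference for how the metric behind these numbers is typically computed, here is a small NumPy sketch; `mean_iou` and its `ignore_index` default follow common conventions (255 is the usual Cityscapes ignore label) rather than any benchmark's official evaluation code.

```python
import numpy as np


def mean_iou(pred: np.ndarray, target: np.ndarray,
             num_classes: int, ignore_index: int = 255) -> float:
    """Compute mIoU from flat per-pixel predictions and labels.

    pred, target: 1-D integer arrays of class indices.
    Pixels labeled `ignore_index` are skipped.
    """
    valid = target != ignore_index
    pred, target = pred[valid], target[valid]
    # Confusion matrix: rows = ground truth, columns = prediction.
    cm = np.bincount(target * num_classes + pred,
                     minlength=num_classes ** 2).reshape(num_classes, num_classes)
    intersection = np.diag(cm)
    union = cm.sum(axis=0) + cm.sum(axis=1) - intersection
    iou = intersection / np.maximum(union, 1)   # avoid division by zero
    present = union > 0                         # average only over classes that occur
    return float(iou[present].mean())
```

On SYNTHIA-to-Cityscapes, the mean is taken over either 16 or 13 of the Cityscapes classes, which is why the table lists both variants.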

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction (2025-07-21)
A Semi-Supervised Learning Method for the Identification of Bad Exposures in Large Imaging Surveys (2025-07-17)
DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model (2025-07-17)
SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation (2025-07-17)
Unified Medical Image Segmentation with State Space Modeling Snake (2025-07-17)
A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique (2025-07-17)
SAMST: A Transformer framework based on SAM pseudo label filtering for remote sensing semi-supervised semantic segmentation (2025-07-16)
Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping (2025-07-15)