MRFP: Learning Generalizable Semantic Segmentation from Sim-2-Real with Multi-Resolution Feature Perturbation

Sumanth Udupa, Prajwal Gurunath, Aniruddh Sikdar, Suresh Sundaram

2023-11-30CVPR 2024 12D Semantic Segmentation Autonomous Vehicles Domain Generalization Autonomous Driving Semantic Segmentation

Paper PDF Code(official)

Abstract

Deep neural networks have shown exemplary performance on semantic scene understanding tasks on source domains, but due to the absence of style diversity during training, enhancing performance on unseen target domains using only single source domain data remains a challenging task. Generation of simulated data is a feasible alternative to retrieving large style-diverse real-world datasets as it is a cumbersome and budget-intensive process. However, the large domain-specfic inconsistencies between simulated and real-world data pose a significant generalization challenge in semantic segmentation. In this work, to alleviate this problem, we propose a novel MultiResolution Feature Perturbation (MRFP) technique to randomize domain-specific fine-grained features and perturb style of coarse features. Our experimental results on various urban-scene segmentation datasets clearly indicate that, along with the perturbation of style-information, perturbation of fine-feature components is paramount to learn domain invariant robust feature maps for semantic segmentation models. MRFP is a simple and computationally efficient, transferable module with no additional learnable parameters or objective functions, that helps state-of-the-art deep neural networks to learn robust domain invariant features for simulation-to-real semantic segmentation.

Results

Task	Dataset	Metric	Value	Model
Semantic Segmentation	Mapillary val	mIoU	44.93	MRFP+(Ours) Resnet50
Semantic Segmentation	Mapillary val	mIoU	32.93	Resnet50
Semantic Segmentation	Cityscapes val	mIoU	42.4	MRFP+(Ours) Resnet50
Semantic Segmentation	Cityscapes val	mIoU	34.66	Resnet50
Semantic Segmentation	BDD100K val	mIoU	39.55	MRFP+(Ours) Resnet50
Semantic Segmentation	BDD100K val	mIoU	31.44	Resnet50
Semantic Segmentation	SYNTHIA	mIoU	30.22	MRFP+(Ours) Resnet50
Semantic Segmentation	SYNTHIA	mIoU	25.84	Resnet50
10-shot image generation	Mapillary val	mIoU	44.93	MRFP+(Ours) Resnet50
10-shot image generation	Mapillary val	mIoU	32.93	Resnet50
10-shot image generation	Cityscapes val	mIoU	42.4	MRFP+(Ours) Resnet50
10-shot image generation	Cityscapes val	mIoU	34.66	Resnet50
10-shot image generation	BDD100K val	mIoU	39.55	MRFP+(Ours) Resnet50
10-shot image generation	BDD100K val	mIoU	31.44	Resnet50
10-shot image generation	SYNTHIA	mIoU	30.22	MRFP+(Ours) Resnet50
10-shot image generation	SYNTHIA	mIoU	25.84	Resnet50

MRFP: Learning Generalizable Semantic Segmentation from Sim-2-Real with Multi-Resolution Feature Perturbation

Abstract

Results

Related Papers

MRFP: Learning Generalizable Semantic Segmentation from Sim-2-Real with Multi-Resolution Feature Perturbation

Abstract

Results

Related Papers