TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/MRFP: Learning Generalizable Semantic Segmentation from Si...

MRFP: Learning Generalizable Semantic Segmentation from Sim-2-Real with Multi-Resolution Feature Perturbation

Sumanth Udupa, Prajwal Gurunath, Aniruddh Sikdar, Suresh Sundaram

2023-11-30CVPR 2024 12D Semantic SegmentationAutonomous VehiclesDomain GeneralizationAutonomous DrivingSemantic Segmentation
PaperPDFCode(official)

Abstract

Deep neural networks have shown exemplary performance on semantic scene understanding tasks on source domains, but due to the absence of style diversity during training, enhancing performance on unseen target domains using only single source domain data remains a challenging task. Generation of simulated data is a feasible alternative to retrieving large style-diverse real-world datasets as it is a cumbersome and budget-intensive process. However, the large domain-specfic inconsistencies between simulated and real-world data pose a significant generalization challenge in semantic segmentation. In this work, to alleviate this problem, we propose a novel MultiResolution Feature Perturbation (MRFP) technique to randomize domain-specific fine-grained features and perturb style of coarse features. Our experimental results on various urban-scene segmentation datasets clearly indicate that, along with the perturbation of style-information, perturbation of fine-feature components is paramount to learn domain invariant robust feature maps for semantic segmentation models. MRFP is a simple and computationally efficient, transferable module with no additional learnable parameters or objective functions, that helps state-of-the-art deep neural networks to learn robust domain invariant features for simulation-to-real semantic segmentation.

Results

TaskDatasetMetricValueModel
Semantic SegmentationMapillary valmIoU44.93MRFP+(Ours) Resnet50
Semantic SegmentationMapillary valmIoU32.93Resnet50
Semantic SegmentationCityscapes valmIoU42.4MRFP+(Ours) Resnet50
Semantic SegmentationCityscapes valmIoU34.66Resnet50
Semantic SegmentationBDD100K valmIoU39.55MRFP+(Ours) Resnet50
Semantic SegmentationBDD100K valmIoU31.44Resnet50
Semantic SegmentationSYNTHIAmIoU30.22MRFP+(Ours) Resnet50
Semantic SegmentationSYNTHIAmIoU25.84Resnet50
10-shot image generationMapillary valmIoU44.93MRFP+(Ours) Resnet50
10-shot image generationMapillary valmIoU32.93Resnet50
10-shot image generationCityscapes valmIoU42.4MRFP+(Ours) Resnet50
10-shot image generationCityscapes valmIoU34.66Resnet50
10-shot image generationBDD100K valmIoU39.55MRFP+(Ours) Resnet50
10-shot image generationBDD100K valmIoU31.44Resnet50
10-shot image generationSYNTHIAmIoU30.22MRFP+(Ours) Resnet50
10-shot image generationSYNTHIAmIoU25.84Resnet50

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21GEMINUS: Dual-aware Global and Scene-Adaptive Mixture-of-Experts for End-to-End Autonomous Driving2025-07-19AGENTS-LLM: Augmentative GENeration of Challenging Traffic Scenarios with an Agentic LLM Framework2025-07-18Simulate, Refocus and Ensemble: An Attention-Refocusing Scheme for Domain Generalization2025-07-17GLAD: Generalizable Tuning for Vision-Language Models2025-07-17MoTM: Towards a Foundation Model for Time Series Imputation based on Continuous Modeling2025-07-17World Model-Based End-to-End Scene Generation for Accident Anticipation in Autonomous Driving2025-07-17Orbis: Overcoming Challenges of Long-Horizon Prediction in Driving World Models2025-07-17