
Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Missing Modality Robustness in Semi-Supervised Multi-Modal Semantic Segmentation

Harsh Maheshwari, Yen-Cheng Liu, Zsolt Kira

Date: 2023-04-21
Tasks: Semi-Supervised Semantic Segmentation, Semi-Supervised RGBD Semantic Segmentation, Segmentation, Semantic Segmentation, Robust Semi-Supervised RGBD Semantic Segmentation

Abstract

Using multiple spatial modalities has proven helpful in improving semantic segmentation performance. However, several real-world challenges have yet to be addressed: (a) improving label efficiency and (b) enhancing robustness in realistic scenarios where modalities are missing at test time. To address these challenges, we first propose a simple yet efficient multi-modal fusion mechanism, Linear Fusion, that performs better than state-of-the-art multi-modal models even with limited supervision. Second, we propose M3L: Multi-modal Teacher for Masked Modality Learning, a semi-supervised framework that not only improves multi-modal performance but also uses unlabeled data to make the model robust to the realistic missing-modality scenario. We create the first benchmark for semi-supervised multi-modal semantic segmentation and also report robustness to missing modalities. Our proposal shows an absolute improvement of up to 10% in robust mIoU over the most competitive baselines. Our code is available at https://github.com/harshm121/M3L

Results

Task                   Dataset              Metric                        Value  Model
Semantic Segmentation  SUN-RGBD             Mean IoU (test)               48.17  DFormer-L
Semantic Segmentation  Stanford2D3D - RGBD  mIoU                          57.16  Linear Fusion (Segformer B2)
Semantic Segmentation  2D-3D-S              mIoU (0.1% labels)            40.05  M3L (Linear Fusion B2)
Semantic Segmentation  2D-3D-S              mIoU (0.2% labels)            44.62  M3L (Linear Fusion B2)
Semantic Segmentation  2D-3D-S              mIoU (1% labels)              49.28  M3L (Linear Fusion B2)
Semantic Segmentation  Stanford 2D-3D       MM-Robust mIoU (0.1% labels)  41.36  M3L (Linear Fusion - Segformer B2)
Semantic Segmentation  Stanford 2D-3D       mIoU (0.1% labels)            44.1   M3L (Linear Fusion - Segformer B2)
Semantic Segmentation  Stanford 2D-3D       mIoU (0.1% labels)            41.7   Mean Teacher (Linear Fusion - Segformer B2)
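
The mIoU values reported above are per-class intersection-over-union scores averaged across classes. As a minimal sketch of that computation (this is not the authors' evaluation code; the function name `mean_iou`, its parameters, and the use of flat label lists are assumptions made for illustration), one might write:

```python
def mean_iou(pred, target, num_classes, ignore_index=None):
    """Mean IoU over the classes that occur in pred or target.

    pred and target are flat sequences of integer class labels of equal
    length (hypothetical inputs; real pipelines usually accumulate these
    counts over every pixel of the whole test set before averaging).
    """
    intersection = [0] * num_classes
    union = [0] * num_classes
    for p, t in zip(pred, target):
        # Skip pixels whose ground-truth label is marked as "ignore".
        if ignore_index is not None and t == ignore_index:
            continue
        if p == t:
            # Correct pixel: counts once toward both intersection and union.
            intersection[p] += 1
            union[p] += 1
        else:
            # Wrong pixel: enlarges the union of the predicted class
            # and of the true class, but neither intersection.
            union[p] += 1
            union[t] += 1
    # Classes that never occur are left out of the average.
    ious = [i / u for i, u in zip(intersection, union) if u > 0]
    return sum(ious) / len(ious) if ious else float("nan")


score = mean_iou([0, 0, 1, 1], [0, 1, 1, 1], num_classes=2)
print(f"{100 * score:.2f}")  # class IoUs are 1/2 and 2/3
```

Leaving absent classes out of the average is one common convention; benchmark toolkits differ on this point, so scores computed this way may not match the table exactly.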
