TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Complementary Random Masking for RGB-Thermal Semantic Segm...

Complementary Random Masking for RGB-Thermal Semantic Segmentation

Ukcheol Shin, Kyunghyun Lee, In So Kweon, Jean Oh

2023-03-30Thermal Image SegmentationScene UnderstandingSemantic Segmentation
PaperPDFCode(official)

Abstract

RGB-thermal semantic segmentation is one potential solution to achieve reliable semantic scene understanding in adverse weather and lighting conditions. However, the previous studies mostly focus on designing a multi-modal fusion module without consideration of the nature of multi-modality inputs. Therefore, the networks easily become over-reliant on a single modality, making it difficult to learn complementary and meaningful representations for each modality. This paper proposes 1) a complementary random masking strategy of RGB-T images and 2) self-distillation loss between clean and masked input modalities. The proposed masking strategy prevents over-reliance on a single modality. It also improves the accuracy and robustness of the neural network by forcing the network to segment and classify objects even when one modality is partially available. Also, the proposed self-distillation loss encourages the network to extract complementary and meaningful representations from a single modality or complementary masked modalities. Based on the proposed method, we achieve state-of-the-art performance over three RGB-T semantic segmentation benchmarks. Our source code is available at https://github.com/UkcheolShin/CRM_RGBTSeg.

Results

TaskDatasetMetricValueModel
Semantic SegmentationKP day-nightmIoU55.2CRM_RGBTSeg
Semantic SegmentationPST900mIoU88CRM_RGBTSeg
Semantic SegmentationMFN DatasetmIOU61.4CRM_RGBT_Seg
Scene SegmentationKP day-nightmIoU55.2CRM_RGBTSeg
Scene SegmentationPST900mIoU88CRM_RGBTSeg
Scene SegmentationMFN DatasetmIOU61.4CRM_RGBT_Seg
2D Object DetectionKP day-nightmIoU55.2CRM_RGBTSeg
2D Object DetectionPST900mIoU88CRM_RGBTSeg
2D Object DetectionMFN DatasetmIOU61.4CRM_RGBT_Seg
10-shot image generationKP day-nightmIoU55.2CRM_RGBTSeg
10-shot image generationPST900mIoU88CRM_RGBTSeg
10-shot image generationMFN DatasetmIOU61.4CRM_RGBT_Seg

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21Advancing Complex Wide-Area Scene Understanding with Hierarchical Coresets Selection2025-07-17Argus: Leveraging Multiview Images for Improved 3-D Scene Understanding With Large Language Models2025-07-17City-VLM: Towards Multidomain Perception Scene Understanding via Multimodal Incomplete Learning2025-07-17DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique2025-07-17