Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.


Edge-aware Guidance Fusion Network for RGB Thermal Scene Parsing

WuJie Zhou, Shaohua Dong, Caie Xu, Yaguan Qian

2021-12-09 · Scene Parsing, Thermal Image Segmentation

Abstract

RGB thermal scene parsing has recently attracted increasing research interest in computer vision. However, most existing methods extract boundaries poorly in their prediction maps and do not make full use of high-level features. In addition, these methods fuse the features from the RGB and thermal modalities in a simple manner and therefore fail to obtain comprehensive fused features. To address these problems, we propose an edge-aware guidance fusion network (EGFNet) for RGB thermal scene parsing. First, we introduce a prior edge map generated from the RGB and thermal images to capture detailed information in the prediction map, and we embed this prior edge information in the feature maps. To fuse the RGB and thermal information effectively, we propose a multimodal fusion module designed to ensure adequate cross-modal fusion. Considering the importance of high-level semantic information, we propose a global information module and a semantic information module to extract rich semantic information from the high-level features. For decoding, we use simple elementwise addition for cascaded feature fusion. Finally, to improve the parsing accuracy, we apply multitask deep supervision to the semantic and boundary maps. Extensive experiments on benchmark datasets demonstrate the effectiveness of the proposed EGFNet and its superior performance compared with state-of-the-art methods. The code and results can be found at https://github.com/ShaohuaDong2021/EGFNet.
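The decoding step mentioned in the abstract (elementwise addition for cascaded feature fusion) can be illustrated with a small sketch. This is not the authors' implementation: the function names, the single-channel 2D-list feature maps, the factor-of-two resolution pyramid, and the nearest-neighbour upsampling are all assumptions made for illustration, since the abstract does not specify them.

```python
from typing import List

Grid = List[List[float]]  # one single-channel feature map (illustrative simplification)


def upsample2x(feat: Grid) -> Grid:
    """Nearest-neighbour 2x upsampling (assumed; the abstract does not name the method)."""
    out: Grid = []
    for row in feat:
        wide = [v for v in row for _ in range(2)]  # repeat each value horizontally
        out.append(wide)
        out.append(list(wide))                     # repeat the row vertically
    return out


def add(a: Grid, b: Grid) -> Grid:
    """Elementwise addition of two feature maps of identical shape."""
    if len(a) != len(b) or any(len(ra) != len(rb) for ra, rb in zip(a, b)):
        raise ValueError("feature maps must have the same shape")
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]


def cascade_decode(features: List[Grid]) -> Grid:
    """Fuse features ordered shallow (high resolution) to deep (low resolution).

    Each level is assumed to be half the size of the previous one. The deepest
    map is upsampled and added to the next shallower map, repeatedly.
    """
    fused = features[-1]
    for skip in reversed(features[:-1]):
        fused = add(upsample2x(fused), skip)
    return fused


# Toy three-level pyramid: 4x4, 2x2, and 1x1 maps with constant values.
f1 = [[1.0] * 4 for _ in range(4)]
f2 = [[2.0] * 2 for _ in range(2)]
f3 = [[3.0]]
decoded = cascade_decode([f1, f2, f3])
print(decoded[0])  # every entry is 3 + 2 + 1 = 6.0
```

In a real network each level would be a multi-channel tensor and the upsampling would typically be bilinear interpolation or a learned layer; the sketch only shows the order of the additions.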

Results

Task | Dataset | Metric | Value | Model
Semantic Segmentation | PST900 | mIoU | 78.51 | EGFNet
Semantic Segmentation | MFN Dataset | mIoU | 54.8 | EGFNet
Scene Segmentation | PST900 | mIoU | 78.51 | EGFNet
Scene Segmentation | MFN Dataset | mIoU | 54.8 | EGFNet
2D Object Detection | PST900 | mIoU | 78.51 | EGFNet
2D Object Detection | MFN Dataset | mIoU | 54.8 | EGFNet
10-shot image generation | PST900 | mIoU | 78.51 | EGFNet
10-shot image generation | MFN Dataset | mIoU | 54.8 | EGFNet
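
The metric reported above, mIoU (mean intersection over union), averages the per-class ratio of correctly labelled pixels to the pixels labelled as that class in either the prediction or the ground truth. The following is a minimal sketch of that calculation on flattened label lists. The function name `mean_iou` is hypothetical, and skipping classes that appear in neither list is an assumption; benchmark evaluation scripts usually accumulate counts over the whole test set and may treat absent or ignored classes differently.

```python
from typing import Sequence


def mean_iou(pred: Sequence[int], target: Sequence[int], num_classes: int) -> float:
    """Mean IoU over classes that occur in the prediction or the ground truth."""
    if len(pred) != len(target):
        raise ValueError("pred and target must have the same length")

    inter = [0] * num_classes  # pixels where prediction and ground truth agree
    union = [0] * num_classes  # pixels labelled as the class in either map

    for p, t in zip(pred, target):
        if p == t:
            inter[p] += 1
            union[p] += 1
        else:
            union[p] += 1
            union[t] += 1

    # Assumption: classes absent from both maps are left out of the average.
    ious = [i / u for i, u in zip(inter, union) if u > 0]
    if not ious:
        raise ValueError("no class present in pred or target")
    return sum(ious) / len(ious)


# Toy example with three classes and six pixels.
pred = [0, 0, 1, 1, 2, 2]
target = [0, 1, 1, 1, 2, 0]
score = mean_iou(pred, target, num_classes=3)
print(f"mIoU = {100 * score:.2f}")  # per-class IoUs are 1/3, 2/3, 1/2
```

Multiplying by 100 puts the result on the percentage scale used in the table.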

Related Papers

A Comprehensive Survey on Video Scene Parsing: Advances, Challenges, and Prospects (2025-06-16)
DepthMatch: Semi-Supervised RGB-D Scene Parsing through Depth-Guided Regularization (2025-05-26)
Boosting Cross-spectral Unsupervised Domain Adaptation for Thermal Semantic Segmentation (2025-05-11)
MCCD: Multi-Agent Collaboration-based Compositional Diffusion for Complex Text-to-Image Generation (2025-05-05)
Unveiling the Potential of Segment Anything Model 2 for RGB-Thermal Semantic Segmentation with Language Guidance (2025-03-04)
Fully Exploiting Vision Foundation Model's Profound Prior Knowledge for Generalizable RGB-Depth Driving Scene Parsing (2025-02-10)
Hardware implementation of timely reliable Bayesian decision-making using memristors (2024-12-07)
OLAF: A Plug-and-Play Framework for Enhanced Multi-object Multi-part Scene Parsing (2024-11-05)