Weiyue Wang, Ulrich Neumann
Convolutional neural networks (CNNs) are limited in their ability to handle geometric information because of their fixed grid kernel structure. The availability of depth data enables progress in RGB-D semantic segmentation with CNNs. State-of-the-art methods either use depth as an additional image channel or process spatial information in 3D volumes or point clouds; these approaches suffer from high computation and memory cost. To address these issues, we present Depth-aware CNN, which introduces two intuitive, flexible, and effective operations: depth-aware convolution and depth-aware average pooling. By weighting information propagation between pixels according to their depth similarity, geometry is seamlessly incorporated into the CNN. Both operators introduce no additional parameters and can be easily integrated into existing CNNs. Extensive experiments and ablation studies on challenging RGB-D semantic segmentation benchmarks validate the effectiveness and flexibility of our approach.
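The two operators can be sketched in NumPy for a single-channel feature map. The key idea is a depth-similarity term exp(-alpha * |D(p0) - D(pi)|) that down-weights neighbors across depth discontinuities. Note the default `alpha=1.0` and the use of the window's mean depth as the pooling reference are assumptions of this sketch, not the paper's exact implementation:

```python
import numpy as np

def depth_similarity(d_ref, d_patch, alpha=1.0):
    # exp(-alpha * |D(p0) - D(pi)|): neighbors at a depth similar to the
    # reference keep near-full weight; depth discontinuities are suppressed.
    # alpha=1.0 is an illustrative default, not the paper's constant.
    return np.exp(-alpha * np.abs(d_ref - d_patch))

def depth_aware_conv2d(x, depth, kernel, alpha=1.0):
    """Depth-aware convolution on a single-channel map (valid padding)."""
    k = kernel.shape[0]
    r = k // 2
    H, W = x.shape
    out = np.zeros((H - 2 * r, W - 2 * r))
    for i in range(r, H - r):
        for j in range(r, W - r):
            patch = x[i - r:i + r + 1, j - r:j + r + 1]
            dpatch = depth[i - r:i + r + 1, j - r:j + r + 1]
            # modulate each learned kernel weight by depth similarity
            # to the window center pixel
            sim = depth_similarity(depth[i, j], dpatch, alpha)
            out[i - r, j - r] = np.sum(kernel * sim * patch)
    return out

def depth_aware_avg_pool(x, depth, k=2, alpha=1.0):
    """Depth-aware average pooling over non-overlapping k x k windows."""
    H, W = x.shape
    out = np.zeros((H // k, W // k))
    for i in range(0, H - k + 1, k):
        for j in range(0, W - k + 1, k):
            patch = x[i:i + k, j:j + k]
            dpatch = depth[i:i + k, j:j + k]
            # reference depth = window mean (sketch assumption; an even k
            # has no exact center pixel)
            sim = depth_similarity(dpatch.mean(), dpatch, alpha)
            out[i // k, j // k] = np.sum(sim * patch) / np.sum(sim)
    return out
```

When the depth map is constant, the similarity term is 1 everywhere and both operators reduce to their ordinary counterparts, which is why no extra parameters are needed.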
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Semantic Segmentation | Stanford2D3D - RGBD | Pixel Accuracy | 65.4 | Depth-aware CNN |
| Semantic Segmentation | Stanford2D3D - RGBD | mAcc | 55.5 | Depth-aware CNN |
| Semantic Segmentation | Stanford2D3D - RGBD | mIoU | 39.5 | Depth-aware CNN |
| Semantic Segmentation | MFN Dataset | mIoU | 46.1 | Depth-aware CNN |