TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/ShapeConv: Shape-aware Convolutional Layer for Indoor RGB-...

ShapeConv: Shape-aware Convolutional Layer for Indoor RGB-D Semantic Segmentation

Jinming Cao, Hanchao Leng, Dani Lischinski, Danny Cohen-Or, Changhe Tu, Yangyan Li

2021-08-24ICCV 2021 10Thermal Image SegmentationSegmentationSemantic Segmentation
PaperPDFCode(official)

Abstract

RGB-D semantic segmentation has attracted increasing attention over the past few years. Existing methods mostly employ homogeneous convolution operators to consume the RGB and depth features, ignoring their intrinsic differences. In fact, the RGB values capture the photometric appearance properties in the projected image space, while the depth feature encodes both the shape of a local geometry as well as the base (whereabout) of it in a larger context. Compared with the base, the shape probably is more inherent and has a stronger connection to the semantics, and thus is more critical for segmentation accuracy. Inspired by this observation, we introduce a Shape-aware Convolutional layer (ShapeConv) for processing the depth feature, where the depth feature is firstly decomposed into a shape-component and a base-component, next two learnable weights are introduced to cooperate with them independently, and finally a convolution is applied on the re-weighted combination of these two components. ShapeConv is model-agnostic and can be easily integrated into most CNNs to replace vanilla convolutional layers for semantic segmentation. Extensive experiments on three challenging indoor RGB-D semantic segmentation benchmarks, i.e., NYU-Dv2(-13,-40), SUN RGB-D, and SID, demonstrate the effectiveness of our ShapeConv when employing it over five popular architectures. Moreover, the performance of CNNs with ShapeConv is boosted without introducing any computation and memory increase in the inference phase. The reason is that the learnt weights for balancing the importance between the shape and base components in ShapeConv become constants in the inference phase, and thus can be fused into the following convolution, resulting in a network that is identical to one with vanilla convolutional layers.

Results

TaskDatasetMetricValueModel
Semantic SegmentationLLRGBD-syntheticmIoU63.26ShapeConv (ResNeXt-101)
Semantic SegmentationGAMUSmIoU55.86ShapeConv
Semantic SegmentationStanford2D3D - RGBDPixel Accuracy82.7ShapeConv-101
Semantic SegmentationStanford2D3D - RGBDmAcc70ShapeConv-101
Semantic SegmentationStanford2D3D - RGBDmIoU60.6ShapeConv-101
Semantic SegmentationRGB-T-Glass-SegmentationMAE0.054ShapeConv
Scene SegmentationRGB-T-Glass-SegmentationMAE0.054ShapeConv
2D Object DetectionRGB-T-Glass-SegmentationMAE0.054ShapeConv
10-shot image generationLLRGBD-syntheticmIoU63.26ShapeConv (ResNeXt-101)
10-shot image generationGAMUSmIoU55.86ShapeConv
10-shot image generationStanford2D3D - RGBDPixel Accuracy82.7ShapeConv-101
10-shot image generationStanford2D3D - RGBDmAcc70ShapeConv-101
10-shot image generationStanford2D3D - RGBDmIoU60.6ShapeConv-101
10-shot image generationRGB-T-Glass-SegmentationMAE0.054ShapeConv

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21Deep Learning-Based Fetal Lung Segmentation from Diffusion-weighted MRI Images and Lung Maturity Evaluation for Fetal Growth Restriction2025-07-17DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17From Variability To Accuracy: Conditional Bernoulli Diffusion Models with Consensus-Driven Correction for Thin Structure Segmentation2025-07-17Unleashing Vision Foundation Models for Coronary Artery Segmentation: Parallel ViT-CNN Encoding and Variational Fusion2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique2025-07-17