
Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.


NDC-Scene: Boost Monocular 3D Semantic Scene Completion in Normalized Device Coordinates Space

Jiawei Yao, Chuming Li, Keqiang Sun, Yingjie Cai, Hao Li, Wanli Ouyang, Hongsheng Li

2023-09-26 · ICCV 2023 · 3D Semantic Scene Completion from a single RGB image · 3D Semantic Scene Completion

Abstract

Monocular 3D Semantic Scene Completion (SSC) has garnered significant attention in recent years due to its potential to predict complex semantics and geometry shapes from a single image, requiring no 3D inputs. In this paper, we identify several critical issues in current state-of-the-art methods, including the Feature Ambiguity of projected 2D features in the ray to the 3D space, the Pose Ambiguity of the 3D convolution, and the Computation Imbalance in the 3D convolution across different depth levels. To address these problems, we devise a novel Normalized Device Coordinates scene completion network (NDC-Scene) that directly extends the 2D feature map to a Normalized Device Coordinates (NDC) space, rather than to the world space directly, through progressive restoration of the dimension of depth with deconvolution operations. Experimental results demonstrate that transferring the majority of computation from the target 3D space to the proposed normalized device coordinates space benefits monocular SSC tasks. Additionally, we design a Depth-Adaptive Dual Decoder to simultaneously upsample and fuse the 2D and 3D feature maps, further improving overall performance. Our extensive experiments confirm that the proposed method consistently outperforms state-of-the-art methods on both outdoor SemanticKITTI and indoor NYUv2 datasets. Our code is available at https://github.com/Jiawei-Yao0812/NDCScene.
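
To make the distinction between NDC space and the target 3D space more concrete, here is a minimal sketch of mapping points from a normalized cube to camera-space coordinates. It is not taken from the NDC-Scene code. The function name `ndc_to_camera`, the pinhole intrinsics (`fx`, `fy`, `cx`, `cy`), the [-1, 1] range for x and y, and the linear mapping of normalized depth between `near` and `far` are all assumptions made for illustration; the paper may define its NDC space differently.

```python
import numpy as np

def ndc_to_camera(ndc_points, fx, fy, cx, cy, width, height, near, far):
    """Map points from an assumed NDC cube to camera-space coordinates.

    ndc_points: (N, 3) array. Columns are x and y in [-1, 1] (spanning the
    image) and a normalized depth d in [0, 1]. These ranges are assumptions.
    """
    ndc = np.asarray(ndc_points, dtype=np.float64)
    # NDC x, y -> pixel coordinates (u, v).
    u = (ndc[:, 0] + 1.0) * 0.5 * width
    v = (ndc[:, 1] + 1.0) * 0.5 * height
    # Normalized depth -> metric depth (linear mapping, an assumption).
    z = near + ndc[:, 2] * (far - near)
    # Pinhole back-projection: each pixel ray is scaled by its depth.
    x = (u - cx) / fx * z
    y = (v - cy) / fy * z
    return np.stack([x, y, z], axis=1)

# Hypothetical 640x480 camera; the image centre at mid depth lies on the optical axis.
points = ndc_to_camera([[0.0, 0.0, 0.5]], fx=500.0, fy=500.0,
                       cx=320.0, cy=240.0, width=640, height=480,
                       near=0.0, far=8.0)
```

Under these assumptions, a regular grid in the normalized cube follows the camera rays, so cells of equal size in NDC cover more metric volume at larger depths.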

Results

Task                            Dataset         Metric   Value   Model
Reconstruction                  NYUv2           mIoU     29.03   NDC-Scene
Reconstruction                  SemanticKITTI   mIoU     12.58   NDC-Scene
3D Reconstruction               NYUv2           mIoU     29.03   NDC-Scene
3D Reconstruction               SemanticKITTI   mIoU     12.58   NDC-Scene
3D                              NYUv2           mIoU     29.03   NDC-Scene
3D                              SemanticKITTI   mIoU     12.58   NDC-Scene
3D Semantic Scene Completion    NYUv2           mIoU     29.03   NDC-Scene
3D Semantic Scene Completion    SemanticKITTI   mIoU     12.58   NDC-Scene
3D Scene Reconstruction         NYUv2           mIoU     29.03   NDC-Scene
3D Scene Reconstruction         SemanticKITTI   mIoU     12.58   NDC-Scene
Single-View 3D Reconstruction   NYUv2           mIoU     29.03   NDC-Scene
Single-View 3D Reconstruction   SemanticKITTI   mIoU     12.58   NDC-Scene
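
For readers unfamiliar with the mIoU metric reported above, the following is a minimal sketch of mean intersection-over-union on labeled voxels. The function name `mean_iou` and the `ignore_label` value of 255 are assumptions for illustration. The official benchmark protocols (which classes are averaged, which voxels are evaluated) are not reproduced here and may differ.

```python
import numpy as np

def mean_iou(pred, target, num_classes, ignore_label=255):
    """Average per-class IoU over classes that occur in pred or target.

    pred, target: integer label arrays of the same shape.
    ignore_label: target label excluded from evaluation (assumed value).
    """
    pred = np.asarray(pred).ravel()
    target = np.asarray(target).ravel()
    # Drop voxels whose ground-truth label is marked as ignored.
    valid = target != ignore_label
    pred, target = pred[valid], target[valid]
    ious = []
    for c in range(num_classes):
        intersection = np.sum((pred == c) & (target == c))
        union = np.sum((pred == c) | (target == c))
        if union > 0:  # skip classes absent from both arrays
            ious.append(intersection / union)
    return float(np.mean(ious)) if ious else float("nan")

# Toy example: per-class IoUs are 1.0, 0.5 and 0.5.
score = mean_iou([0, 1, 1, 2], [0, 1, 2, 2], num_classes=3)
```

The function returns a fraction in [0, 1]; the table values appear to be percentages, so a returned score would be multiplied by 100 before comparison.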

Related Papers

Disentangling Instance and Scene Contexts for 3D Semantic Scene Completion (2025-07-11)
Camera-Only 3D Panoptic Scene Completion for Autonomous Driving through Differentiable Object Shapes (2025-05-14)
SGFormer: Satellite-Ground Fusion for 3D Semantic Scene Completion (2025-03-21)
VLScene: Vision-Language Guidance Distillation for Camera-Based 3D Semantic Scene Completion (2025-03-08)
Vision-based 3D Semantic Scene Completion via Capture Dynamic Representations (2025-03-08)
Learning Temporal 3D Semantic Scene Completion via Optical Flow Guidance (2025-02-20)
Skip Mamba Diffusion for Monocular 3D Semantic Scene Completion (2025-01-13)
SOAP: Vision-Centric 3D Semantic Scene Completion with Scene-Adaptive Decoder and Occluded Region-Aware View Projection (2025-01-01)