
Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Two Stream 3D Semantic Scene Completion

Martin Garbade, Yueh-Tung Chen, Johann Sawatzky, Juergen Gall

Published: 2018-04-10
Tasks: 3D geometry, Vocal Bursts Valence Prediction, 3D Semantic Scene Completion

Abstract

Inferring the 3D geometry and the semantic meaning of surfaces, which are occluded, is a very challenging task. Recently, a first end-to-end learning approach has been proposed that completes a scene from a single depth image. The approach voxelizes the scene and predicts for each voxel if it is occupied and, if it is occupied, the semantic class label. In this work, we propose a two stream approach that leverages depth information and semantic information, which is inferred from the RGB image, for this task. The approach constructs an incomplete 3D semantic tensor, which uses a compact three-channel encoding for the inferred semantic information, and uses a 3D CNN to infer the complete 3D semantic tensor. In our experimental evaluation, we show that the proposed two stream approach substantially outperforms the state-of-the-art for semantic scene completion.
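
The abstract describes lifting semantic information inferred from the RGB image into an incomplete 3D volume by way of the depth image. As a rough illustration of that idea only, the sketch below back-projects each valid depth pixel through an assumed pinhole camera model and stores the majority 2D label per voxel. The function name, the intrinsics `fx`, `fy`, `cx`, `cy`, the `voxel_size` parameter, and the sparse dictionary layout are all assumptions made for this example. It stores one class id per voxel and does not reproduce the paper's three-channel encoding, which the abstract does not specify.

```python
import math


def backproject_to_voxels(depth, labels, fx, fy, cx, cy, voxel_size):
    """Build a sparse, incomplete semantic volume from a depth map and 2D labels.

    depth  : 2D list of depths in metres; values <= 0 mean "no measurement"
    labels : 2D list of integer class ids with the same shape as depth
    Returns a dict {(ix, iy, iz): class_id} covering observed surfaces only.
    Hypothetical helper for illustration, not the paper's implementation.
    """
    votes = {}
    for v, row in enumerate(depth):
        for u, z in enumerate(row):
            if z <= 0:
                continue  # skip pixels without a depth reading
            # Pinhole back-projection into camera coordinates.
            x = (u - cx) * z / fx
            y = (v - cy) * z / fy
            key = (
                math.floor(x / voxel_size),
                math.floor(y / voxel_size),
                math.floor(z / voxel_size),
            )
            counts = votes.setdefault(key, {})
            label = labels[v][u]
            counts[label] = counts.get(label, 0) + 1
    # Several pixels can land in one voxel; keep the most frequent label.
    return {key: max(counts, key=counts.get) for key, counts in votes.items()}


depth = [[1.0, 0.0], [1.0, 2.0]]
labels = [[3, 5], [3, 7]]
volume = backproject_to_voxels(depth, labels, fx=1.0, fy=1.0, cx=0.0, cy=0.0, voxel_size=0.5)
```

Only voxels on observed surfaces receive a label here. Predicting the occluded voxels is the completion step that the abstract assigns to the 3D CNN, which this sketch leaves out.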

Results

| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| 3D Reconstruction | NYUv2 | mIoU | 34.1 | TS3D |
| 3D Reconstruction | SemanticKITTI | mIoU | 17.7 | TS3D+DNet+SATNet (Reported in SemanticKITTI dataset paper) |
| 3D Reconstruction | SemanticKITTI | mIoU | 10.2 | TS3D+DNet (Reported in SemanticKITTI dataset paper) |
| 3D Reconstruction | SemanticKITTI | mIoU | 9.5 | TS3D (Reported in SemanticKITTI dataset paper) |
| 3D | NYUv2 | mIoU | 34.1 | TS3D |
| 3D | SemanticKITTI | mIoU | 17.7 | TS3D+DNet+SATNet (Reported in SemanticKITTI dataset paper) |
| 3D | SemanticKITTI | mIoU | 10.2 | TS3D+DNet (Reported in SemanticKITTI dataset paper) |
| 3D | SemanticKITTI | mIoU | 9.5 | TS3D (Reported in SemanticKITTI dataset paper) |
| 3D Semantic Scene Completion | NYUv2 | mIoU | 34.1 | TS3D |
| 3D Semantic Scene Completion | SemanticKITTI | mIoU | 17.7 | TS3D+DNet+SATNet (Reported in SemanticKITTI dataset paper) |
| 3D Semantic Scene Completion | SemanticKITTI | mIoU | 10.2 | TS3D+DNet (Reported in SemanticKITTI dataset paper) |
| 3D Semantic Scene Completion | SemanticKITTI | mIoU | 9.5 | TS3D (Reported in SemanticKITTI dataset paper) |
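
The metric in these results is mIoU (mean intersection over union). A minimal sketch of how such a score can be computed over flattened per-voxel labels follows. The function name, the `ignore_label` parameter, and the choice to average only over classes that actually occur are assumptions for this example; individual benchmarks define their own class lists and masking rules, and the values listed here appear to be percentages rather than fractions.

```python
def mean_iou(pred, target, num_classes, ignore_label=None):
    """Mean intersection over union across classes present in pred or target.

    pred, target : equal-length sequences of integer class ids (one per voxel)
    ignore_label : target value to skip (e.g. unknown voxels), if any
    Illustrative helper only; not taken from any benchmark's evaluation code.
    """
    intersection = [0] * num_classes
    union = [0] * num_classes
    for p, t in zip(pred, target):
        if t == ignore_label:
            continue
        if p == t:
            intersection[p] += 1
            union[p] += 1
        else:
            # A mismatch enlarges the union of both classes involved.
            union[p] += 1
            union[t] += 1
    ious = [i / u for i, u in zip(intersection, union) if u > 0]
    return sum(ious) / len(ious) if ious else float("nan")


score = mean_iou(pred=[0, 1, 1, 2], target=[0, 1, 2, 2], num_classes=3)
```

In this toy case the per-class IoUs are 1, 0.5, and 0.5, so the mean is 2/3; multiplied by 100 it would be reported on the same scale as the figures above.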

Related Papers

Robust 3D-Masked Part-level Editing in 3D Gaussian Splatting with Regularized Score Distillation Sampling (2025-07-15)
TRAN-D: 2D Gaussian Splatting-based Sparse-view Transparent Object Depth Reconstruction via Physics Simulation for Scene Update (2025-07-15)
Disentangling Instance and Scene Contexts for 3D Semantic Scene Completion (2025-07-11)
Feed-Forward SceneDINO for Unsupervised Semantic Scene Completion (2025-07-08)
DreamGrasp: Zero-Shot 3D Multi-Object Reconstruction from Partial-View Images for Robotic Manipulation (2025-07-08)
RoboScape: Physics-informed Embodied World Model (2025-06-29)
DBMovi-GS: Dynamic View Synthesis from Blurry Monocular Video via Sparse-Controlled Gaussian Splatting (2025-06-26)
PanSt3R: Multi-view Consistent Panoptic Segmentation (2025-06-26)