TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Symphonize 3D Semantic Scene Completion with Contextual In...

Symphonize 3D Semantic Scene Completion with Contextual Instance Queries

Haoyi Jiang, Tianheng Cheng, Naiyu Gao, Haoyang Zhang, Tianwei Lin, Wenyu Liu, Xinggang Wang

2023-06-27CVPR 2024 1Autonomous Driving3D Reconstruction3D Semantic Scene Completion from a single RGB image3D Semantic Scene Completion
PaperPDFCode(official)

Abstract

`3D Semantic Scene Completion (SSC) has emerged as a nascent and pivotal undertaking in autonomous driving, aiming to predict voxel occupancy within volumetric scenes. However, prevailing methodologies primarily focus on voxel-wise feature aggregation, while neglecting instance semantics and scene context. In this paper, we present a novel paradigm termed Symphonies (Scene-from-Insts), that delves into the integration of instance queries to orchestrate 2D-to-3D reconstruction and 3D scene modeling. Leveraging our proposed Serial Instance-Propagated Attentions, Symphonies dynamically encodes instance-centric semantics, facilitating intricate interactions between image-based and volumetric domains. Simultaneously, Symphonies enables holistic scene comprehension by capturing context through the efficient fusion of instance queries, alleviating geometric ambiguity such as occlusion and perspective errors through contextual scene reasoning. Experimental results demonstrate that Symphonies achieves state-of-the-art performance on challenging benchmarks SemanticKITTI and SSCBench-KITTI-360, yielding remarkable mIoU scores of 15.04 and 18.58, respectively. These results showcase the paradigm's promising advancements. The code is available at https://github.com/hustvl/Symphonies.

Results

TaskDatasetMetricValueModel
ReconstructionKITTI-360mIoU18.58Symphonies
ReconstructionSemanticKITTImIoU15.04Symphonies
3D ReconstructionSemanticKITTImIoU15.04Symphonies (RGB input only)
3D ReconstructionKITTI-360mIoU18.58Symphonies
3D ReconstructionKITTI-360mIoU18.58Symphonies
3D ReconstructionSemanticKITTImIoU15.04Symphonies
3DSemanticKITTImIoU15.04Symphonies (RGB input only)
3DKITTI-360mIoU18.58Symphonies
3DKITTI-360mIoU18.58Symphonies
3DSemanticKITTImIoU15.04Symphonies
3D Semantic Scene CompletionSemanticKITTImIoU15.04Symphonies (RGB input only)
3D Semantic Scene CompletionKITTI-360mIoU18.58Symphonies
3D Semantic Scene CompletionKITTI-360mIoU18.58Symphonies
3D Semantic Scene CompletionSemanticKITTImIoU15.04Symphonies
3D Scene ReconstructionKITTI-360mIoU18.58Symphonies
3D Scene ReconstructionSemanticKITTImIoU15.04Symphonies
Single-View 3D ReconstructionKITTI-360mIoU18.58Symphonies
Single-View 3D ReconstructionSemanticKITTImIoU15.04Symphonies

Related Papers

GEMINUS: Dual-aware Global and Scene-Adaptive Mixture-of-Experts for End-to-End Autonomous Driving2025-07-19AGENTS-LLM: Augmentative GENeration of Challenging Traffic Scenarios with an Agentic LLM Framework2025-07-18World Model-Based End-to-End Scene Generation for Accident Anticipation in Autonomous Driving2025-07-17Orbis: Overcoming Challenges of Long-Horizon Prediction in Driving World Models2025-07-17Channel-wise Motion Features for Efficient Motion Segmentation2025-07-17LaViPlan : Language-Guided Visual Path Planning with RLVR2025-07-17AutoPartGen: Autogressive 3D Part Generation and Discovery2025-07-17Safeguarding Federated Learning-based Road Condition Classification2025-07-16