TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Hierarchical Temporal Context Learning for Camera-based Se...

Hierarchical Temporal Context Learning for Camera-based Semantic Scene Completion

Bohan Li, Jiajun Deng, Wenyao Zhang, Zhujin Liang, Dalong Du, Xin Jin, Wenjun Zeng

2024-07-023D Semantic Scene Completion
PaperPDFCode(official)

Abstract

Camera-based 3D semantic scene completion (SSC) is pivotal for predicting complicated 3D layouts with limited 2D image observations. The existing mainstream solutions generally leverage temporal information by roughly stacking history frames to supplement the current frame, such straightforward temporal modeling inevitably diminishes valid clues and increases learning difficulty. To address this problem, we present HTCL, a novel Hierarchical Temporal Context Learning paradigm for improving camera-based semantic scene completion. The primary innovation of this work involves decomposing temporal context learning into two hierarchical steps: (a) cross-frame affinity measurement and (b) affinity-based dynamic refinement. Firstly, to separate critical relevant context from redundant information, we introduce the pattern affinity with scale-aware isolation and multiple independent learners for fine-grained contextual correspondence modeling. Subsequently, to dynamically compensate for incomplete observations, we adaptively refine the feature sampling locations based on initially identified locations with high affinity and their neighboring relevant regions. Our method ranks $1^{st}$ on the SemanticKITTI benchmark and even surpasses LiDAR-based methods in terms of mIoU on the OpenOccupancy benchmark. Our code is available on https://github.com/Arlo0o/HTCL.

Results

TaskDatasetMetricValueModel
3D ReconstructionSemanticKITTImIoU17.09HTCL-S
3DSemanticKITTImIoU17.09HTCL-S
3D Semantic Scene CompletionSemanticKITTImIoU17.09HTCL-S

Related Papers

Disentangling Instance and Scene Contexts for 3D Semantic Scene Completion2025-07-11Camera-Only 3D Panoptic Scene Completion for Autonomous Driving through Differentiable Object Shapes2025-05-14SGFormer: Satellite-Ground Fusion for 3D Semantic Scene Completion2025-03-21VLScene: Vision-Language Guidance Distillation for Camera-Based 3D Semantic Scene Completion2025-03-08Vision-based 3D Semantic Scene Completion via Capture Dynamic Representations2025-03-08Learning Temporal 3D Semantic Scene Completion via Optical Flow Guidance2025-02-20Skip Mamba Diffusion for Monocular 3D Semantic Scene Completion2025-01-13SOAP: Vision-Centric 3D Semantic Scene Completion with Scene-Adaptive Decoder and Occluded Region-Aware View Projection2025-01-01