TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Holistic 3D Scene Understanding from a Single Image with I...

Holistic 3D Scene Understanding from a Single Image with Implicit Representation

Cheng Zhang, Zhaopeng Cui, yinda zhang, Bing Zeng, Marc Pollefeys, Shuaicheng Liu

2021-03-11CVPR 2021 13D Shape ReconstructionMonocular 3D Object DetectionRoom Layout EstimationScene Understandingobject-detection3D Object DetectionObject Detection
PaperPDFCode(official)

Abstract

We present a new pipeline for holistic 3D scene understanding from a single image, which could predict object shapes, object poses, and scene layout. As it is a highly ill-posed problem, existing methods usually suffer from inaccurate estimation of both shapes and layout especially for the cluttered scene due to the heavy occlusion between objects. We propose to utilize the latest deep implicit representation to solve this challenge. We not only propose an image-based local structured implicit network to improve the object shape estimation, but also refine the 3D object pose and scene layout via a novel implicit scene graph neural network that exploits the implicit local object features. A novel physical violation loss is also proposed to avoid incorrect context between objects. Extensive experiments demonstrate that our method outperforms the state-of-the-art methods in terms of object shape, scene layout estimation, and 3D object detection.

Results

TaskDatasetMetricValueModel
Object DetectionSUN RGB-DAP@0.15 (10 / NYU-37)45.21IM3D
Object DetectionSUN RGB-DAP@0.15 (NYU-37)24.1IM3D
3DSUN RGB-DAP@0.15 (10 / NYU-37)45.21IM3D
3DSUN RGB-DAP@0.15 (NYU-37)24.1IM3D
3DPix3DCD0.0672IM3D
3D Object DetectionSUN RGB-DAP@0.15 (10 / NYU-37)45.21IM3D
3D Object DetectionSUN RGB-DAP@0.15 (NYU-37)24.1IM3D
3D Shape ReconstructionPix3DCD0.0672IM3D
2D ClassificationSUN RGB-DAP@0.15 (10 / NYU-37)45.21IM3D
2D ClassificationSUN RGB-DAP@0.15 (NYU-37)24.1IM3D
2D Object DetectionSUN RGB-DAP@0.15 (10 / NYU-37)45.21IM3D
2D Object DetectionSUN RGB-DAP@0.15 (NYU-37)24.1IM3D
16kSUN RGB-DAP@0.15 (10 / NYU-37)45.21IM3D
16kSUN RGB-DAP@0.15 (NYU-37)24.1IM3D
Room Layout EstimationSUN RGB-DCamera Pitch2.98IM3D
Room Layout EstimationSUN RGB-DCamera Roll2.11IM3D
Room Layout EstimationSUN RGB-DIoU64.4IM3D

Related Papers

Advancing Complex Wide-Area Scene Understanding with Hierarchical Coresets Selection2025-07-17Argus: Leveraging Multiview Images for Improved 3-D Scene Understanding With Large Language Models2025-07-17City-VLM: Towards Multidomain Perception Scene Understanding via Multimodal Incomplete Learning2025-07-17A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images2025-07-17Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection2025-07-17Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis2025-07-17Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios2025-07-16