TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Atlas: End-to-End 3D Scene Reconstruction from Posed Images

Atlas: End-to-End 3D Scene Reconstruction from Posed Images

Zak Murez, Tarrence van As, James Bartolozzi, Ayan Sinha, Vijay Badrinarayanan, Andrew Rabinovich

2020-03-23ECCV 2020 83D Scene ReconstructionSemantic Segmentation3D ReconstructionDepth Estimation3D Semantic Segmentation
PaperPDFCode

Abstract

We present an end-to-end 3D reconstruction method for a scene by directly regressing a truncated signed distance function (TSDF) from a set of posed RGB images. Traditional approaches to 3D reconstruction rely on an intermediate representation of depth maps prior to estimating a full 3D model of a scene. We hypothesize that a direct regression to 3D is more effective. A 2D CNN extracts features from each image independently which are then back-projected and accumulated into a voxel volume using the camera intrinsics and extrinsics. After accumulation, a 3D CNN refines the accumulated features and predicts the TSDF values. Additionally, semantic segmentation of the 3D model is obtained without significant computation. This approach is evaluated on the Scannet dataset where we significantly outperform state-of-the-art baselines (deep multiview stereo followed by traditional TSDF fusion) both quantitatively and qualitatively. We compare our 3D semantic segmentation to prior methods that use a depth sensor since no previous work attempts the problem with only RGB input.

Results

TaskDatasetMetricValueModel
Depth EstimationScanNetRMSE0.165Atlas (plain)
Depth EstimationScanNetRMSE0.174Atlas (finetuned)
Depth EstimationScanNetabsolute relative error0.089Atlas (finetuned)
3D ReconstructionScanNet3DIoU89.4Atlas (finetuned)
3D ReconstructionScanNetChamfer Distance37.2Atlas (finetuned)
3D ReconstructionScanNetL121.1Atlas (finetuned)
3DScanNetRMSE0.165Atlas (plain)
3DScanNetRMSE0.174Atlas (finetuned)
3DScanNetabsolute relative error0.089Atlas (finetuned)
3DScanNet3DIoU89.4Atlas (finetuned)
3DScanNetChamfer Distance37.2Atlas (finetuned)
3DScanNetL121.1Atlas (finetuned)

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique2025-07-17AutoPartGen: Autogressive 3D Part Generation and Discovery2025-07-17$S^2M^2$: Scalable Stereo Matching Model for Reliable Depth Estimation2025-07-17$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17