TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Pix2Vox++: Multi-scale Context-aware 3D Object Reconstruct...

Pix2Vox++: Multi-scale Context-aware 3D Object Reconstruction from Single and Multiple Images

Haozhe Xie, Hongxun Yao, Shengping Zhang, Shangchen Zhou, Wenxiu Sun

2020-06-22Object Reconstruction3D Object Reconstruction
PaperPDFCodeCode(official)Code

Abstract

Recovering the 3D shape of an object from single or multiple images with deep neural networks has been attracting increasing attention in the past few years. Mainstream works (e.g. 3D-R2N2) use recurrent neural networks (RNNs) to sequentially fuse feature maps of input images. However, RNN-based approaches are unable to produce consistent reconstruction results when given the same input images with different orders. Moreover, RNNs may forget important features from early input images due to long-term memory loss. To address these issues, we propose a novel framework for single-view and multi-view 3D object reconstruction, named Pix2Vox++. By using a well-designed encoder-decoder, it generates a coarse 3D volume from each input image. A multi-scale context-aware fusion module is then introduced to adaptively select high-quality reconstructions for different parts from all coarse 3D volumes to obtain a fused 3D volume. To further correct the wrongly recovered parts in the fused 3D volume, a refiner is adopted to generate the final output. Experimental results on the ShapeNet, Pix3D, and Things3D benchmarks show that Pix2Vox++ performs favorably against state-of-the-art methods in terms of both accuracy and efficiency.

Results

TaskDatasetMetricValueModel
Object ReconstructionData3D−R2N23DIoU0.67Pix2Vox++/A
Object ReconstructionData3D−R2N23DIoU0.645Pix2Vox++/F
3D Object ReconstructionData3D−R2N23DIoU0.67Pix2Vox++/A
3D Object ReconstructionData3D−R2N23DIoU0.645Pix2Vox++/F

Related Papers

DreamGrasp: Zero-Shot 3D Multi-Object Reconstruction from Partial-View Images for Robotic Manipulation2025-07-08PhysRig: Differentiable Physics-Based Skinning and Rigging Framework for Realistic Articulated Object Modeling2025-06-26Generalizable Articulated Object Reconstruction from Casually Captured RGBD Videos2025-06-10HuSc3D: Human Sculpture dataset for 3D object reconstruction2025-06-09Object-X: Learning to Reconstruct Multi-Modal 3D Object Representations2025-06-05SR3D: Unleashing Single-view 3D Reconstruction for Transparent and Specular Object Grasping2025-05-30Direct3D-S2: Gigascale 3D Generation Made Easy with Spatial Sparse Attention2025-05-23Low Resolution Next Best View for Robot Packing2025-05-07