TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/DenseFusion: 6D Object Pose Estimation by Iterative Dense ...

DenseFusion: 6D Object Pose Estimation by Iterative Dense Fusion

Chen Wang, Danfei Xu, Yuke Zhu, Roberto Martín-Martín, Cewu Lu, Li Fei-Fei, Silvio Savarese

2019-01-15CVPR 2019 6Pose Estimation6D Pose Estimation using RGBD3D Object Detection6D Pose Estimation
PaperPDFCodeCodeCodeCodeCodeCodeCodeCode

Abstract

A key technical challenge in performing 6D object pose estimation from RGB-D image is to fully leverage the two complementary data sources. Prior works either extract information from the RGB image and depth separately or use costly post-processing steps, limiting their performances in highly cluttered scenes and real-time applications. In this work, we present DenseFusion, a generic framework for estimating 6D pose of a set of known objects from RGB-D images. DenseFusion is a heterogeneous architecture that processes the two data sources individually and uses a novel dense fusion network to extract pixel-wise dense feature embedding, from which the pose is estimated. Furthermore, we integrate an end-to-end iterative pose refinement procedure that further improves the pose estimation while achieving near real-time inference. Our experiments show that our method outperforms state-of-the-art approaches in two datasets, YCB-Video and LineMOD. We also deploy our proposed method to a real robot to grasp and manipulate objects based on the estimated pose.

Results

TaskDatasetMetricValueModel
Pose EstimationYCB-VideoADDS AUC93.1DenseFusion
Pose EstimationLineMODAccuracy (ADD)94.3DenseFusion
Pose EstimationLineMODMean ADD94.3DeepFusion
Object DetectionDTTD-MobileADD AUC69.67DenseFusion
Object DetectionDTTD-MobileADD-S AUC85.88DenseFusion
3DDTTD-MobileADD AUC69.67DenseFusion
3DDTTD-MobileADD-S AUC85.88DenseFusion
3DYCB-VideoADDS AUC93.1DenseFusion
3DLineMODAccuracy (ADD)94.3DenseFusion
3DLineMODMean ADD94.3DeepFusion
3D Object DetectionDTTD-MobileADD AUC69.67DenseFusion
3D Object DetectionDTTD-MobileADD-S AUC85.88DenseFusion
6D Pose EstimationYCB-VideoADDS AUC93.1DenseFusion
6D Pose EstimationLineMODAccuracy (ADD)94.3DenseFusion
2D ClassificationDTTD-MobileADD AUC69.67DenseFusion
2D ClassificationDTTD-MobileADD-S AUC85.88DenseFusion
2D Object DetectionDTTD-MobileADD AUC69.67DenseFusion
2D Object DetectionDTTD-MobileADD-S AUC85.88DenseFusion
1 Image, 2*2 StitchiYCB-VideoADDS AUC93.1DenseFusion
1 Image, 2*2 StitchiLineMODAccuracy (ADD)94.3DenseFusion
1 Image, 2*2 StitchiLineMODMean ADD94.3DeepFusion
16kDTTD-MobileADD AUC69.67DenseFusion
16kDTTD-MobileADD-S AUC85.88DenseFusion

Related Papers

$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17Revisiting Reliability in the Reasoning-based Pose Estimation Benchmark2025-07-17DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model2025-07-17From Neck to Head: Bio-Impedance Sensing for Head Pose Estimation2025-07-17AthleticsPose: Authentic Sports Motion Dataset on Athletic Field and Evaluation of Monocular 3D Pose Estimation Ability2025-07-17Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis2025-07-17SpatialTrackerV2: 3D Point Tracking Made Easy2025-07-16SGLoc: Semantic Localization System for Camera Pose Estimation from 3D Gaussian Splatting Representation2025-07-16