TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Robust 6DoF Pose Estimation Against Depth Noise and a Comp...

Robust 6DoF Pose Estimation Against Depth Noise and a Comprehensive Evaluation on a Mobile Dataset

Zixun Huang, Keling Yao, Seth Z. Zhao, Chuanyu Pan, Chenfeng Xu, Kathy Zhuang, Tianjian Xu, Weiyu Feng, Allen Y. Yang

2023-09-243D Object TrackingPose EstimationObject Tracking6D Pose Estimation using RGBD3D Object Detection6D Pose Estimation
PaperPDFCode(official)Code(official)Code

Abstract

Robust 6DoF pose estimation with mobile devices is the foundation for applications in robotics, augmented reality, and digital twin localization. In this paper, we extensively investigate the robustness of existing RGBD-based 6DoF pose estimation methods against varying levels of depth sensor noise. We highlight that existing 6DoF pose estimation methods suffer significant performance discrepancies due to depth measurement inaccuracies. In response to the robustness issue, we present a simple and effective transformer-based 6DoF pose estimation approach called DTTDNet, featuring a novel geometric feature filtering module and a Chamfer distance loss for training. Moreover, we advance the field of robust 6DoF pose estimation and introduce a new dataset -- Digital Twin Tracking Dataset Mobile (DTTD-Mobile), tailored for digital twin object tracking with noisy depth data from the mobile RGBD sensor suite of the Apple iPhone 14 Pro. Extensive experiments demonstrate that DTTDNet significantly outperforms state-of-the-art methods at least 4.32, up to 60.74 points in ADD metrics on the DTTD-Mobile. More importantly, our approach exhibits superior robustness to varying levels of measurement noise, setting a new benchmark for the robustness to noise measurements. Code and dataset are made publicly available at: https://github.com/augcog/DTTD2

Results

TaskDatasetMetricValueModel
Pose EstimationYCB-VideoADDS AUC94.19DTTD-Net w/o refiner
Pose EstimationDTTD-MobileADD AUC73.99DTTDNet
Pose EstimationDTTD-MobileADD-S AUC88.1DTTDNet
Pose EstimationYCB-VideoADD-S (2cm)96.14DTTDNet
Pose EstimationYCB-VideoADD-S AUC94.19DTTDNet
Object DetectionDTTD-MobileADD AUC73.99DTTDNet
Object DetectionDTTD-MobileADD-S AUC88.1DTTDNet
3DDTTD-MobileADD AUC73.99DTTDNet
3DDTTD-MobileADD-S AUC88.1DTTDNet
3DYCB-VideoADDS AUC94.19DTTD-Net w/o refiner
3DDTTD-MobileADD AUC73.99DTTDNet
3DDTTD-MobileADD-S AUC88.1DTTDNet
3DYCB-VideoADD-S (2cm)96.14DTTDNet
3DYCB-VideoADD-S AUC94.19DTTDNet
3D Object DetectionDTTD-MobileADD AUC73.99DTTDNet
3D Object DetectionDTTD-MobileADD-S AUC88.1DTTDNet
6D Pose EstimationYCB-VideoADDS AUC94.19DTTD-Net w/o refiner
6D Pose EstimationDTTD-MobileADD AUC73.99DTTDNet
6D Pose EstimationDTTD-MobileADD-S AUC88.1DTTDNet
2D ClassificationDTTD-MobileADD AUC73.99DTTDNet
2D ClassificationDTTD-MobileADD-S AUC88.1DTTDNet
2D Object DetectionDTTD-MobileADD AUC73.99DTTDNet
2D Object DetectionDTTD-MobileADD-S AUC88.1DTTDNet
1 Image, 2*2 StitchiYCB-VideoADDS AUC94.19DTTD-Net w/o refiner
1 Image, 2*2 StitchiDTTD-MobileADD AUC73.99DTTDNet
1 Image, 2*2 StitchiDTTD-MobileADD-S AUC88.1DTTDNet
1 Image, 2*2 StitchiYCB-VideoADD-S (2cm)96.14DTTDNet
1 Image, 2*2 StitchiYCB-VideoADD-S AUC94.19DTTDNet
16kDTTD-MobileADD AUC73.99DTTDNet
16kDTTD-MobileADD-S AUC88.1DTTDNet

Related Papers

$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17Revisiting Reliability in the Reasoning-based Pose Estimation Benchmark2025-07-17DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model2025-07-17From Neck to Head: Bio-Impedance Sensing for Head Pose Estimation2025-07-17AthleticsPose: Authentic Sports Motion Dataset on Athletic Field and Evaluation of Monocular 3D Pose Estimation Ability2025-07-17MVA 2025 Small Multi-Object Tracking for Spotting Birds Challenge: Dataset, Methods, and Results2025-07-17Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis2025-07-17SpatialTrackerV2: 3D Point Tracking Made Easy2025-07-16