Robust 6DoF Pose Estimation Against Depth Noise and a Comprehensive Evaluation on a Mobile Dataset

Zixun Huang, Keling Yao, Seth Z. Zhao, Chuanyu Pan, Chenfeng Xu, Kathy Zhuang, Tianjian Xu, Weiyu Feng, Allen Y. Yang

2023-09-24

3D Object Tracking, Pose Estimation, Object Tracking, 6D Pose Estimation using RGBD, 3D Object Detection, 6D Pose Estimation


Abstract

Robust 6DoF pose estimation with mobile devices is the foundation for applications in robotics, augmented reality, and digital twin localization. In this paper, we extensively investigate the robustness of existing RGBD-based 6DoF pose estimation methods against varying levels of depth sensor noise. We highlight that existing 6DoF pose estimation methods suffer significant performance discrepancies due to depth measurement inaccuracies. In response to this robustness issue, we present a simple and effective transformer-based 6DoF pose estimation approach called DTTDNet, featuring a novel geometric feature filtering module and a Chamfer distance loss for training. Moreover, we advance the field of robust 6DoF pose estimation by introducing a new dataset, Digital Twin Tracking Dataset Mobile (DTTD-Mobile), tailored for digital twin object tracking with noisy depth data from the mobile RGBD sensor suite of the Apple iPhone 14 Pro. Extensive experiments demonstrate that DTTDNet significantly outperforms state-of-the-art methods, by at least 4.32 and up to 60.74 points in ADD metrics on DTTD-Mobile. More importantly, our approach exhibits superior robustness to varying levels of measurement noise, setting a new benchmark for robustness to noisy measurements. Code and dataset are publicly available at: https://github.com/augcog/DTTD2
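
The abstract names a Chamfer distance loss but does not give its formula. As a rough illustration only, the sketch below computes one common symmetric form of the Chamfer distance between two small 3D point sets: the mean nearest-neighbor distance from each set to the other, summed over both directions. The function name `chamfer_distance`, the use of unsquared Euclidean distance, and the brute-force search are assumptions made for this example; the exact loss used to train DTTDNet may differ.

```python
import math


def chamfer_distance(points_a, points_b):
    """Symmetric Chamfer distance between two non-empty 3D point sets.

    Illustrative definition (an assumption, not taken from the paper):
    mean nearest-neighbor Euclidean distance from A to B plus the same
    quantity from B to A. Brute force, O(len(A) * len(B)).
    """
    def one_way(src, dst):
        # For every source point, find the distance to its closest
        # destination point, then average over the source set.
        return sum(min(math.dist(p, q) for q in dst) for p in src) / len(src)

    return one_way(points_a, points_b) + one_way(points_b, points_a)


if __name__ == "__main__":
    predicted = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
    reference = [(0.0, 0.0, 0.1), (1.0, 0.0, 0.1)]
    print(chamfer_distance(predicted, reference))
```

Because each point is matched to its nearest neighbor rather than to a fixed counterpart, this kind of distance does not need point-to-point correspondences between the two sets.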

Results

Task	Dataset	Metric	Value	Model
Pose Estimation	YCB-Video	ADD-S AUC	94.19	DTTD-Net w/o refiner
Pose Estimation	DTTD-Mobile	ADD AUC	73.99	DTTDNet
Pose Estimation	DTTD-Mobile	ADD-S AUC	88.1	DTTDNet
Pose Estimation	YCB-Video	ADD-S (2cm)	96.14	DTTDNet
Pose Estimation	YCB-Video	ADD-S AUC	94.19	DTTDNet
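
For readers unfamiliar with the metrics in the table, the following is a minimal sketch of the per-frame ADD and ADD-S distances as they are commonly defined in the 6D pose literature. ADD averages the distance between corresponding model points under the ground-truth and predicted poses; ADD-S instead matches each ground-truth point to its closest predicted point, which makes it tolerant of object symmetries. The helper names (`transform`, `add_metric`, `add_s_metric`) are hypothetical, and the AUC and 2 cm threshold aggregation reported above is not reproduced here; the evaluation code in the authors' repository may differ in detail.

```python
import math


def transform(points, rotation, translation):
    """Apply a rigid transform x -> R x + t to a list of 3D points."""
    return [
        tuple(
            sum(rotation[i][j] * p[j] for j in range(3)) + translation[i]
            for i in range(3)
        )
        for p in points
    ]


def add_metric(points, r_gt, t_gt, r_pred, t_pred):
    """ADD: mean distance between corresponding transformed model points."""
    gt = transform(points, r_gt, t_gt)
    pred = transform(points, r_pred, t_pred)
    return sum(math.dist(g, p) for g, p in zip(gt, pred)) / len(points)


def add_s_metric(points, r_gt, t_gt, r_pred, t_pred):
    """ADD-S: mean distance from each ground-truth point to its closest predicted point."""
    gt = transform(points, r_gt, t_gt)
    pred = transform(points, r_pred, t_pred)
    return sum(min(math.dist(g, p) for p in pred) for g in gt) / len(points)


if __name__ == "__main__":
    identity = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
    # 180-degree rotation about the z-axis.
    flip_z = [[-1.0, 0.0, 0.0], [0.0, -1.0, 0.0], [0.0, 0.0, 1.0]]
    zero = (0.0, 0.0, 0.0)
    # A two-point model that is symmetric under the flip.
    model = [(1.0, 0.0, 0.0), (-1.0, 0.0, 0.0)]
    print(add_metric(model, identity, zero, flip_z, zero))    # 2.0
    print(add_s_metric(model, identity, zero, flip_z, zero))  # 0.0
```

The toy model is unchanged by the 180-degree flip, so ADD penalizes the prediction by the full 2.0 units while ADD-S reports zero error. This is why ADD-S scores in the table are higher than the ADD scores on the same dataset.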
