Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.


Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.


Occlusion-Robust Object Pose Estimation with Holistic Representation

Bo Chen, Tat-Jun Chin, Marius Klimavicius

2021-10-22 · Representation Learning · Pose Estimation · 6D Pose Estimation using RGB

Paper · PDF · Code (official)

Abstract

Practical object pose estimation demands robustness against occlusions of the target object. State-of-the-art (SOTA) object pose estimators take a two-stage approach, where the first stage predicts 2D landmarks using a deep network and the second stage solves for the 6DOF pose from 2D-3D correspondences. Although widely adopted, such two-stage approaches can suffer from poor generalisation to novel occlusions and weak landmark coherence due to disrupted features. To address these issues, we develop a novel occlude-and-blackout batch augmentation technique to learn occlusion-robust deep features, and a multi-precision supervision architecture to encourage holistic pose representation learning for accurate and coherent landmark predictions. We perform careful ablation tests to verify the impact of our innovations and compare our method to SOTA pose estimators. Without the need for any post-processing or refinement, our method exhibits superior performance on the LINEMOD dataset. On the YCB-Video dataset our method outperforms all non-refinement methods in terms of the ADD(-S) metric. We also demonstrate the high data-efficiency of our method. Our code is available at http://github.com/BoChenYS/ROPE
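The abstract's occlude-and-blackout batch augmentation trains the landmark network on images where part of the object is hidden, forcing it to rely on holistic context. A minimal, framework-free sketch of this style of augmentation, assuming a 2D image stored as nested lists and a random rectangular blackout region (the helper name `occlude_and_blackout` and its parameters are illustrative, not the paper's implementation):

```python
import random

def occlude_and_blackout(image, max_frac=0.5, rng=None):
    """Return a copy of a 2D image with a random rectangle zeroed out.

    A toy stand-in for occlusion-style augmentation: the network never
    sees the blacked-out pixels, so it must infer landmarks from the
    surrounding context.
    """
    rng = rng or random.Random(0)
    h, w = len(image), len(image[0])
    # Pick the occluder size, capped at max_frac of each dimension.
    rh = rng.randint(1, max(1, int(h * max_frac)))
    rw = rng.randint(1, max(1, int(w * max_frac)))
    # Pick where the occluder's top-left corner lands.
    top = rng.randint(0, h - rh)
    left = rng.randint(0, w - rw)
    # Copy row-by-row so the input image is left untouched.
    out = [row[:] for row in image]
    for r in range(top, top + rh):
        for c in range(left, left + rw):
            out[r][c] = 0
    return out
```

In practice such an augmentation would be applied per sample inside the training batch; torchvision's `RandomErasing` transform implements a comparable idea for tensor images.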

Results

Task            | Dataset           | Metric   | Value | Model
Pose Estimation | YCB-Video         | Mean ADD | 66.59 | ROPE
Pose Estimation | YCB-Video         | Mean AUC | 79.88 | ROPE
Pose Estimation | LineMOD           | Mean ADD | 95.61 | ROPE
Pose Estimation | Occlusion LineMOD | Mean ADD | 45.95 | ROPE

Related Papers

Touch in the Wild: Learning Fine-Grained Manipulation with a Portable Visuo-Tactile Gripper (2025-07-20)
Spectral Bellman Method: Unifying Representation and Exploration in RL (2025-07-17)
Boosting Team Modeling through Tempo-Relational Representation Learning (2025-07-17)
$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning (2025-07-17)
Revisiting Reliability in the Reasoning-based Pose Estimation Benchmark (2025-07-17)
DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model (2025-07-17)
From Neck to Head: Bio-Impedance Sensing for Head Pose Estimation (2025-07-17)
AthleticsPose: Authentic Sports Motion Dataset on Athletic Field and Evaluation of Monocular 3D Pose Estimation Ability (2025-07-17)