GDRNPP: A Geometry-guided and Fully Learning-based Object Pose Estimator

Xingyu Liu, Ruida Zhang, Chenyangguang Zhang, Gu Wang, Jiwen Tang, Zhigang Li, Xiangyang Ji

2021-02-24 | CVPR 2021 | regression, Pose Estimation, 6D Pose Estimation using RGB, 6D Pose Estimation

Abstract

6D pose estimation of rigid objects is a long-standing and challenging task in computer vision. Recently, the emergence of deep learning reveals the potential of Convolutional Neural Networks (CNNs) to predict reliable 6D poses. Given that direct pose regression networks currently exhibit suboptimal performance, most methods still resort to traditional techniques to varying degrees. For example, top-performing methods often adopt an indirect strategy by first establishing 2D-3D or 3D-3D correspondences followed by applying the RANSAC-based PnP or Kabsch algorithms, and further employing ICP for refinement. Despite the performance enhancement, the integration of traditional techniques makes the networks time-consuming and not end-to-end trainable. Orthogonal to them, this paper introduces a fully learning-based object pose estimator. In this work, we first perform an in-depth investigation of both direct and indirect methods and propose a simple yet effective Geometry-guided Direct Regression Network (GDRN) to learn the 6D pose from monocular images in an end-to-end manner. Afterwards, we introduce a geometry-guided pose refinement module, enhancing pose accuracy when extra depth data is available. Guided by the predicted coordinate map, we build an end-to-end differentiable architecture that establishes robust and accurate 3D-3D correspondences between the observed and rendered RGB-D images to refine the pose. Our enhanced pose estimation pipeline GDRNPP (GDRN Plus Plus) conquered the leaderboard of the BOP Challenge for two consecutive years, becoming the first to surpass all prior methods that relied on traditional techniques in both accuracy and speed. The code and models are available at https://github.com/shanice-l/gdrnpp_bop2022.
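For readers unfamiliar with the indirect strategy mentioned in the abstract, the following is a minimal sketch of the Kabsch algorithm, which recovers a rigid transform from 3D-3D correspondences in closed form. It is a generic textbook version written with NumPy, not the code from the GDRNPP repository; the function names `kabsch` and `demo` and the synthetic test data are assumptions made for illustration only.

```python
import numpy as np


def kabsch(src, dst):
    """Least-squares rigid transform (R, t) such that dst ~= src @ R.T + t.

    src, dst: (N, 3) arrays of corresponding 3D points.
    """
    src = np.asarray(src, dtype=float)
    dst = np.asarray(dst, dtype=float)

    # Center both point sets on their centroids.
    src_c = src.mean(axis=0)
    dst_c = dst.mean(axis=0)

    # 3x3 cross-covariance between the centered sets.
    H = (src - src_c).T @ (dst - dst_c)

    # The SVD of H gives the optimal rotation; the diagonal correction
    # prevents the result from being a reflection (det = -1).
    U, _, Vt = np.linalg.svd(H)
    d = np.sign(np.linalg.det(Vt.T @ U.T))
    R = Vt.T @ np.diag([1.0, 1.0, d]) @ U.T

    # Translation aligns the rotated source centroid with the target centroid.
    t = dst_c - R @ src_c
    return R, t


def demo():
    """Hypothetical check: recover a known pose from noise-free correspondences."""
    rng = np.random.default_rng(0)
    pts = rng.normal(size=(50, 3))
    angle = 0.3  # radians, rotation about the z axis
    R_true = np.array([
        [np.cos(angle), -np.sin(angle), 0.0],
        [np.sin(angle), np.cos(angle), 0.0],
        [0.0, 0.0, 1.0],
    ])
    t_true = np.array([0.1, -0.2, 0.5])
    observed = pts @ R_true.T + t_true
    R_est, t_est = kabsch(pts, observed)
    return R_true, t_true, R_est, t_est
```

In practice the correspondences are noisy and contain outliers, which is why pipelines of this kind typically wrap such a solver in RANSAC. That non-differentiable outer loop is one reason the abstract describes these pipelines as time-consuming and not end-to-end trainable.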

Results

Task	Dataset	Metric	Value	Model
Pose Estimation	Occlusion LineMOD	Mean ADD	56.1	GDR-Net
3D	Occlusion LineMOD	Mean ADD	56.1	GDR-Net
1 Image, 2*2 Stitchi	Occlusion LineMOD	Mean ADD	56.1	GDR-Net
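
The table reports Mean ADD without defining it. As a hedged illustration, the sketch below follows the commonly used definition of the ADD error: the average distance between the object's model points transformed by the estimated pose and by the ground-truth pose. A pose is usually counted as correct when that error falls below a fraction of the object diameter. The 10% threshold, the function names `add_error` and `is_correct`, and the assumption that the table's value follows this exact protocol are not stated in this listing.

```python
import numpy as np


def add_error(model_pts, R_est, t_est, R_gt, t_gt):
    """Average distance between model points under two poses (ADD error).

    model_pts: (N, 3) points sampled from the object model.
    R_*, t_*: 3x3 rotation matrices and 3-vectors for each pose.
    """
    pts = np.asarray(model_pts, dtype=float)
    est = pts @ np.asarray(R_est, dtype=float).T + np.asarray(t_est, dtype=float)
    gt = pts @ np.asarray(R_gt, dtype=float).T + np.asarray(t_gt, dtype=float)
    return float(np.linalg.norm(est - gt, axis=1).mean())


def is_correct(add, diameter, threshold=0.1):
    """Assumed convention: correct if ADD is below threshold * object diameter."""
    return add < threshold * diameter
```

Under this convention, a per-object score is the percentage of test images whose pose is counted as correct, and a mean score averages that percentage over all objects in the dataset.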
