TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Vote from the Center: 6 DoF Pose Estimation in RGB-D Image...

Vote from the Center: 6 DoF Pose Estimation in RGB-D Images by Radial Keypoint Voting

Yangzheng Wu, Mohsen Zand, Ali Etemad, Michael Greenspan

2021-04-066D Pose Estimation using RGBD
PaperPDFCode(official)

Abstract

We propose a novel keypoint voting scheme based on intersecting spheres, that is more accurate than existing schemes and allows for fewer, more disperse keypoints. The scheme is based upon the distance between points, which as a 1D quantity can be regressed more accurately than the 2D and 3D vector and offset quantities regressed in previous work, yielding more accurate keypoint localization. The scheme forms the basis of the proposed RCVPose method for 6 DoF pose estimation of 3D objects in RGB-D data, which is particularly effective at handling occlusions. A CNN is trained to estimate the distance between the 3D point corresponding to the depth mode of each RGB pixel, and a set of 3 disperse keypoints defined in the object frame. At inference, a sphere centered at each 3D point is generated, of radius equal to this estimated distance. The surfaces of these spheres vote to increment a 3D accumulator space, the peaks of which indicate keypoint locations. The proposed radial voting scheme is more accurate than previous vector or offset schemes, and is robust to disperse keypoints. Experiments demonstrate RCVPose to be highly accurate and competitive, achieving state-of-the-art results on the LINEMOD 99.7% and YCB-Video 97.2% datasets, notably scoring +4.9% higher 71.1% than previous methods on the challenging Occlusion LINEMOD dataset, and on average outperforming all other published results from the BOP benchmark for these 3 datasets. Our code is available at http://www.github.com/aaronwool/rcvpose.

Results

TaskDatasetMetricValueModel
Pose EstimationYCB-VideoADD(S) AUC96.6RCVPose
Pose EstimationYCB-VideoADD(S) AUC95.9RCVPose+ICP
Pose EstimationLineMODMean ADD99.7RCVPose+ICP
Pose EstimationLineMODMean ADD99.4RCVPose
Pose EstimationOccludedLINEMODMean ADD71.1RCVPose+ICP
Pose EstimationOccludedLINEMODMean ADD70.2RCVPose
3DYCB-VideoADD(S) AUC96.6RCVPose
3DYCB-VideoADD(S) AUC95.9RCVPose+ICP
3DLineMODMean ADD99.7RCVPose+ICP
3DLineMODMean ADD99.4RCVPose
3DOccludedLINEMODMean ADD71.1RCVPose+ICP
3DOccludedLINEMODMean ADD70.2RCVPose
1 Image, 2*2 StitchiYCB-VideoADD(S) AUC96.6RCVPose
1 Image, 2*2 StitchiYCB-VideoADD(S) AUC95.9RCVPose+ICP
1 Image, 2*2 StitchiLineMODMean ADD99.7RCVPose+ICP
1 Image, 2*2 StitchiLineMODMean ADD99.4RCVPose
1 Image, 2*2 StitchiOccludedLINEMODMean ADD71.1RCVPose+ICP
1 Image, 2*2 StitchiOccludedLINEMODMean ADD70.2RCVPose

Related Papers

Robust 6DoF Pose Estimation Against Depth Noise and a Comprehensive Evaluation on a Mobile Dataset2023-09-24Enhancing 6-DoF Object Pose Estimation through Multiple Modality Fusion: A Hybrid CNN Architecture with Cross-Layer and Cross-Modal Integration2023-09-06The Drunkard's Odometry: Estimating Camera Motion in Deforming Scenes2023-06-29GenPose: Generative Category-level Object Pose Estimation via Diffusion Models2023-06-18BundleSDF: Neural 6-DoF Tracking and 3D Reconstruction of Unknown Objects2023-03-24Fusing Visual Appearance and Geometry for Multi-modality 6DoF Object Tracking2023-02-22Generative Category-Level Shape and Pose Estimation with Semantic Primitives2022-10-03A Multi-body Tracking Framework -- From Rigid Objects to Kinematic Structures2022-08-02