HybridPose: 6D Object Pose Estimation under Hybrid Representations

Chen Song, Jiaru Song, Qi-Xing Huang

2020-01-07CVPR 2020 6regression Pose Estimation 6D Pose Estimation using RGB

Abstract

We introduce HybridPose, a novel 6D object pose estimation approach. HybridPose utilizes a hybrid intermediate representation to express different geometric information in the input image, including keypoints, edge vectors, and symmetry correspondences. Compared to a unitary representation, our hybrid representation allows pose regression to exploit more and diverse features when one type of predicted representation is inaccurate (e.g., because of occlusion). Different intermediate representations used by HybridPose can all be predicted by the same simple neural network, and outliers in predicted intermediate representations are filtered by a robust regression module. Compared to state-of-the-art pose estimation approaches, HybridPose is comparable in running time and accuracy. For example, on Occlusion Linemod dataset, our method achieves a prediction speed of 30 fps with a mean ADD(-S) accuracy of 47.5%, representing a state-of-the-art performance. The implementation of HybridPose is available at https://github.com/chensong1995/HybridPose.

Results

Task	Dataset	Metric	Value	Model
Pose Estimation	LineMOD	Mean ADD	91.3	HybridPose
Pose Estimation	Occlusion LineMOD	Mean ADD	47.5	HybridPose
3D	LineMOD	Mean ADD	91.3	HybridPose
3D	Occlusion LineMOD	Mean ADD	47.5	HybridPose
1 Image, 2*2 Stitchi	LineMOD	Mean ADD	91.3	HybridPose
1 Image, 2*2 Stitchi	Occlusion LineMOD	Mean ADD	47.5	HybridPose

Related Papers

Language Integration in Fine-Tuning Multimodal Large Language Models for Image-Based Regression2025-07-20 $π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17 Revisiting Reliability in the Reasoning-based Pose Estimation Benchmark2025-07-17 DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model2025-07-17 From Neck to Head: Bio-Impedance Sensing for Head Pose Estimation2025-07-17 AthleticsPose: Authentic Sports Motion Dataset on Athletic Field and Evaluation of Monocular 3D Pose Estimation Ability2025-07-17 Neural Network-Guided Symbolic Regression for Interpretable Descriptor Discovery in Perovskite Catalysts2025-07-16 Imbalanced Regression Pipeline Recommendation2025-07-16