# 6-PACK: Category-Level 6D Pose Tracker with Anchor-Based Keypoints

Chen Wang, Roberto Martín-Martín, Danfei Xu, Jun Lv, Cewu Lu, Li Fei-Fei, Silvio Savarese, Yuke Zhu
We present 6-PACK, a deep learning approach to category-level 6D object pose tracking on RGB-D data. Our method tracks novel object instances of known object categories, such as bowls, laptops, and mugs, in real time. 6-PACK learns to compactly represent an object by a handful of 3D keypoints, based on which the interframe motion of an object instance can be estimated through keypoint matching. These keypoints are learned end-to-end, without manual supervision, to be maximally effective for tracking. Our experiments show that our method substantially outperforms existing methods on the NOCS category-level 6D pose estimation benchmark and supports a physical robot in performing simple vision-based closed-loop manipulation tasks. Our code and video are available at https://sites.google.com/view/6packtracking.
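The interframe motion estimation described above reduces to a classical subproblem: given matched 3D keypoints from two consecutive frames, recover the rigid transform between them. A minimal sketch using the SVD-based Kabsch/Umeyama solution is shown below; this is an illustrative stand-in, not the paper's exact implementation, and the function name is ours.

```python
import numpy as np

def rigid_transform_from_keypoints(src, dst):
    """Least-squares rigid transform (R, t) mapping src onto dst.

    src, dst: (K, 3) arrays of matched 3D keypoints from two
    consecutive frames. Returns R (3x3) and t (3,) such that
    dst ≈ src @ R.T + t.
    """
    # Center both keypoint sets on their centroids.
    src_c = src.mean(axis=0)
    dst_c = dst.mean(axis=0)

    # 3x3 cross-covariance of the centered correspondences.
    H = (src - src_c).T @ (dst - dst_c)

    # SVD-based rotation estimate (Kabsch algorithm).
    U, _, Vt = np.linalg.svd(H)
    d = np.sign(np.linalg.det(Vt.T @ U.T))  # guard against reflections
    R = Vt.T @ np.diag([1.0, 1.0, d]) @ U.T

    # Translation follows from the rotated centroid offset.
    t = dst_c - R @ src_c
    return R, t
```

Because the keypoints are learned to be stable across frames, even a handful of correspondences (the paper's "handful of 3D keypoints") suffices for this closed-form solve at every tracking step.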
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| 6D Pose Tracking | REAL275 | Rotation error, Rerr (°) | 16 | 6-PACK |
| 6D Pose Tracking | REAL275 | Translation error, Terr (cm) | 3.5 | 6-PACK |
| 6D Pose Tracking | REAL275 | mAP, 3D IoU@25 (%) | 94.2 | 6-PACK |
| 6D Pose Tracking | REAL275 | mAP, 5° 5 cm (%) | 33.3 | 6-PACK |