TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/GigaPose: Fast and Robust Novel Object Pose Estimation via...

GigaPose: Fast and Robust Novel Object Pose Estimation via One Correspondence

Van Nguyen Nguyen, Thibault Groueix, Mathieu Salzmann, Vincent Lepetit

2023-11-23CVPR 2024 1Pose Estimation3D Reconstruction6D Pose Estimation
PaperPDFCode(official)

Abstract

We present GigaPose, a fast, robust, and accurate method for CAD-based novel object pose estimation in RGB images. GigaPose first leverages discriminative "templates", rendered images of the CAD models, to recover the out-of-plane rotation and then uses patch correspondences to estimate the four remaining parameters. Our approach samples templates in only a two-degrees-of-freedom space instead of the usual three and matches the input image to the templates using fast nearest-neighbor search in feature space, results in a speedup factor of 35x compared to the state of the art. Moreover, GigaPose is significantly more robust to segmentation errors. Our extensive evaluation on the seven core datasets of the BOP challenge demonstrates that it achieves state-of-the-art accuracy and can be seamlessly integrated with existing refinement methods. Additionally, we show the potential of GigaPose with 3D models predicted by recent work on 3D reconstruction from a single image, relaxing the need for CAD models and making 6D pose object estimation much more convenient. Our source code and trained models are publicly available at https://github.com/nv-nguyen/gigaPose

Results

TaskDatasetMetricValueModel
Pose EstimationDTTD-MobileAR CH12.16GigaPose
Pose EstimationDTTD-MobileAR CoU39.32GigaPose
Pose EstimationDTTD-MobileAR pCH74GigaPose
3DDTTD-MobileAR CH12.16GigaPose
3DDTTD-MobileAR CoU39.32GigaPose
3DDTTD-MobileAR pCH74GigaPose
6D Pose EstimationDTTD-MobileAR CH12.16GigaPose
6D Pose EstimationDTTD-MobileAR CoU39.32GigaPose
6D Pose EstimationDTTD-MobileAR pCH74GigaPose
1 Image, 2*2 StitchiDTTD-MobileAR CH12.16GigaPose
1 Image, 2*2 StitchiDTTD-MobileAR CoU39.32GigaPose
1 Image, 2*2 StitchiDTTD-MobileAR pCH74GigaPose

Related Papers

$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17Revisiting Reliability in the Reasoning-based Pose Estimation Benchmark2025-07-17DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model2025-07-17From Neck to Head: Bio-Impedance Sensing for Head Pose Estimation2025-07-17AthleticsPose: Authentic Sports Motion Dataset on Athletic Field and Evaluation of Monocular 3D Pose Estimation Ability2025-07-17AutoPartGen: Autogressive 3D Part Generation and Discovery2025-07-17SpatialTrackerV2: 3D Point Tracking Made Easy2025-07-16SGLoc: Semantic Localization System for Camera Pose Estimation from 3D Gaussian Splatting Representation2025-07-16