TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/EPro-PnP: Generalized End-to-End Probabilistic Perspective...

EPro-PnP: Generalized End-to-End Probabilistic Perspective-n-Points for Monocular Object Pose Estimation

Hansheng Chen, Wei Tian, Pichao Wang, Fan Wang, Lu Xiong, Hao Li

2023-03-22Pose Estimation6D Pose Estimation using RGBobject-detection3D Object DetectionObject Detection
PaperPDFCodeCode(official)

Abstract

Locating 3D objects from a single RGB image via Perspective-n-Point (PnP) is a long-standing problem in computer vision. Driven by end-to-end deep learning, recent studies suggest interpreting PnP as a differentiable layer, allowing for partial learning of 2D-3D point correspondences by backpropagating the gradients of pose loss. Yet, learning the entire correspondences from scratch is highly challenging, particularly for ambiguous pose solutions, where the globally optimal pose is theoretically non-differentiable w.r.t. the points. In this paper, we propose the EPro-PnP, a probabilistic PnP layer for general end-to-end pose estimation, which outputs a distribution of pose with differentiable probability density on the SE(3) manifold. The 2D-3D coordinates and corresponding weights are treated as intermediate variables learned by minimizing the KL divergence between the predicted and target pose distribution. The underlying principle generalizes previous approaches, and resembles the attention mechanism. EPro-PnP can enhance existing correspondence networks, closing the gap between PnP-based method and the task-specific leaders on the LineMOD 6DoF pose estimation benchmark. Furthermore, EPro-PnP helps to explore new possibilities of network design, as we demonstrate a novel deformable correspondence network with the state-of-the-art pose accuracy on the nuScenes 3D object detection benchmark. Our code is available at https://github.com/tjiiv-cprg/EPro-PnP-v2.

Results

TaskDatasetMetricValueModel
Pose EstimationLineMODMean ADD96.36EPro-PnP-6DoF v2
Object DetectionnuScenesNDS0.49EPro-PnP-Det v2
Object DetectionnuScenesmAAE0.123EPro-PnP-Det v2
Object DetectionnuScenesmAOE0.302EPro-PnP-Det v2
Object DetectionnuScenesmAP0.423EPro-PnP-Det v2
Object DetectionnuScenesmASE0.236EPro-PnP-Det v2
Object DetectionnuScenesmATE0.547EPro-PnP-Det v2
Object DetectionnuScenesmAVE1.071EPro-PnP-Det v2
Object DetectionnuScenesNDS0.49EPro-PnP-Det v2
3DnuScenesNDS0.49EPro-PnP-Det v2
3DnuScenesmAAE0.123EPro-PnP-Det v2
3DnuScenesmAOE0.302EPro-PnP-Det v2
3DnuScenesmAP0.423EPro-PnP-Det v2
3DnuScenesmASE0.236EPro-PnP-Det v2
3DnuScenesmATE0.547EPro-PnP-Det v2
3DnuScenesmAVE1.071EPro-PnP-Det v2
3DnuScenesNDS0.49EPro-PnP-Det v2
3DLineMODMean ADD96.36EPro-PnP-6DoF v2
3D Object DetectionnuScenesNDS0.49EPro-PnP-Det v2
3D Object DetectionnuScenesmAAE0.123EPro-PnP-Det v2
3D Object DetectionnuScenesmAOE0.302EPro-PnP-Det v2
3D Object DetectionnuScenesmAP0.423EPro-PnP-Det v2
3D Object DetectionnuScenesmASE0.236EPro-PnP-Det v2
3D Object DetectionnuScenesmATE0.547EPro-PnP-Det v2
3D Object DetectionnuScenesmAVE1.071EPro-PnP-Det v2
3D Object DetectionnuScenesNDS0.49EPro-PnP-Det v2
2D ClassificationnuScenesNDS0.49EPro-PnP-Det v2
2D ClassificationnuScenesmAAE0.123EPro-PnP-Det v2
2D ClassificationnuScenesmAOE0.302EPro-PnP-Det v2
2D ClassificationnuScenesmAP0.423EPro-PnP-Det v2
2D ClassificationnuScenesmASE0.236EPro-PnP-Det v2
2D ClassificationnuScenesmATE0.547EPro-PnP-Det v2
2D ClassificationnuScenesmAVE1.071EPro-PnP-Det v2
2D ClassificationnuScenesNDS0.49EPro-PnP-Det v2
2D Object DetectionnuScenesNDS0.49EPro-PnP-Det v2
2D Object DetectionnuScenesmAAE0.123EPro-PnP-Det v2
2D Object DetectionnuScenesmAOE0.302EPro-PnP-Det v2
2D Object DetectionnuScenesmAP0.423EPro-PnP-Det v2
2D Object DetectionnuScenesmASE0.236EPro-PnP-Det v2
2D Object DetectionnuScenesmATE0.547EPro-PnP-Det v2
2D Object DetectionnuScenesmAVE1.071EPro-PnP-Det v2
2D Object DetectionnuScenesNDS0.49EPro-PnP-Det v2
1 Image, 2*2 StitchiLineMODMean ADD96.36EPro-PnP-6DoF v2
16knuScenesNDS0.49EPro-PnP-Det v2
16knuScenesmAAE0.123EPro-PnP-Det v2
16knuScenesmAOE0.302EPro-PnP-Det v2
16knuScenesmAP0.423EPro-PnP-Det v2
16knuScenesmASE0.236EPro-PnP-Det v2
16knuScenesmATE0.547EPro-PnP-Det v2
16knuScenesmAVE1.071EPro-PnP-Det v2
16knuScenesNDS0.49EPro-PnP-Det v2

Related Papers

$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17Revisiting Reliability in the Reasoning-based Pose Estimation Benchmark2025-07-17DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model2025-07-17From Neck to Head: Bio-Impedance Sensing for Head Pose Estimation2025-07-17AthleticsPose: Authentic Sports Motion Dataset on Athletic Field and Evaluation of Monocular 3D Pose Estimation Ability2025-07-17A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images2025-07-17Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection2025-07-17