TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Permutation-Invariant Relational Network for Multi-person ...

Permutation-Invariant Relational Network for Multi-person 3D Pose Estimation

Nicolas Ugrinovic, Adria Ruiz, Antonio Agudo, Alberto Sanfeliu, Francesc Moreno-Noguer

2022-04-11Pose Estimation3D Multi-Person Pose Estimation (root-relative)3D Multi-Person Pose Estimation (absolute)3D Pose Estimation3D Multi-Person Pose Estimation
PaperPDF

Abstract

The recovery of multi-person 3D poses from a single RGB image is a severely ill-conditioned problem due to the inherent 2D-3D depth ambiguity, inter-person occlusions, and body truncations. To tackle these issues, recent works have shown promising results by simultaneously reasoning for different people. However, in most cases this is done by only considering pairwise person interactions, hindering thus a holistic scene representation able to capture long-range interactions. This is addressed by approaches that jointly process all people in the scene, although they require defining one of the individuals as a reference and a pre-defined person ordering, being sensitive to this choice. In this paper, we overcome both these limitations, and we propose an approach for multi-person 3D pose estimation that captures long-range interactions independently of the input order. For this purpose, we build a residual-like permutation-invariant network that successfully refines potentially corrupted initial 3D poses estimated by an off-the-shelf detector. The residual function is learned via Set Transformer blocks, that model the interactions among all initial poses, no matter their ordering or number. A thorough evaluation demonstrates that our approach is able to boost the performance of the initially estimated 3D poses by large margins, achieving state-of-the-art results on standardized benchmarks. Additionally, the proposed module works in a computationally efficient manner and can be potentially used as a drop-in complement for any 3D pose detector in multi-people scenes.

Results

TaskDatasetMetricValueModel
3D Multi-Person Pose Estimation (root-relative)MuPoTS-3D3DPCK85.8PIRN
3D Human Pose EstimationPanopticAverage MPJPE (mm)49.8PIRN
3D Human Pose EstimationMuPoTS-3D3DPCK44.1PIRN
3D Human Pose EstimationMuPoTS-3D3DPCK85.8PIRN
3D Multi-Person Pose Estimation (absolute)MuPoTS-3D3DPCK44.1PIRN
Pose EstimationPanopticAverage MPJPE (mm)49.8PIRN
Pose EstimationMuPoTS-3D3DPCK44.1PIRN
Pose EstimationMuPoTS-3D3DPCK85.8PIRN
3DPanopticAverage MPJPE (mm)49.8PIRN
3DMuPoTS-3D3DPCK44.1PIRN
3DMuPoTS-3D3DPCK85.8PIRN
3D Multi-Person Pose EstimationPanopticAverage MPJPE (mm)49.8PIRN
3D Multi-Person Pose EstimationMuPoTS-3D3DPCK44.1PIRN
3D Multi-Person Pose EstimationMuPoTS-3D3DPCK85.8PIRN
1 Image, 2*2 StitchiPanopticAverage MPJPE (mm)49.8PIRN
1 Image, 2*2 StitchiMuPoTS-3D3DPCK44.1PIRN
1 Image, 2*2 StitchiMuPoTS-3D3DPCK85.8PIRN

Related Papers

$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17Revisiting Reliability in the Reasoning-based Pose Estimation Benchmark2025-07-17DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model2025-07-17From Neck to Head: Bio-Impedance Sensing for Head Pose Estimation2025-07-17AthleticsPose: Authentic Sports Motion Dataset on Athletic Field and Evaluation of Monocular 3D Pose Estimation Ability2025-07-17SpatialTrackerV2: 3D Point Tracking Made Easy2025-07-16SGLoc: Semantic Localization System for Camera Pose Estimation from 3D Gaussian Splatting Representation2025-07-16Efficient Calisthenics Skills Classification through Foreground Instance Selection and Depth Estimation2025-07-16