TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Towards Robust and Smooth 3D Multi-Person Pose Estimation ...

Towards Robust and Smooth 3D Multi-Person Pose Estimation from Monocular Videos in the Wild

Sungchan Park, Eunyi You, Inhoe Lee, Joonseok Lee

2023-09-15ICCV 2023 1Data AugmentationPose EstimationMulti-Person Pose Estimation3D Multi-Person Pose Estimation (root-relative)3D Multi-Person Pose Estimation (absolute)3D Pose Estimation3D Multi-Person Pose Estimation
PaperPDF

Abstract

3D pose estimation is an invaluable task in computer vision with various practical applications. Especially, 3D pose estimation for multi-person from a monocular video (3DMPPE) is particularly challenging and is still largely uncharted, far from applying to in-the-wild scenarios yet. We pose three unresolved issues with the existing methods: lack of robustness on unseen views during training, vulnerability to occlusion, and severe jittering in the output. As a remedy, we propose POTR-3D, the first realization of a sequence-to-sequence 2D-to-3D lifting model for 3DMPPE, powered by a novel geometry-aware data augmentation strategy, capable of generating unbounded data with a variety of views while caring about the ground plane and occlusions. Through extensive experiments, we verify that the proposed model and data augmentation robustly generalizes to diverse unseen views, robustly recovers the poses against heavy occlusions, and reliably generates more natural and smoother outputs. The effectiveness of our approach is verified not only by achieving the state-of-the-art performance on public benchmarks, but also by qualitative results on more challenging in-the-wild videos. Demo videos are available at https://www.youtube.com/@potr3d.

Results

TaskDatasetMetricValueModel
3D Multi-Person Pose Estimation (root-relative)MuPoTS-3D3DPCK83.7POTR-3D
3D Human Pose EstimationMuPoTS-3D3DPCK50.9POTR-3D
3D Human Pose EstimationMuPoTS-3D3DPCK83.7POTR-3D
3D Multi-Person Pose Estimation (absolute)MuPoTS-3D3DPCK50.9POTR-3D
Pose EstimationMuPoTS-3D3DPCK50.9POTR-3D
Pose EstimationMuPoTS-3D3DPCK83.7POTR-3D
3DMuPoTS-3D3DPCK50.9POTR-3D
3DMuPoTS-3D3DPCK83.7POTR-3D
3D Multi-Person Pose EstimationMuPoTS-3D3DPCK50.9POTR-3D
3D Multi-Person Pose EstimationMuPoTS-3D3DPCK83.7POTR-3D
1 Image, 2*2 StitchiMuPoTS-3D3DPCK50.9POTR-3D
1 Image, 2*2 StitchiMuPoTS-3D3DPCK83.7POTR-3D

Related Papers

Overview of the TalentCLEF 2025: Skill and Job Title Intelligence for Human Capital Management2025-07-17Pixel Perfect MegaMed: A Megapixel-Scale Vision-Language Foundation Model for Generating High Resolution Medical Images2025-07-17$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17Revisiting Reliability in the Reasoning-based Pose Estimation Benchmark2025-07-17DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model2025-07-17From Neck to Head: Bio-Impedance Sensing for Head Pose Estimation2025-07-17AthleticsPose: Authentic Sports Motion Dataset on Athletic Field and Evaluation of Monocular 3D Pose Estimation Ability2025-07-17Similarity-Guided Diffusion for Contrastive Sequential Recommendation2025-07-16