Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

OmniPose: A Multi-Scale Framework for Multi-Person Pose Estimation

Bruno Artacho, Andreas Savakis

2021-03-18 · Pose Estimation

Abstract

We propose OmniPose, a single-pass, end-to-end trainable framework, that achieves state-of-the-art results for multi-person pose estimation. Using a novel waterfall module, the OmniPose architecture leverages multi-scale feature representations that increase the effectiveness of backbone feature extractors, without the need for post-processing. OmniPose incorporates contextual information across scales and joint localization with Gaussian heatmap modulation at the multi-scale feature extractor to estimate human pose with state-of-the-art accuracy. The multi-scale representations, obtained by the improved waterfall module in OmniPose, leverage the efficiency of progressive filtering in the cascade architecture, while maintaining multi-scale fields-of-view comparable to spatial pyramid configurations. Our results on multiple datasets demonstrate that OmniPose, with an improved HRNet backbone and waterfall module, is a robust and efficient architecture for multi-person pose estimation that achieves state-of-the-art results.
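The abstract refers to joint localization with Gaussian heatmaps. As a rough illustration of the general idea only, the sketch below builds the kind of Gaussian heatmap target commonly used by heatmap-based pose estimators and decodes it back to a joint position. It is not OmniPose's modulation scheme, which the abstract does not specify; the function names, grid size, and `sigma` value are hypothetical choices made for this example.

```python
import math


def gaussian_heatmap(width, height, cx, cy, sigma=2.0):
    """Return a height x width grid of responses peaking at joint (cx, cy).

    sigma is an illustrative spread, not a value taken from the paper.
    """
    return [
        [
            math.exp(-((x - cx) ** 2 + (y - cy) ** 2) / (2.0 * sigma ** 2))
            for x in range(width)
        ]
        for y in range(height)
    ]


def argmax_2d(heatmap):
    """Decode a heatmap to the (x, y) cell with the highest response."""
    best = max(
        ((value, x, y) for y, row in enumerate(heatmap) for x, value in enumerate(row)),
        key=lambda item: item[0],
    )
    return best[1], best[2]


# One joint at column 5, row 7 on a small 16 x 12 grid.
heatmap = gaussian_heatmap(width=16, height=12, cx=5, cy=7)
peak = argmax_2d(heatmap)
```

In a network of this kind, one such map is typically predicted per joint, and the decoded peak gives that joint's estimated location.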

Results

Task             Dataset                            Metric         Value  Model
Pose Estimation  COCO (Common Objects in Context)   AP             79.5   OmniPose (WASPv2)
Pose Estimation  COCO (Common Objects in Context)   AP50           93.6   OmniPose (WASPv2)
Pose Estimation  COCO (Common Objects in Context)   AP75           85.9   OmniPose (WASPv2)
Pose Estimation  COCO (Common Objects in Context)   APL            84.6   OmniPose (WASPv2)
Pose Estimation  COCO (Common Objects in Context)   APM            76     OmniPose (WASPv2)
Pose Estimation  COCO (Common Objects in Context)   AR             81.9   OmniPose (WASPv2)
Pose Estimation  UPenn Action                       Mean PCK@0.2   99.4   OmniPose
Pose Estimation  COCO test-dev                      AP             76.4   OmniPose (WASPv2)
Pose Estimation  COCO test-dev                      AP50           92.6   OmniPose (WASPv2)
Pose Estimation  COCO test-dev                      AP75           83.7   OmniPose (WASPv2)
Pose Estimation  COCO test-dev                      APL            82.6   OmniPose (WASPv2)
Pose Estimation  COCO test-dev                      APM            72.6   OmniPose (WASPv2)
Pose Estimation  COCO test-dev                      AR             81.2   OmniPose (WASPv2)
Pose Estimation  MPII                               PCKh@0.2       92.3   OmniPose (WASPv2)
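The PCK-style metrics in the results count a predicted keypoint as correct when its distance to the ground truth is within a fraction (here 0.2) of a reference length. The sketch below shows that computation under stated assumptions: the reference length is passed in by the caller, because the normalisation differs by benchmark (PCKh normalises by head segment size, other PCK variants by torso or bounding-box size) and this page does not say which was used. The function name and sample coordinates are hypothetical.

```python
import math


def pck(predicted, ground_truth, reference_length, alpha=0.2):
    """Fraction of keypoints whose error is at most alpha * reference_length."""
    if len(predicted) != len(ground_truth) or not predicted:
        raise ValueError("keypoint lists must be non-empty and of equal length")
    threshold = alpha * reference_length
    correct = sum(
        1
        for (px, py), (gx, gy) in zip(predicted, ground_truth)
        if math.hypot(px - gx, py - gy) <= threshold
    )
    return correct / len(predicted)


# Made-up keypoints: errors are 1, 3, 0 and 9 pixels against a 4-pixel threshold.
ground_truth = [(10.0, 10.0), (20.0, 20.0), (30.0, 30.0), (40.0, 40.0)]
predicted = [(11.0, 10.0), (20.0, 23.0), (30.0, 30.0), (49.0, 40.0)]
score = pck(predicted, ground_truth, reference_length=20.0)  # 3 of 4 correct
```

Benchmark numbers such as those above are reported as percentages, averaged over joints and test samples according to each dataset's own protocol.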
