Bruno Artacho, Andreas Savakis
We propose OmniPose, a single-pass, end-to-end trainable framework, that achieves state-of-the-art results for multi-person pose estimation. Using a novel waterfall module, the OmniPose architecture leverages multi-scale feature representations that increase the effectiveness of backbone feature extractors, without the need for post-processing. OmniPose incorporates contextual information across scales and joint localization with Gaussian heatmap modulation at the multi-scale feature extractor to estimate human pose with state-of-the-art accuracy. The multi-scale representations, obtained by the improved waterfall module in OmniPose, leverage the efficiency of progressive filtering in the cascade architecture, while maintaining multi-scale fields-of-view comparable to spatial pyramid configurations. Our results on multiple datasets demonstrate that OmniPose, with an improved HRNet backbone and waterfall module, is a robust and efficient architecture for multi-person pose estimation that achieves state-of-the-art results.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Pose Estimation | COCO (Common Objects in Context) | AP | 79.5 | OmniPose (WASPv2) |
| Pose Estimation | COCO (Common Objects in Context) | AP50 | 93.6 | OmniPose (WASPv2) |
| Pose Estimation | COCO (Common Objects in Context) | AP75 | 85.9 | OmniPose (WASPv2) |
| Pose Estimation | COCO (Common Objects in Context) | APL | 84.6 | OmniPose (WASPv2) |
| Pose Estimation | COCO (Common Objects in Context) | APM | 76 | OmniPose (WASPv2) |
| Pose Estimation | COCO (Common Objects in Context) | AR | 81.9 | OmniPose (WASPv2) |
| Pose Estimation | UPenn Action | Mean PCK@0.2 | 99.4 | OmniPose |
| Pose Estimation | COCO test-dev | AP | 76.4 | OmniPose (WASPv2) |
| Pose Estimation | COCO test-dev | AP50 | 92.6 | OmniPose (WASPv2) |
| Pose Estimation | COCO test-dev | AP75 | 83.7 | OmniPose (WASPv2) |
| Pose Estimation | COCO test-dev | APL | 82.6 | OmniPose (WASPv2) |
| Pose Estimation | COCO test-dev | APM | 72.6 | OmniPose (WASPv2) |
| Pose Estimation | COCO test-dev | AR | 81.2 | OmniPose (WASPv2) |
| Pose Estimation | MPII | PCKh@0.2 | 92.3 | OmniPose (WASPv2) |
| 3D | COCO (Common Objects in Context) | AP | 79.5 | OmniPose (WASPv2) |
| 3D | COCO (Common Objects in Context) | AP50 | 93.6 | OmniPose (WASPv2) |
| 3D | COCO (Common Objects in Context) | AP75 | 85.9 | OmniPose (WASPv2) |
| 3D | COCO (Common Objects in Context) | APL | 84.6 | OmniPose (WASPv2) |
| 3D | COCO (Common Objects in Context) | APM | 76 | OmniPose (WASPv2) |
| 3D | COCO (Common Objects in Context) | AR | 81.9 | OmniPose (WASPv2) |
| 3D | UPenn Action | Mean PCK@0.2 | 99.4 | OmniPose |
| 3D | COCO test-dev | AP | 76.4 | OmniPose (WASPv2) |
| 3D | COCO test-dev | AP50 | 92.6 | OmniPose (WASPv2) |
| 3D | COCO test-dev | AP75 | 83.7 | OmniPose (WASPv2) |
| 3D | COCO test-dev | APL | 82.6 | OmniPose (WASPv2) |
| 3D | COCO test-dev | APM | 72.6 | OmniPose (WASPv2) |
| 3D | COCO test-dev | AR | 81.2 | OmniPose (WASPv2) |
| 3D | MPII | PCKh@0.2 | 92.3 | OmniPose (WASPv2) |
| 1 Image, 2*2 Stitchi | COCO (Common Objects in Context) | AP | 79.5 | OmniPose (WASPv2) |
| 1 Image, 2*2 Stitchi | COCO (Common Objects in Context) | AP50 | 93.6 | OmniPose (WASPv2) |
| 1 Image, 2*2 Stitchi | COCO (Common Objects in Context) | AP75 | 85.9 | OmniPose (WASPv2) |
| 1 Image, 2*2 Stitchi | COCO (Common Objects in Context) | APL | 84.6 | OmniPose (WASPv2) |
| 1 Image, 2*2 Stitchi | COCO (Common Objects in Context) | APM | 76 | OmniPose (WASPv2) |
| 1 Image, 2*2 Stitchi | COCO (Common Objects in Context) | AR | 81.9 | OmniPose (WASPv2) |
| 1 Image, 2*2 Stitchi | UPenn Action | Mean PCK@0.2 | 99.4 | OmniPose |
| 1 Image, 2*2 Stitchi | COCO test-dev | AP | 76.4 | OmniPose (WASPv2) |
| 1 Image, 2*2 Stitchi | COCO test-dev | AP50 | 92.6 | OmniPose (WASPv2) |
| 1 Image, 2*2 Stitchi | COCO test-dev | AP75 | 83.7 | OmniPose (WASPv2) |
| 1 Image, 2*2 Stitchi | COCO test-dev | APL | 82.6 | OmniPose (WASPv2) |
| 1 Image, 2*2 Stitchi | COCO test-dev | APM | 72.6 | OmniPose (WASPv2) |
| 1 Image, 2*2 Stitchi | COCO test-dev | AR | 81.2 | OmniPose (WASPv2) |
| 1 Image, 2*2 Stitchi | MPII | PCKh@0.2 | 92.3 | OmniPose (WASPv2) |