Hao-Shu Fang, Shuqin Xie, Yu-Wing Tai, Cewu Lu
Multi-person pose estimation in the wild is challenging. Although state-of-the-art human detectors have demonstrated good performance, small errors in localization and recognition are inevitable. These errors can cause failures for a single-person pose estimator (SPPE), especially for methods that depend solely on human detection results. In this paper, we propose a novel regional multi-person pose estimation (RMPE) framework to facilitate pose estimation in the presence of inaccurate human bounding boxes. Our framework consists of three components: Symmetric Spatial Transformer Network (SSTN), Parametric Pose Non-Maximum-Suppression (NMS), and Pose-Guided Proposals Generator (PGPG). Our method is able to handle inaccurate bounding boxes and redundant detections, allowing it to achieve a 17% increase in mAP over the state-of-the-art methods on the MPII (multi person) dataset. Our model and source code are publicly available.
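The abstract names pose NMS as the component that removes redundant detections but does not spell out the criterion. As a rough illustration only, the sketch below shows a generic greedy pose NMS: keep the highest-scoring pose, discard remaining poses that are too similar to it, and repeat. The similarity function, the `sigma` scale, and the threshold are all assumptions chosen for this example; they are not the parametric criterion or the learned parameters from the paper.

```python
import numpy as np


def pose_similarity(ref, other, sigma=10.0):
    """Soft count of joints in `other` lying close to the same joint in `ref`.

    ref, other: arrays of shape (K, 2) holding (x, y) per joint.
    sigma: hypothetical spatial scale in pixels (an assumption of this sketch).
    """
    sq_dist = np.sum((ref - other) ** 2, axis=1)
    return float(np.sum(np.exp(-sq_dist / (2.0 * sigma ** 2))))


def greedy_pose_nms(poses, scores, sim_threshold):
    """Return indices of poses kept after greedy suppression.

    poses: array of shape (N, K, 2); scores: array of shape (N,).
    A pose is suppressed when its similarity to an already kept,
    higher-scoring pose reaches `sim_threshold` (hypothetical value).
    """
    order = [int(i) for i in np.argsort(scores)[::-1]]  # best score first
    keep = []
    while order:
        best = order.pop(0)
        keep.append(best)
        order = [
            i for i in order
            if pose_similarity(poses[best], poses[i]) < sim_threshold
        ]
    return keep


# Three candidate poses with three joints each: the second is a
# near-duplicate of the first, the third is a different person far away.
base = np.array([[0.0, 0.0], [10.0, 0.0], [20.0, 0.0]])
poses = np.stack([base, base + 1.0, base + 200.0])
scores = np.array([0.9, 0.8, 0.7])
kept = greedy_pose_nms(poses, scores, sim_threshold=2.0)
```

In this toy input the near-duplicate is suppressed and the distant pose survives, so `kept` holds the indices of the first and third poses. A learned, data-driven criterion would replace the fixed `sigma` and threshold used here.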
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Pose Estimation | OCHuman | Test AP | 30.7 | RMPE |
| Pose Estimation | OCHuman | Validation AP | 38.8 | RMPE |
| Pose Estimation | COCO test-dev | AP | 72.3 | RMPE++ |
| Pose Estimation | COCO test-dev | AP50 | 89.2 | RMPE++ |
| Pose Estimation | COCO test-dev | AP75 | 79.1 | RMPE++ |
| Pose Estimation | COCO test-dev | APL | 78.6 | RMPE++ |
| Pose Estimation | COCO test-dev | APM | 68 | RMPE++ |
| Pose Estimation | COCO test-dev | AP | 61.8 | RMPE |
| Pose Estimation | COCO test-dev | AP50 | 83.7 | RMPE |
| Pose Estimation | COCO test-dev | AP75 | 69.8 | RMPE |
| Pose Estimation | COCO test-dev | APL | 67.6 | RMPE |
| Pose Estimation | COCO test-dev | APM | 58.6 | RMPE |
| Pose Estimation | UAV-Human | mAP | 56.9 | AlphaPose |
| Pose Estimation | COCO test-dev | APL | 81.5 | AlphaPose |
| Pose Estimation | COCO (Common Objects in Context) | FPS | 23 | AlphaPose |
| Pose Estimation | COCO (Common Objects in Context) | Test AP | 73.3 | AlphaPose |
| Pose Estimation | CrowdPose | AP Easy | 71.2 | AlphaPose |
| Pose Estimation | CrowdPose | AP Hard | 51.1 | AlphaPose |
| Pose Estimation | CrowdPose | AP Medium | 61.4 | AlphaPose |
| Pose Estimation | CrowdPose | mAP @0.5:0.95 | 61 | AlphaPose |
| 3D | OCHuman | Test AP | 30.7 | RMPE |
| 3D | OCHuman | Validation AP | 38.8 | RMPE |
| 3D | COCO test-dev | AP | 72.3 | RMPE++ |
| 3D | COCO test-dev | AP50 | 89.2 | RMPE++ |
| 3D | COCO test-dev | AP75 | 79.1 | RMPE++ |
| 3D | COCO test-dev | APL | 78.6 | RMPE++ |
| 3D | COCO test-dev | APM | 68 | RMPE++ |
| 3D | COCO test-dev | AP | 61.8 | RMPE |
| 3D | COCO test-dev | AP50 | 83.7 | RMPE |
| 3D | COCO test-dev | AP75 | 69.8 | RMPE |
| 3D | COCO test-dev | APL | 67.6 | RMPE |
| 3D | COCO test-dev | APM | 58.6 | RMPE |
| 3D | UAV-Human | mAP | 56.9 | AlphaPose |
| 3D | COCO test-dev | APL | 81.5 | AlphaPose |
| 3D | COCO (Common Objects in Context) | FPS | 23 | AlphaPose |
| 3D | COCO (Common Objects in Context) | Test AP | 73.3 | AlphaPose |
| 3D | CrowdPose | AP Easy | 71.2 | AlphaPose |
| 3D | CrowdPose | AP Hard | 51.1 | AlphaPose |
| 3D | CrowdPose | AP Medium | 61.4 | AlphaPose |
| 3D | CrowdPose | mAP @0.5:0.95 | 61 | AlphaPose |
| 2D Human Pose Estimation | OCHuman | Test AP | 30.7 | RMPE |
| 2D Human Pose Estimation | OCHuman | Validation AP | 38.8 | RMPE |
| Multi-Person Pose Estimation | COCO test-dev | AP | 61.8 | RMPE |
| Multi-Person Pose Estimation | COCO test-dev | AP50 | 83.7 | RMPE |
| Multi-Person Pose Estimation | COCO test-dev | AP75 | 69.8 | RMPE |
| Multi-Person Pose Estimation | COCO test-dev | APL | 67.6 | RMPE |
| Multi-Person Pose Estimation | COCO test-dev | APM | 58.6 | RMPE |
| Multi-Person Pose Estimation | CrowdPose | AP Easy | 71.2 | AlphaPose |
| Multi-Person Pose Estimation | CrowdPose | AP Hard | 51.1 | AlphaPose |
| Multi-Person Pose Estimation | CrowdPose | AP Medium | 61.4 | AlphaPose |
| Multi-Person Pose Estimation | CrowdPose | mAP @0.5:0.95 | 61 | AlphaPose |
| 1 Image, 2*2 Stitchi | OCHuman | Test AP | 30.7 | RMPE |
| 1 Image, 2*2 Stitchi | OCHuman | Validation AP | 38.8 | RMPE |
| 1 Image, 2*2 Stitchi | COCO test-dev | AP | 72.3 | RMPE++ |
| 1 Image, 2*2 Stitchi | COCO test-dev | AP50 | 89.2 | RMPE++ |
| 1 Image, 2*2 Stitchi | COCO test-dev | AP75 | 79.1 | RMPE++ |
| 1 Image, 2*2 Stitchi | COCO test-dev | APL | 78.6 | RMPE++ |
| 1 Image, 2*2 Stitchi | COCO test-dev | APM | 68 | RMPE++ |
| 1 Image, 2*2 Stitchi | COCO test-dev | AP | 61.8 | RMPE |
| 1 Image, 2*2 Stitchi | COCO test-dev | AP50 | 83.7 | RMPE |
| 1 Image, 2*2 Stitchi | COCO test-dev | AP75 | 69.8 | RMPE |
| 1 Image, 2*2 Stitchi | COCO test-dev | APL | 67.6 | RMPE |
| 1 Image, 2*2 Stitchi | COCO test-dev | APM | 58.6 | RMPE |
| 1 Image, 2*2 Stitchi | UAV-Human | mAP | 56.9 | AlphaPose |
| 1 Image, 2*2 Stitchi | COCO test-dev | APL | 81.5 | AlphaPose |
| 1 Image, 2*2 Stitchi | COCO (Common Objects in Context) | FPS | 23 | AlphaPose |
| 1 Image, 2*2 Stitchi | COCO (Common Objects in Context) | Test AP | 73.3 | AlphaPose |
| 1 Image, 2*2 Stitchi | CrowdPose | AP Easy | 71.2 | AlphaPose |
| 1 Image, 2*2 Stitchi | CrowdPose | AP Hard | 51.1 | AlphaPose |
| 1 Image, 2*2 Stitchi | CrowdPose | AP Medium | 61.4 | AlphaPose |
| 1 Image, 2*2 Stitchi | CrowdPose | mAP @0.5:0.95 | 61 | AlphaPose |