
Towards Accurate Multi-person Pose Estimation in the Wild

George Papandreou, Tyler Zhu, Nori Kanazawa, Alexander Toshev, Jonathan Tompson, Chris Bregler, Kevin Murphy

2017-01-06 · CVPR 2017
Tasks: Human Detection · Pose Estimation · Multi-Person Pose Estimation · Keypoint Detection

Abstract

We propose a method for multi-person detection and 2-D pose estimation that achieves state-of-the-art results on the challenging COCO keypoints task. It is a simple, yet powerful, top-down approach consisting of two stages. In the first stage, we predict the location and scale of boxes which are likely to contain people; for this we use the Faster RCNN detector. In the second stage, we estimate the keypoints of the person potentially contained in each proposed bounding box. For each keypoint type we predict dense heatmaps and offsets using a fully convolutional ResNet. To combine these outputs we introduce a novel aggregation procedure to obtain highly localized keypoint predictions. We also use a novel form of keypoint-based Non-Maximum Suppression (NMS), instead of the cruder box-level NMS, and a novel form of keypoint-based confidence score estimation, instead of box-level scoring. Trained on COCO data alone, our final system achieves an average precision of 0.649 on the COCO test-dev set and 0.643 on the test-standard set, outperforming the winner of the 2016 COCO keypoints challenge and other recent state-of-the-art methods. Further, by using additional in-house labeled data we obtain an even higher average precision of 0.685 on the test-dev set and 0.673 on the test-standard set, more than 5% absolute improvement over the previous best performing method on the same dataset.
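The second-stage decoding the abstract describes — fusing dense per-keypoint heatmaps with predicted offset fields to get a precise location — can be sketched roughly as follows. This is a simplified NumPy illustration, not the paper's exact aggregation procedure (which scores candidate positions by Hough-style voting within a disk); the function name and weighting scheme here are assumptions for illustration only.

```python
import numpy as np

def decode_keypoint(heatmap, offsets):
    """Fuse a dense heatmap with a 2-D offset field into one keypoint.

    Simplified sketch: each pixel votes for the position it points to via
    its predicted offset vector, weighted by its heatmap probability, and
    the keypoint is the weighted mean of those votes.

    heatmap : (H, W) array of per-pixel keypoint probabilities.
    offsets : (H, W, 2) array of (dy, dx) vectors toward the keypoint.
    """
    H, W = heatmap.shape
    ys, xs = np.mgrid[0:H, 0:W]
    # Each pixel's vote: its own position shifted by its predicted offset.
    vote_y = (ys + offsets[..., 0]).ravel()
    vote_x = (xs + offsets[..., 1]).ravel()
    w = heatmap.ravel()
    # Probability-weighted average of votes = localized keypoint estimate.
    ky = float(np.sum(vote_y * w) / np.sum(w))
    kx = float(np.sum(vote_x * w) / np.sum(w))
    return (ky, kx), float(heatmap.max())
```

The point of the offset field is that the final coordinate is not snapped to the heatmap's (coarse, stride-limited) pixel grid: even a single confidently activated pixel can place the keypoint at a sub-pixel position.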

Results

| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Pose Estimation | COCO test-dev | AP | 64.9 | G-RMI |
| Pose Estimation | COCO test-dev | AP50 | 85.5 | G-RMI |
| Pose Estimation | COCO test-dev | AP75 | 71.3 | G-RMI |
| Pose Estimation | COCO test-dev | APM | 62.3 | G-RMI |
| Pose Estimation | COCO test-dev | APL | 70.0 | G-RMI |
| Pose Estimation | COCO test-dev | AR | 69.7 | G-RMI |
| Pose Estimation | COCO test-dev | AR50 | 88.7 | G-RMI |
| Pose Estimation | COCO test-dev | AR75 | 75.5 | G-RMI |
| Pose Estimation | COCO test-dev | ARM | 64.4 | G-RMI |
| Pose Estimation | COCO test-dev | ARL | 77.1 | G-RMI |
| Pose Estimation | COCO test-challenge | AP | 69.1 | G-RMI* |
| Pose Estimation | COCO test-challenge | AP50 | 85.9 | G-RMI* |
| Pose Estimation | COCO test-challenge | AP75 | 75.2 | G-RMI* |
| Pose Estimation | COCO test-challenge | APL | 82.4 | G-RMI* |
| Pose Estimation | COCO test-challenge | AR | 75.1 | G-RMI* |
| Pose Estimation | COCO test-challenge | AR50 | 90.7 | G-RMI* |
| Pose Estimation | COCO test-challenge | AR75 | 80.7 | G-RMI* |
| Pose Estimation | COCO test-challenge | ARM | 69.7 | G-RMI* |
| Pose Estimation | COCO test-challenge | ARL | 74.5 | G-RMI* |
| Multi-Person Pose Estimation | COCO (Common Objects in Context) | AP | 0.685 | G-RMI* |
| Multi-Person Pose Estimation | COCO (Common Objects in Context) | AP | 0.649 | G-RMI |

\* Trained with additional in-house labeled data.
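The keypoint-based NMS mentioned in the abstract replaces box IoU with a pose-level similarity when deciding which detections to suppress. A natural choice of similarity is Object Keypoint Similarity (OKS), the quantity underlying the COCO AP/AR metrics above. The sketch below is a generic greedy OKS-NMS, not the paper's exact procedure; the threshold, kappa values, and function names are illustrative assumptions.

```python
import numpy as np

def oks(kpts_a, kpts_b, area, kappas):
    """Object Keypoint Similarity between two poses.

    kpts_a, kpts_b : (K, 2) arrays of (y, x) keypoint coordinates.
    area           : scale of the person instance.
    kappas         : (K,) per-keypoint falloff constants (COCO defines
                     these per keypoint type; values are up to the caller).
    """
    d2 = np.sum((kpts_a - kpts_b) ** 2, axis=1)
    return float(np.mean(np.exp(-d2 / (2.0 * area * kappas ** 2))))

def keypoint_nms(poses, scores, areas, kappas, thresh=0.5):
    """Greedy NMS keyed on pose similarity (OKS) rather than box IoU.

    Keeps the highest-scoring pose, suppresses every remaining pose whose
    OKS with it exceeds `thresh`, and repeats on what is left.
    Returns the indices of the kept poses.
    """
    order = np.argsort(scores)[::-1]  # candidates, best score first
    keep = []
    while order.size > 0:
        i = order[0]
        keep.append(int(i))
        rest = order[1:]
        sims = np.array([oks(poses[i], poses[j], areas[j], kappas)
                         for j in rest])
        order = rest[sims <= thresh]  # drop near-duplicate poses
    return keep
```

Compared with box-level NMS, this suppresses detections only when their *poses* nearly coincide, so two people whose bounding boxes overlap heavily (e.g. an embrace) are not merged as long as their keypoints differ.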
