HigherHRNet: Scale-Aware Representation Learning for Bottom-Up Human Pose Estimation

Bowen Cheng, Bin Xiao, Jingdong Wang, Honghui Shi, Thomas S. Huang, Lei Zhang

Date: 2019-08-27
Venue: CVPR 2020
Tasks: Representation Learning, 2D Human Pose Estimation, Pose Estimation, Multi-Person Pose Estimation, Pose Prediction

Abstract

Bottom-up human pose estimation methods have difficulty predicting the correct pose for small persons because of scale variation. In this paper, we present HigherHRNet, a novel bottom-up human pose estimation method that learns scale-aware representations using high-resolution feature pyramids. Equipped with multi-resolution supervision for training and multi-resolution aggregation for inference, the proposed approach addresses the scale variation challenge in bottom-up multi-person pose estimation and localizes keypoints more precisely, especially for small persons. The feature pyramid in HigherHRNet consists of the feature map outputs from HRNet and higher-resolution outputs obtained by upsampling through a transposed convolution. HigherHRNet outperforms the previous best bottom-up method by 2.5% AP for medium persons on COCO test-dev, showing its effectiveness in handling scale variation. Furthermore, HigherHRNet achieves a new state-of-the-art result on COCO test-dev (70.5% AP) without using refinement or other post-processing techniques, surpassing all existing bottom-up methods. HigherHRNet even surpasses all top-down methods on CrowdPose test (67.6% AP), suggesting its robustness in crowded scenes. The code and models are available at https://github.com/HRNet/Higher-HRNet-Human-Pose-Estimation.
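
The multi-resolution aggregation mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not specify how heatmaps from different resolutions are resized or combined, so the choices here are assumptions made for illustration only: nearest-neighbor upsampling by an integer factor, followed by a plain average at the largest resolution. The function names (`upsample_nearest`, `aggregate_heatmaps`, `argmax_2d`) and the toy heatmap values are hypothetical and are not taken from the official code.

```python
def upsample_nearest(heatmap, factor):
    """Upsample a 2D heatmap (list of rows) by an integer factor,
    repeating each value (nearest-neighbor; an assumption of this sketch)."""
    out = []
    for row in heatmap:
        wide = [v for v in row for _ in range(factor)]
        out.extend([list(wide) for _ in range(factor)])
    return out


def aggregate_heatmaps(heatmaps):
    """Average heatmaps of different resolutions at the largest resolution.

    Assumes every heatmap's size divides the largest heatmap's size by
    the same integer factor in both dimensions.
    """
    target_h = max(len(h) for h in heatmaps)
    target_w = max(len(h[0]) for h in heatmaps)
    resized = []
    for h in heatmaps:
        factor = target_h // len(h)
        if len(h) * factor != target_h or len(h[0]) * factor != target_w:
            raise ValueError("heatmap sizes must be integer multiples")
        resized.append(upsample_nearest(h, factor))
    return [
        [sum(r[y][x] for r in resized) / len(resized) for x in range(target_w)]
        for y in range(target_h)
    ]


def argmax_2d(heatmap):
    """Return (row, col) of the largest value in a 2D heatmap."""
    best_row, best_col = 0, 0
    for y, row in enumerate(heatmap):
        for x, v in enumerate(row):
            if v > heatmap[best_row][best_col]:
                best_row, best_col = y, x
    return best_row, best_col


# Toy heatmaps for a single keypoint type (hypothetical values).
low_res = [[0.0, 0.5],
           [0.0, 0.0]]
high_res = [[0.0, 0.0, 0.25, 1.0],
            [0.0, 0.0, 0.5, 0.25],
            [0.0, 0.0, 0.0, 0.0],
            [0.0, 0.0, 0.0, 0.0]]

combined = aggregate_heatmaps([low_res, high_res])
print(argmax_2d(combined))  # (0, 3)
```

In this toy case the low-resolution map only narrows the keypoint to the top-right quadrant, while the higher-resolution map picks the exact cell within it. That is the intuition behind using higher-resolution outputs to localize keypoints for small persons.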

Results

Task	Dataset	Metric	Value	Model
Pose Estimation	UAV-Human	mAP	56.5	HigherHRNet
Pose Estimation	COCO test-dev	AP	70.5	HigherHRNet (HR-Net-48)
Pose Estimation	COCO test-dev	AP50	89.3	HigherHRNet (HR-Net-48)
Pose Estimation	COCO test-dev	AP75	77.2	HigherHRNet (HR-Net-48)
Pose Estimation	COCO test-dev	APL	75.8	HigherHRNet (HR-Net-48)
Pose Estimation	COCO test-dev	APM	66.6	HigherHRNet (HR-Net-48)
Pose Estimation	CrowdPose	AP Easy	75.8	HigherHRNet (HR-Net-48)
Pose Estimation	CrowdPose	AP Hard	58.9	HigherHRNet (HR-Net-48)
Pose Estimation	CrowdPose	AP Medium	68.1	HigherHRNet (HR-Net-48)
Pose Estimation	CrowdPose	mAP @0.5:0.95	67.6	HigherHRNet (HR-Net-48)
Multi-Person Pose Estimation	COCO test-dev	AP	70.5	HigherHRNet (HR-Net-48)
Multi-Person Pose Estimation	COCO test-dev	AP50	89.3	HigherHRNet (HR-Net-48)
Multi-Person Pose Estimation	COCO test-dev	AP75	77.2	HigherHRNet (HR-Net-48)
Multi-Person Pose Estimation	COCO test-dev	APL	75.8	HigherHRNet (HR-Net-48)
Multi-Person Pose Estimation	COCO test-dev	APM	66.6	HigherHRNet (HR-Net-48)
Multi-Person Pose Estimation	CrowdPose	AP Easy	75.8	HigherHRNet (HR-Net-48)
Multi-Person Pose Estimation	CrowdPose	AP Hard	58.9	HigherHRNet (HR-Net-48)
Multi-Person Pose Estimation	CrowdPose	AP Medium	68.1	HigherHRNet (HR-Net-48)
Multi-Person Pose Estimation	CrowdPose	mAP @0.5:0.95	67.6	HigherHRNet (HR-Net-48)
