The Center of Attention: Center-Keypoint Grouping via Attention for Multi-Person Pose Estimation

Guillem Brasó, Nikita Kister, Laura Leal-Taixé

2021-10-11ICCV 2021 10Pose Estimation Multi-Person Pose Estimation Clustering

Abstract

We introduce CenterGroup, an attention-based framework to estimate human poses from a set of identity-agnostic keypoints and person center predictions in an image. Our approach uses a transformer to obtain context-aware embeddings for all detected keypoints and centers and then applies multi-head attention to directly group joints into their corresponding person centers. While most bottom-up methods rely on non-learnable clustering at inference, CenterGroup uses a fully differentiable attention mechanism that we train end-to-end together with our keypoint detector. As a result, our method obtains state-of-the-art performance with up to 2.5x faster inference time than competing bottom-up methods. Our code is available at https://github.com/dvl-tum/center-group .

Results

Task	Dataset	Metric	Value	Model
Pose Estimation	COCO (Common Objects in Context)	AP	0.714	CenterGroup
Pose Estimation	COCO (Common Objects in Context)	Test AP	71.4	CenterGroup
Pose Estimation	CrowdPose	AP Easy	76.6	CenterGroup
Pose Estimation	CrowdPose	AP Hard	61.5	CenterGroup
Pose Estimation	CrowdPose	AP Medium	70	CenterGroup
Pose Estimation	CrowdPose	mAP @0.5:0.95	69.4	CenterGroup
3D	COCO (Common Objects in Context)	AP	0.714	CenterGroup
3D	COCO (Common Objects in Context)	Test AP	71.4	CenterGroup
3D	CrowdPose	AP Easy	76.6	CenterGroup
3D	CrowdPose	AP Hard	61.5	CenterGroup
3D	CrowdPose	AP Medium	70	CenterGroup
3D	CrowdPose	mAP @0.5:0.95	69.4	CenterGroup
Multi-Person Pose Estimation	COCO (Common Objects in Context)	AP	0.714	CenterGroup
Multi-Person Pose Estimation	COCO (Common Objects in Context)	Test AP	71.4	CenterGroup
Multi-Person Pose Estimation	CrowdPose	AP Easy	76.6	CenterGroup
Multi-Person Pose Estimation	CrowdPose	AP Hard	61.5	CenterGroup
Multi-Person Pose Estimation	CrowdPose	AP Medium	70	CenterGroup
Multi-Person Pose Estimation	CrowdPose	mAP @0.5:0.95	69.4	CenterGroup
1 Image, 2*2 Stitchi	COCO (Common Objects in Context)	AP	0.714	CenterGroup
1 Image, 2*2 Stitchi	COCO (Common Objects in Context)	Test AP	71.4	CenterGroup
1 Image, 2*2 Stitchi	CrowdPose	AP Easy	76.6	CenterGroup
1 Image, 2*2 Stitchi	CrowdPose	AP Hard	61.5	CenterGroup
1 Image, 2*2 Stitchi	CrowdPose	AP Medium	70	CenterGroup
1 Image, 2*2 Stitchi	CrowdPose	mAP @0.5:0.95	69.4	CenterGroup

The Center of Attention: Center-Keypoint Grouping via Attention for Multi-Person Pose Estimation

Abstract

Results

Related Papers

The Center of Attention: Center-Keypoint Grouping via Attention for Multi-Person Pose Estimation

Abstract

Results

Related Papers