Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Fine-Grained Head Pose Estimation Without Keypoints

Nataniel Ruiz, Eunji Chong, James M. Rehg

2017-10-02 · Face Alignment · Pose Estimation · Gaze Estimation · Head Pose Estimation

Abstract

Estimating the head pose of a person is a crucial problem with a wide range of applications, such as aiding gaze estimation, modeling attention, fitting 3D models to video, and performing face alignment. Traditionally, head pose is computed by estimating keypoints from the target face and solving the 2D-to-3D correspondence problem with a mean human head model. We argue that this is a fragile method because it relies entirely on landmark detection performance, the extraneous head model, and an ad-hoc fitting step. We present an elegant and robust way to determine pose by training a multi-loss convolutional neural network on 300W-LP, a large synthetically expanded dataset, to predict intrinsic Euler angles (yaw, pitch, and roll) directly from image intensities through joint binned pose classification and regression. We present empirical tests on common in-the-wild pose benchmark datasets which show state-of-the-art results. Additionally, we test our method on a dataset usually used for pose estimation with depth and start to close the gap with state-of-the-art depth-based pose methods. We open-source our training and testing code, and release our pre-trained models.
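The abstract describes predicting each Euler angle through joint binned classification and regression. A minimal NumPy sketch of that combined loss follows; it is not the authors' released code. The 66 bins of 3° covering roughly [-99°, 99°], the expected-angle decoding, and the weight `alpha` are assumptions modeled on the paper's description (the table below lists a "Multi-Loss ResNet50 (a=2)" variant, suggesting alpha=2).

```python
import numpy as np

NUM_BINS = 66
# assumed bin layout: 3-degree bins spanning [-99, 99] degrees
BIN_CENTERS = np.arange(NUM_BINS) * 3.0 - 99.0

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

def multi_loss(logits, angle_gt, alpha=2.0):
    """Combined loss for one Euler angle (yaw, pitch, or roll).

    Cross-entropy over angle bins plus alpha-weighted MSE on the
    continuous angle recovered as the softmax expectation over bin
    centers. Returns (loss, predicted_angle_in_degrees).
    """
    probs = softmax(np.asarray(logits, dtype=float))
    # classification target: index of the bin containing the ground truth
    bin_gt = int(np.clip((angle_gt + 99.0) // 3.0, 0, NUM_BINS - 1))
    ce = -np.log(probs[bin_gt] + 1e-12)
    # regression target: expected angle under the predicted distribution
    angle_pred = float(np.dot(probs, BIN_CENTERS))
    mse = (angle_pred - angle_gt) ** 2
    return ce + alpha * mse, angle_pred
```

At inference time only the expectation step is needed, so the network outputs a fine-grained continuous angle despite being trained with coarse bins.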

Results

Task            | Dataset  | Metric                                | Value | Model
Pose Estimation | AFLW2000 | Geodesic Error (GE)                   | 993   | Hopenet
Pose Estimation | AFLW2000 | MAE                                   | 6.15  | Hopenet
Pose Estimation | AFLW2000 | MAE                                   | 6.155 | Multi-Loss ResNet50 (a=2)
Pose Estimation | BIWI     | Geodesic Error (GE)                   | 9.53  | Hopenet
Pose Estimation | BIWI     | Geodesic Error - aligned (GE)         | 6.6   | Hopenet
Pose Estimation | BIWI     | MAE (trained with other data)         | 4.89  | Hopenet
Pose Estimation | BIWI     | MAE - aligned (trained with other data) | 3.48 | Hopenet
Pose Estimation | BIWI     | MAE (trained with BIWI data)          | 4.895 | Multi-Loss ResNet50
Pose Estimation | AFLW     | MAE                                   | 5.324 | Ruiz et al.
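The results above report two metrics: MAE over the three Euler angles and geodesic error between rotations. As a rough sketch (not the benchmarks' official evaluation code), they can be computed as below; the rotation matrices are assumed to be built from the predicted and ground-truth angles under a common convention.

```python
import numpy as np

def mae_degrees(pred_angles, gt_angles):
    """Mean absolute error over (yaw, pitch, roll), in degrees."""
    pred = np.asarray(pred_angles, dtype=float)
    gt = np.asarray(gt_angles, dtype=float)
    return float(np.mean(np.abs(pred - gt)))

def geodesic_error(R_pred, R_gt):
    """Angle (degrees) of the relative rotation R_pred^T R_gt.

    Uses the standard identity tr(R) = 1 + 2*cos(theta); the clip
    guards against floating-point values slightly outside [-1, 1].
    """
    cos_theta = (np.trace(R_pred.T @ R_gt) - 1.0) / 2.0
    return float(np.degrees(np.arccos(np.clip(cos_theta, -1.0, 1.0))))
```

Unlike per-axis MAE, the geodesic error measures the full 3D rotation offset, which is why the two metrics can rank models differently.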

Related Papers

- $π^3$: Scalable Permutation-Equivariant Visual Geometry Learning (2025-07-17)
- Revisiting Reliability in the Reasoning-based Pose Estimation Benchmark (2025-07-17)
- DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model (2025-07-17)
- From Neck to Head: Bio-Impedance Sensing for Head Pose Estimation (2025-07-17)
- AthleticsPose: Authentic Sports Motion Dataset on Athletic Field and Evaluation of Monocular 3D Pose Estimation Ability (2025-07-17)
- SpatialTrackerV2: 3D Point Tracking Made Easy (2025-07-16)
- SGLoc: Semantic Localization System for Camera Pose Estimation from 3D Gaussian Splatting Representation (2025-07-16)
- Efficient Calisthenics Skills Classification through Foreground Instance Selection and Depth Estimation (2025-07-16)