TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/On the representation and methodology for wide and short r...

On the representation and methodology for wide and short range head pose estimation

Alejandro Cobo, Roberto Valle, José M. Buenaposada, Luis Baumela

2024-01-11Pattern Recognition 2024 1Pose EstimationHead Pose Estimation
PaperPDFCode(official)

Abstract

Head pose estimation (HPE) is a problem of interest in computer vision to improve the performance of face processing tasks in semi-frontal or profile settings. Recent applications require the analysis of faces in the full 360{\deg} rotation range. Traditional approaches to solve the semi-frontal and profile cases are not directly amenable for the full rotation case. In this paper we analyze the methodology for short- and wide-range HPE and discuss which representations and metrics are adequate for each case. We show that the popular Euler angles representation is a good choice for short-range HPE, but not at extreme rotations. However, the Euler angles' gimbal lock problem prevents them from being used as a valid metric in any setting. We also revisit the current cross-data set evaluation methodology and note that the lack of alignment between the reference systems of the training and test data sets negatively biases the results of all articles in the literature. We introduce a procedure to quantify this misalignment and a new methodology for cross-data set HPE that establishes new, more accurate, SOTA for the 300W-LP|Biwi benchmark. We also propose a generalization of the geodesic angular distance metric that enables the construction of a loss that controls the contribution of each training sample to the optimization of the model. Finally, we introduce a wide range HPE benchmark based on the CMU Panoptic data set.

Results

TaskDatasetMetricValueModel
Pose EstimationPanopticGeodesic Error (GE)7.45WRHP-6D-Opal
Pose EstimationPanopticGeodesic Error (GE)7.7WRHP-6D
Pose EstimationPanopticGeodesic Error (GE)9.32WRHP-Quaternion
Pose EstimationPanopticGeodesic Error (GE)10.47WRHP-Euler
Pose EstimationAFLW2000Geodesic Error (GE)5.29SRHP-Euler
Pose EstimationAFLW2000MAE3.25SRHP-Euler
Pose EstimationAFLW2000Geodesic Error (GE)5.37SRHP-6D
Pose EstimationAFLW2000MAE3.49SRHP-6D
Pose EstimationBIWIGeodesic Error (GE)7.3SRHP-6D
Pose EstimationBIWIGeodesic Error - aligned (GE)5.48SRHP-6D
Pose EstimationBIWIMAE (trained with other data)3.98SRHP-6D
Pose EstimationBIWIMAE-aligned (trained with other data)3.21SRHP-6D
Pose EstimationBIWIGeodesic Error (GE)7.49SRHP-Euler
Pose EstimationBIWIGeodesic Error - aligned (GE)5.42SRHP-Euler
Pose EstimationBIWIMAE (trained with other data)4.13SRHP-Euler
Pose EstimationBIWIMAE-aligned (trained with other data)3.16SRHP-Euler
3DPanopticGeodesic Error (GE)7.45WRHP-6D-Opal
3DPanopticGeodesic Error (GE)7.7WRHP-6D
3DPanopticGeodesic Error (GE)9.32WRHP-Quaternion
3DPanopticGeodesic Error (GE)10.47WRHP-Euler
3DAFLW2000Geodesic Error (GE)5.29SRHP-Euler
3DAFLW2000MAE3.25SRHP-Euler
3DAFLW2000Geodesic Error (GE)5.37SRHP-6D
3DAFLW2000MAE3.49SRHP-6D
3DBIWIGeodesic Error (GE)7.3SRHP-6D
3DBIWIGeodesic Error - aligned (GE)5.48SRHP-6D
3DBIWIMAE (trained with other data)3.98SRHP-6D
3DBIWIMAE-aligned (trained with other data)3.21SRHP-6D
3DBIWIGeodesic Error (GE)7.49SRHP-Euler
3DBIWIGeodesic Error - aligned (GE)5.42SRHP-Euler
3DBIWIMAE (trained with other data)4.13SRHP-Euler
3DBIWIMAE-aligned (trained with other data)3.16SRHP-Euler
1 Image, 2*2 StitchiPanopticGeodesic Error (GE)7.45WRHP-6D-Opal
1 Image, 2*2 StitchiPanopticGeodesic Error (GE)7.7WRHP-6D
1 Image, 2*2 StitchiPanopticGeodesic Error (GE)9.32WRHP-Quaternion
1 Image, 2*2 StitchiPanopticGeodesic Error (GE)10.47WRHP-Euler
1 Image, 2*2 StitchiAFLW2000Geodesic Error (GE)5.29SRHP-Euler
1 Image, 2*2 StitchiAFLW2000MAE3.25SRHP-Euler
1 Image, 2*2 StitchiAFLW2000Geodesic Error (GE)5.37SRHP-6D
1 Image, 2*2 StitchiAFLW2000MAE3.49SRHP-6D
1 Image, 2*2 StitchiBIWIGeodesic Error (GE)7.3SRHP-6D
1 Image, 2*2 StitchiBIWIGeodesic Error - aligned (GE)5.48SRHP-6D
1 Image, 2*2 StitchiBIWIMAE (trained with other data)3.98SRHP-6D
1 Image, 2*2 StitchiBIWIMAE-aligned (trained with other data)3.21SRHP-6D
1 Image, 2*2 StitchiBIWIGeodesic Error (GE)7.49SRHP-Euler
1 Image, 2*2 StitchiBIWIGeodesic Error - aligned (GE)5.42SRHP-Euler
1 Image, 2*2 StitchiBIWIMAE (trained with other data)4.13SRHP-Euler
1 Image, 2*2 StitchiBIWIMAE-aligned (trained with other data)3.16SRHP-Euler

Related Papers

$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17Revisiting Reliability in the Reasoning-based Pose Estimation Benchmark2025-07-17DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model2025-07-17From Neck to Head: Bio-Impedance Sensing for Head Pose Estimation2025-07-17AthleticsPose: Authentic Sports Motion Dataset on Athletic Field and Evaluation of Monocular 3D Pose Estimation Ability2025-07-17SpatialTrackerV2: 3D Point Tracking Made Easy2025-07-16SGLoc: Semantic Localization System for Camera Pose Estimation from 3D Gaussian Splatting Representation2025-07-16Efficient Calisthenics Skills Classification through Foreground Instance Selection and Depth Estimation2025-07-16