TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Towards Robust and Unconstrained Full Range of Rotation He...

Towards Robust and Unconstrained Full Range of Rotation Head Pose Estimation

Thorsten Hempel, Ahmed A. Abdelrahman, Ayoub Al-Hamadi

2023-09-14PredictionPose EstimationPose PredictionHead Pose Estimation
PaperPDFCode(official)Code

Abstract

Estimating the head pose of a person is a crucial problem for numerous applications that is yet mainly addressed as a subtask of frontal pose prediction. We present a novel method for unconstrained end-to-end head pose estimation to tackle the challenging task of full range of orientation head pose prediction. We address the issue of ambiguous rotation labels by introducing the rotation matrix formalism for our ground truth data and propose a continuous 6D rotation matrix representation for efficient and robust direct regression. This allows to efficiently learn full rotation appearance and to overcome the limitations of the current state-of-the-art. Together with new accumulated training data that provides full head pose rotation data and a geodesic loss approach for stable learning, we design an advanced model that is able to predict an extended range of head orientations. An extensive evaluation on public datasets demonstrates that our method significantly outperforms other state-of-the-art methods in an efficient and robust manner, while its advanced prediction range allows the expansion of the application area. We open-source our training and testing code along with our trained models: https://github.com/thohemp/6DRepNet360.

Results

TaskDatasetMetricValueModel
Pose EstimationCMU Panoptic + 300W-LPMAE2.666DRepNet360
Pose EstimationAFLW2000MAE3.616DRepNet
Pose EstimationAFLW2000MAEV4.666DRepNet
Pose EstimationAFLW2000MAEV4.646DRepNet360
Pose EstimationBIWIMAE (trained with other data)3.396DRepNet360
Pose EstimationBIWIMAEV4.856DRepNet360
Pose EstimationBIWIMAEV5.326DRepNet
3DCMU Panoptic + 300W-LPMAE2.666DRepNet360
3DAFLW2000MAE3.616DRepNet
3DAFLW2000MAEV4.666DRepNet
3DAFLW2000MAEV4.646DRepNet360
3DBIWIMAE (trained with other data)3.396DRepNet360
3DBIWIMAEV4.856DRepNet360
3DBIWIMAEV5.326DRepNet
1 Image, 2*2 StitchiCMU Panoptic + 300W-LPMAE2.666DRepNet360
1 Image, 2*2 StitchiAFLW2000MAE3.616DRepNet
1 Image, 2*2 StitchiAFLW2000MAEV4.666DRepNet
1 Image, 2*2 StitchiAFLW2000MAEV4.646DRepNet360
1 Image, 2*2 StitchiBIWIMAE (trained with other data)3.396DRepNet360
1 Image, 2*2 StitchiBIWIMAEV4.856DRepNet360
1 Image, 2*2 StitchiBIWIMAEV5.326DRepNet

Related Papers

Multi-Strategy Improved Snake Optimizer Accelerated CNN-LSTM-Attention-Adaboost for Trajectory Prediction2025-07-21$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17Revisiting Reliability in the Reasoning-based Pose Estimation Benchmark2025-07-17DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model2025-07-17From Neck to Head: Bio-Impedance Sensing for Head Pose Estimation2025-07-17AthleticsPose: Authentic Sports Motion Dataset on Athletic Field and Evaluation of Monocular 3D Pose Estimation Ability2025-07-17SpatialTrackerV2: 3D Point Tracking Made Easy2025-07-16SGLoc: Semantic Localization System for Camera Pose Estimation from 3D Gaussian Splatting Representation2025-07-16