TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/6D Rotation Representation For Unconstrained Head Pose Est...

6D Rotation Representation For Unconstrained Head Pose Estimation

Thorsten Hempel, Ahmed A. Abdelrahman, Ayoub Al-Hamadi

2022-02-25Pose EstimationPose PredictionHead Pose Estimation
PaperPDFCode(official)Code

Abstract

In this paper, we present a method for unconstrained end-to-end head pose estimation. We address the problem of ambiguous rotation labels by introducing the rotation matrix formalism for our ground truth data and propose a continuous 6D rotation matrix representation for efficient and robust direct regression. This way, our method can learn the full rotation appearance which is contrary to previous approaches that restrict the pose prediction to a narrow-angle for satisfactory results. In addition, we propose a geodesic distance-based loss to penalize our network with respect to the SO(3) manifold geometry. Experiments on the public AFLW2000 and BIWI datasets demonstrate that our proposed method significantly outperforms other state-of-the-art methods by up to 20\%. We open-source our training and testing code along with our pre-trained models: https://github.com/thohemp/6DRepNet.

Results

TaskDatasetMetricValueModel
Pose EstimationPanopticGeodesic Error (GE)8.086DRepNet
Pose EstimationAFLW2000MAE3.976DRepNet
Pose EstimationBIWIMAE (trained with BIWI data)2.666DRepNet
Pose EstimationBIWIMAE (trained with other data)3.476DRepNet
3DPanopticGeodesic Error (GE)8.086DRepNet
3DAFLW2000MAE3.976DRepNet
3DBIWIMAE (trained with BIWI data)2.666DRepNet
3DBIWIMAE (trained with other data)3.476DRepNet
1 Image, 2*2 StitchiPanopticGeodesic Error (GE)8.086DRepNet
1 Image, 2*2 StitchiAFLW2000MAE3.976DRepNet
1 Image, 2*2 StitchiBIWIMAE (trained with BIWI data)2.666DRepNet
1 Image, 2*2 StitchiBIWIMAE (trained with other data)3.476DRepNet

Related Papers

$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17Revisiting Reliability in the Reasoning-based Pose Estimation Benchmark2025-07-17DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model2025-07-17From Neck to Head: Bio-Impedance Sensing for Head Pose Estimation2025-07-17AthleticsPose: Authentic Sports Motion Dataset on Athletic Field and Evaluation of Monocular 3D Pose Estimation Ability2025-07-17SpatialTrackerV2: 3D Point Tracking Made Easy2025-07-16SGLoc: Semantic Localization System for Camera Pose Estimation from 3D Gaussian Splatting Representation2025-07-16Efficient Calisthenics Skills Classification through Foreground Instance Selection and Depth Estimation2025-07-16