TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/6DoF Head Pose Estimation through Explicit Bidirectional I...

6DoF Head Pose Estimation through Explicit Bidirectional Interaction with Face Geometry

Sungho Chun, Ju Yong Chang

2024-07-19TranslationPose EstimationHead Pose Estimation
PaperPDFCode(official)

Abstract

This study addresses the nuanced challenge of estimating head translations within the context of six-degrees-of-freedom (6DoF) head pose estimation, placing emphasis on this aspect over the more commonly studied head rotations. Identifying a gap in existing methodologies, we recognized the underutilized potential synergy between facial geometry and head translation. To bridge this gap, we propose a novel approach called the head Translation, Rotation, and face Geometry network (TRG), which stands out for its explicit bidirectional interaction structure. This structure has been carefully designed to leverage the complementary relationship between face geometry and head translation, marking a significant advancement in the field of head pose estimation. Our contributions also include the development of a strategy for estimating bounding box correction parameters and a technique for aligning landmarks to image. Both of these innovations demonstrate superior performance in 6DoF head pose estimation tasks. Extensive experiments conducted on ARKitFace and BIWI datasets confirm that the proposed method outperforms current state-of-the-art techniques. Codes are released at https://github.com/asw91666/TRG-Release.

Results

TaskDatasetMetricValueModel
Pose EstimationARKitFaceMAE(val)0.91TRG (w/ 300WLP)
Pose EstimationARKitFaceMAE_t3.62TRG (w/ 300WLP)
Pose EstimationARKitFaceMAE(val)0.92TRG
Pose EstimationARKitFaceMAE_t3.64TRG
Pose EstimationBIWIGeodesic Error - aligned (GE)5.68TRG (w/ 300WLP)
Pose EstimationBIWIMAE (trained with other data)2.75TRG (w/ 300WLP)
Pose EstimationBIWIMAE_t12.97TRG (w/ 300WLP)
Pose EstimationBIWIGeodesic Error - aligned (GE)5.35TRG
Pose EstimationBIWIMAE (trained with other data)2.89TRG
Pose EstimationBIWIMAE_t14.31TRG
3DARKitFaceMAE(val)0.91TRG (w/ 300WLP)
3DARKitFaceMAE_t3.62TRG (w/ 300WLP)
3DARKitFaceMAE(val)0.92TRG
3DARKitFaceMAE_t3.64TRG
3DBIWIGeodesic Error - aligned (GE)5.68TRG (w/ 300WLP)
3DBIWIMAE (trained with other data)2.75TRG (w/ 300WLP)
3DBIWIMAE_t12.97TRG (w/ 300WLP)
3DBIWIGeodesic Error - aligned (GE)5.35TRG
3DBIWIMAE (trained with other data)2.89TRG
3DBIWIMAE_t14.31TRG
1 Image, 2*2 StitchiARKitFaceMAE(val)0.91TRG (w/ 300WLP)
1 Image, 2*2 StitchiARKitFaceMAE_t3.62TRG (w/ 300WLP)
1 Image, 2*2 StitchiARKitFaceMAE(val)0.92TRG
1 Image, 2*2 StitchiARKitFaceMAE_t3.64TRG
1 Image, 2*2 StitchiBIWIGeodesic Error - aligned (GE)5.68TRG (w/ 300WLP)
1 Image, 2*2 StitchiBIWIMAE (trained with other data)2.75TRG (w/ 300WLP)
1 Image, 2*2 StitchiBIWIMAE_t12.97TRG (w/ 300WLP)
1 Image, 2*2 StitchiBIWIGeodesic Error - aligned (GE)5.35TRG
1 Image, 2*2 StitchiBIWIMAE (trained with other data)2.89TRG
1 Image, 2*2 StitchiBIWIMAE_t14.31TRG

Related Papers

A Translation of Probabilistic Event Calculus into Markov Decision Processes2025-07-17$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17Revisiting Reliability in the Reasoning-based Pose Estimation Benchmark2025-07-17DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model2025-07-17From Neck to Head: Bio-Impedance Sensing for Head Pose Estimation2025-07-17AthleticsPose: Authentic Sports Motion Dataset on Athletic Field and Evaluation of Monocular 3D Pose Estimation Ability2025-07-17SpatialTrackerV2: 3D Point Tracking Made Easy2025-07-16SGLoc: Semantic Localization System for Camera Pose Estimation from 3D Gaussian Splatting Representation2025-07-16