TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Hybrid coarse-fine classification for head pose estimation

Hybrid coarse-fine classification for head pose estimation

Haofan Wang, Zhenghua Chen, Yi Zhou

2019-01-21Face AlignmentregressionQuantizationPose Estimation3D ReconstructionGaze EstimationGeneral ClassificationClassificationHead Pose Estimation
PaperPDFCode(official)

Abstract

Head pose estimation, which computes the intrinsic Euler angles (yaw, pitch, roll) from the human, is crucial for gaze estimation, face alignment, and 3D reconstruction. Traditional approaches heavily relies on the accuracy of facial landmarks. It limits their performances, especially when the visibility of the face is not in good condition. In this paper, to do the estimation without facial landmarks, we combine the coarse and fine regression output together for a deep network. Utilizing more quantization units for the angles, a fine classifier is trained with the help of other auxiliary coarse units. Integrating regression is adopted to get the final prediction. The proposed approach is evaluated on three challenging benchmarks. It achieves the state-of-the-art on AFLW2000, BIWI and performs favorably on AFLW. The code has been released on Github.

Results

TaskDatasetMetricValueModel
Pose EstimationAFLW2000MAE5.395Hybrid Coarse-Fine
Pose EstimationBIWIMAE (trained with BIWI data)3.0174Hybrid Coarse-Fine
Pose EstimationAFLWMAE5.09Hybrid Coarse-Fine
3DAFLW2000MAE5.395Hybrid Coarse-Fine
3DBIWIMAE (trained with BIWI data)3.0174Hybrid Coarse-Fine
3DAFLWMAE5.09Hybrid Coarse-Fine
1 Image, 2*2 StitchiAFLW2000MAE5.395Hybrid Coarse-Fine
1 Image, 2*2 StitchiBIWIMAE (trained with BIWI data)3.0174Hybrid Coarse-Fine
1 Image, 2*2 StitchiAFLWMAE5.09Hybrid Coarse-Fine

Related Papers

Efficient Deployment of Spiking Neural Networks on SpiNNaker2 for DVS Gesture Recognition Using Neuromorphic Intermediate Representation2025-09-04Language Integration in Fine-Tuning Multimodal Large Language Models for Image-Based Regression2025-07-20An End-to-End DNN Inference Framework for the SpiNNaker2 Neuromorphic MPSoC2025-07-18Task-Specific Audio Coding for Machines: Machine-Learned Latent Features Are Codes for That Machine2025-07-17Angle Estimation of a Single Source with Massive Uniform Circular Arrays2025-07-17$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17Revisiting Reliability in the Reasoning-based Pose Estimation Benchmark2025-07-17DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model2025-07-17