TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/ASMNet: a Lightweight Deep Neural Network for Face Alignme...

ASMNet: a Lightweight Deep Neural Network for Face Alignment and Pose Estimation

Ali Pourramezan Fard, Hojjat Abdollahi, Mohammad Mahoor

2021-02-27Face AlignmentTransfer LearningPose EstimationHead Pose Estimation
PaperPDFCode(official)

Abstract

Active Shape Model (ASM) is a statistical model of object shapes that represents a target structure. ASM can guide machine learning algorithms to fit a set of points representing an object (e.g., face) onto an image. This paper presents a lightweight Convolutional Neural Network (CNN) architecture with a loss function being assisted by ASM for face alignment and estimating head pose in the wild. We use ASM to first guide the network towards learning a smoother distribution of the facial landmark points. Inspired by transfer learning, during the training process, we gradually harden the regression problem and guide the network towards learning the original landmark points distribution. We define multi-tasks in our loss function that are responsible for detecting facial landmark points as well as estimating the face pose. Learning multiple correlated tasks simultaneously builds synergy and improves the performance of individual tasks. We compare the performance of our proposed model called ASMNet with MobileNetV2 (which is about 2 times bigger than ASMNet) in both the face alignment and pose estimation tasks. Experimental results on challenging datasets show that by using the proposed ASM assisted loss function, the ASMNet performance is comparable with MobileNetV2 in the face alignment task. In addition, for face pose estimation, ASMNet performs much better than MobileNetV2. ASMNet achieves an acceptable performance for facial landmark points detection and pose estimation while having a significantly smaller number of parameters and floating-point operations compared to many CNN-based models.

Results

TaskDatasetMetricValueModel
Facial Recognition and Modelling300WNME_inter-ocular (%, Challenge)7.35MobileNetV2
Facial Recognition and Modelling300WNME_inter-ocular (%, Common)3.88MobileNetV2
Facial Recognition and Modelling300WNME_inter-ocular (%, Full)4.59MobileNetV2
Facial Recognition and Modelling300WNME_inter-ocular (%, Challenge)8.2ASMNet
Facial Recognition and Modelling300WNME_inter-ocular (%, Common)4.82ASMNet
Facial Recognition and Modelling300WNME_inter-ocular (%, Full)5.5ASMNet
Facial Recognition and ModellingWFLWNME (inter-ocular)9.41MobileNetV2
Facial Recognition and ModellingWFLWNME (inter-ocular)10.77ASMNet
Pose Estimation300W (Full)MAE pitch (º)1.8ASMNet
Pose Estimation300W (Full)MAE roll (º)1.24ASMNet
Pose Estimation300W (Full)MAE yaw (º)1.62ASMNet
Pose EstimationWFLWMAE mean (º)2.7ASMNet
Pose EstimationWFLWMAE pitch (º)2.93ASMNet
Pose EstimationWFLWMAE roll (º)2.21ASMNet
Pose EstimationWFLWMAE yaw (º)2.97ASMNet
Pose EstimationCOFWMAE pitch (º)2.72ASMNet
Pose EstimationCOFWMAE yaw (º)2.91ASMNet
Face Reconstruction300WNME_inter-ocular (%, Challenge)7.35MobileNetV2
Face Reconstruction300WNME_inter-ocular (%, Common)3.88MobileNetV2
Face Reconstruction300WNME_inter-ocular (%, Full)4.59MobileNetV2
Face Reconstruction300WNME_inter-ocular (%, Challenge)8.2ASMNet
Face Reconstruction300WNME_inter-ocular (%, Common)4.82ASMNet
Face Reconstruction300WNME_inter-ocular (%, Full)5.5ASMNet
Face ReconstructionWFLWNME (inter-ocular)9.41MobileNetV2
Face ReconstructionWFLWNME (inter-ocular)10.77ASMNet
3D300W (Full)MAE pitch (º)1.8ASMNet
3D300W (Full)MAE roll (º)1.24ASMNet
3D300W (Full)MAE yaw (º)1.62ASMNet
3DWFLWMAE mean (º)2.7ASMNet
3DWFLWMAE pitch (º)2.93ASMNet
3DWFLWMAE roll (º)2.21ASMNet
3DWFLWMAE yaw (º)2.97ASMNet
3DCOFWMAE pitch (º)2.72ASMNet
3DCOFWMAE yaw (º)2.91ASMNet
3D300WNME_inter-ocular (%, Challenge)7.35MobileNetV2
3D300WNME_inter-ocular (%, Common)3.88MobileNetV2
3D300WNME_inter-ocular (%, Full)4.59MobileNetV2
3D300WNME_inter-ocular (%, Challenge)8.2ASMNet
3D300WNME_inter-ocular (%, Common)4.82ASMNet
3D300WNME_inter-ocular (%, Full)5.5ASMNet
3DWFLWNME (inter-ocular)9.41MobileNetV2
3DWFLWNME (inter-ocular)10.77ASMNet
3D Face Modelling300WNME_inter-ocular (%, Challenge)7.35MobileNetV2
3D Face Modelling300WNME_inter-ocular (%, Common)3.88MobileNetV2
3D Face Modelling300WNME_inter-ocular (%, Full)4.59MobileNetV2
3D Face Modelling300WNME_inter-ocular (%, Challenge)8.2ASMNet
3D Face Modelling300WNME_inter-ocular (%, Common)4.82ASMNet
3D Face Modelling300WNME_inter-ocular (%, Full)5.5ASMNet
3D Face ModellingWFLWNME (inter-ocular)9.41MobileNetV2
3D Face ModellingWFLWNME (inter-ocular)10.77ASMNet
3D Face Reconstruction300WNME_inter-ocular (%, Challenge)7.35MobileNetV2
3D Face Reconstruction300WNME_inter-ocular (%, Common)3.88MobileNetV2
3D Face Reconstruction300WNME_inter-ocular (%, Full)4.59MobileNetV2
3D Face Reconstruction300WNME_inter-ocular (%, Challenge)8.2ASMNet
3D Face Reconstruction300WNME_inter-ocular (%, Common)4.82ASMNet
3D Face Reconstruction300WNME_inter-ocular (%, Full)5.5ASMNet
3D Face ReconstructionWFLWNME (inter-ocular)9.41MobileNetV2
3D Face ReconstructionWFLWNME (inter-ocular)10.77ASMNet
1 Image, 2*2 Stitchi300W (Full)MAE pitch (º)1.8ASMNet
1 Image, 2*2 Stitchi300W (Full)MAE roll (º)1.24ASMNet
1 Image, 2*2 Stitchi300W (Full)MAE yaw (º)1.62ASMNet
1 Image, 2*2 StitchiWFLWMAE mean (º)2.7ASMNet
1 Image, 2*2 StitchiWFLWMAE pitch (º)2.93ASMNet
1 Image, 2*2 StitchiWFLWMAE roll (º)2.21ASMNet
1 Image, 2*2 StitchiWFLWMAE yaw (º)2.97ASMNet
1 Image, 2*2 StitchiCOFWMAE pitch (º)2.72ASMNet
1 Image, 2*2 StitchiCOFWMAE yaw (º)2.91ASMNet

Related Papers

RaMen: Multi-Strategy Multi-Modal Learning for Bundle Construction2025-07-18Disentangling coincident cell events using deep transfer learning and compressive sensing2025-07-17$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17Revisiting Reliability in the Reasoning-based Pose Estimation Benchmark2025-07-17DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model2025-07-17From Neck to Head: Bio-Impedance Sensing for Head Pose Estimation2025-07-17AthleticsPose: Authentic Sports Motion Dataset on Athletic Field and Evaluation of Monocular 3D Pose Estimation Ability2025-07-17Best Practices for Large-Scale, Pixel-Wise Crop Mapping and Transfer Learning Workflows2025-07-16