Roberto Valle, José M. Buenaposada, Antonio Valdés, Luis Baumela
Face alignment algorithms locate a set of landmark points in images of faces taken in unrestricted situations. State-of-the-art approaches typically fail or lose accuracy in the presence of occlusions, strong deformations, large pose variations and ambiguous configurations. In this paper we present 3DDE, a robust and efficient face alignment algorithm based on a coarse-to-fine cascade of ensembles of regression trees. It is initialized by robustly fitting a 3D face model to the probability maps produced by a convolutional neural network. With this initialization we address self-occlusions and large face rotations. Further, the regressor implicitly imposes a prior face shape on the solution, addressing occlusions and ambiguous face configurations. Its coarse-to-fine structure tackles the combinatorial explosion of parts deformation. In the experiments performed, 3DDE improves the state-of-the-art in 300W, COFW, AFLW and WFLW data sets. Finally, we perform cross-dataset experiments that reveal the existence of a significant data set bias in these benchmarks.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Facial Recognition and Modelling | COFW | Recall at 80% precision (Landmarks Visibility) | 63.89 | 3DDE (Inter-pupil Norm) |
| Facial Recognition and Modelling | 300W | NME_inter-ocular (%, Challenge) | 4.92 | 3DDE |
| Facial Recognition and Modelling | 300W | NME_inter-ocular (%, Common) | 2.69 | 3DDE |
| Facial Recognition and Modelling | 300W | NME_inter-ocular (%, Full) | 3.13 | 3DDE |
| Facial Recognition and Modelling | 300W | NME_inter-pupil (%, Challenge) | 7.1 | 3DDE |
| Facial Recognition and Modelling | 300W | NME_inter-pupil (%, Common) | 3.73 | 3DDE |
| Facial Recognition and Modelling | 300W | NME_inter-pupil (%, Full) | 4.39 | 3DDE |
| Facial Recognition and Modelling | WFLW | AUC@10 (inter-ocular) | 55.44 | 3DDE |
| Facial Recognition and Modelling | WFLW | FR@10 (inter-ocular) | 5.04 | 3DDE |
| Facial Recognition and Modelling | WFLW | NME (inter-ocular) | 4.68 | 3DDE |
| Facial Recognition and Modelling | 300W Split 2 | AUC@8 (inter-ocular) | 53.94 | 3DDE |
| Facial Recognition and Modelling | 300W Split 2 | FR@8 (inter-ocular) | 2.33 | 3DDE |
| Facial Recognition and Modelling | 300W Split 2 | NME (inter-ocular) | 3.73 | 3DDE |
| Facial Recognition and Modelling | 300W | NME | 3.13 | 3DDE (Inter-ocular Norm) |
| Facial Recognition and Modelling | AFLW-Full | Mean NME | 2.01 | 3DDE (Box height Norm, 19 landmarks - no earlobs) |
| Facial Landmark Detection | 300W | NME | 3.13 | 3DDE (Inter-ocular Norm) |
| Facial Landmark Detection | AFLW-Full | Mean NME | 2.01 | 3DDE (Box height Norm, 19 landmarks - no earlobs) |
| Face Reconstruction | COFW | Recall at 80% precision (Landmarks Visibility) | 63.89 | 3DDE (Inter-pupil Norm) |
| Face Reconstruction | 300W | NME_inter-ocular (%, Challenge) | 4.92 | 3DDE |
| Face Reconstruction | 300W | NME_inter-ocular (%, Common) | 2.69 | 3DDE |
| Face Reconstruction | 300W | NME_inter-ocular (%, Full) | 3.13 | 3DDE |
| Face Reconstruction | 300W | NME_inter-pupil (%, Challenge) | 7.1 | 3DDE |
| Face Reconstruction | 300W | NME_inter-pupil (%, Common) | 3.73 | 3DDE |
| Face Reconstruction | 300W | NME_inter-pupil (%, Full) | 4.39 | 3DDE |
| Face Reconstruction | 300W Split 2 | AUC@8 (inter-ocular) | 53.94 | 3DDE |
| Face Reconstruction | 300W Split 2 | FR@8 (inter-ocular) | 2.33 | 3DDE |
| Face Reconstruction | 300W Split 2 | NME (inter-ocular) | 3.73 | 3DDE |
| Face Reconstruction | WFLW | AUC@10 (inter-ocular) | 55.44 | 3DDE |
| Face Reconstruction | WFLW | FR@10 (inter-ocular) | 5.04 | 3DDE |
| Face Reconstruction | WFLW | NME (inter-ocular) | 4.68 | 3DDE |
| Face Reconstruction | 300W | NME | 3.13 | 3DDE (Inter-ocular Norm) |
| Face Reconstruction | AFLW-Full | Mean NME | 2.01 | 3DDE (Box height Norm, 19 landmarks - no earlobs) |
| 3D | COFW | Recall at 80% precision (Landmarks Visibility) | 63.89 | 3DDE (Inter-pupil Norm) |
| 3D | 300W | NME_inter-ocular (%, Challenge) | 4.92 | 3DDE |
| 3D | 300W | NME_inter-ocular (%, Common) | 2.69 | 3DDE |
| 3D | 300W | NME_inter-ocular (%, Full) | 3.13 | 3DDE |
| 3D | 300W | NME_inter-pupil (%, Challenge) | 7.1 | 3DDE |
| 3D | 300W | NME_inter-pupil (%, Common) | 3.73 | 3DDE |
| 3D | 300W | NME_inter-pupil (%, Full) | 4.39 | 3DDE |
| 3D | 300W Split 2 | AUC@8 (inter-ocular) | 53.94 | 3DDE |
| 3D | 300W Split 2 | FR@8 (inter-ocular) | 2.33 | 3DDE |
| 3D | 300W Split 2 | NME (inter-ocular) | 3.73 | 3DDE |
| 3D | WFLW | AUC@10 (inter-ocular) | 55.44 | 3DDE |
| 3D | WFLW | FR@10 (inter-ocular) | 5.04 | 3DDE |
| 3D | WFLW | NME (inter-ocular) | 4.68 | 3DDE |
| 3D | 300W | NME | 3.13 | 3DDE (Inter-ocular Norm) |
| 3D | AFLW-Full | Mean NME | 2.01 | 3DDE (Box height Norm, 19 landmarks - no earlobs) |
| 3D Face Modelling | COFW | Recall at 80% precision (Landmarks Visibility) | 63.89 | 3DDE (Inter-pupil Norm) |
| 3D Face Modelling | 300W | NME_inter-ocular (%, Challenge) | 4.92 | 3DDE |
| 3D Face Modelling | 300W | NME_inter-ocular (%, Common) | 2.69 | 3DDE |
| 3D Face Modelling | 300W | NME_inter-ocular (%, Full) | 3.13 | 3DDE |
| 3D Face Modelling | 300W | NME_inter-pupil (%, Challenge) | 7.1 | 3DDE |
| 3D Face Modelling | 300W | NME_inter-pupil (%, Common) | 3.73 | 3DDE |
| 3D Face Modelling | 300W | NME_inter-pupil (%, Full) | 4.39 | 3DDE |
| 3D Face Modelling | WFLW | AUC@10 (inter-ocular) | 55.44 | 3DDE |
| 3D Face Modelling | WFLW | FR@10 (inter-ocular) | 5.04 | 3DDE |
| 3D Face Modelling | WFLW | NME (inter-ocular) | 4.68 | 3DDE |
| 3D Face Modelling | 300W Split 2 | AUC@8 (inter-ocular) | 53.94 | 3DDE |
| 3D Face Modelling | 300W Split 2 | FR@8 (inter-ocular) | 2.33 | 3DDE |
| 3D Face Modelling | 300W Split 2 | NME (inter-ocular) | 3.73 | 3DDE |
| 3D Face Modelling | 300W | NME | 3.13 | 3DDE (Inter-ocular Norm) |
| 3D Face Modelling | AFLW-Full | Mean NME | 2.01 | 3DDE (Box height Norm, 19 landmarks - no earlobs) |
| 3D Face Reconstruction | COFW | Recall at 80% precision (Landmarks Visibility) | 63.89 | 3DDE (Inter-pupil Norm) |
| 3D Face Reconstruction | 300W | NME_inter-ocular (%, Challenge) | 4.92 | 3DDE |
| 3D Face Reconstruction | 300W | NME_inter-ocular (%, Common) | 2.69 | 3DDE |
| 3D Face Reconstruction | 300W | NME_inter-ocular (%, Full) | 3.13 | 3DDE |
| 3D Face Reconstruction | 300W | NME_inter-pupil (%, Challenge) | 7.1 | 3DDE |
| 3D Face Reconstruction | 300W | NME_inter-pupil (%, Common) | 3.73 | 3DDE |
| 3D Face Reconstruction | 300W | NME_inter-pupil (%, Full) | 4.39 | 3DDE |
| 3D Face Reconstruction | WFLW | AUC@10 (inter-ocular) | 55.44 | 3DDE |
| 3D Face Reconstruction | WFLW | FR@10 (inter-ocular) | 5.04 | 3DDE |
| 3D Face Reconstruction | WFLW | NME (inter-ocular) | 4.68 | 3DDE |
| 3D Face Reconstruction | 300W Split 2 | AUC@8 (inter-ocular) | 53.94 | 3DDE |
| 3D Face Reconstruction | 300W Split 2 | FR@8 (inter-ocular) | 2.33 | 3DDE |
| 3D Face Reconstruction | 300W Split 2 | NME (inter-ocular) | 3.73 | 3DDE |
| 3D Face Reconstruction | 300W | NME | 3.13 | 3DDE (Inter-ocular Norm) |
| 3D Face Reconstruction | AFLW-Full | Mean NME | 2.01 | 3DDE (Box height Norm, 19 landmarks - no earlobs) |