Abhinav Kumar, Tim K. Marks, Wenxuan Mou, Ye Wang, Michael Jones, Anoop Cherian, Toshiaki Koike-Akino, Xiaoming Liu, Chen Feng
Modern face alignment methods have become quite accurate at predicting the locations of facial landmarks, but they do not typically estimate the uncertainty of their predicted locations nor predict whether landmarks are visible. In this paper, we present a novel framework for jointly predicting landmark locations, associated uncertainties of these predicted locations, and landmark visibilities. We model these as mixed random variables and estimate them using a deep network trained with our proposed Location, Uncertainty, and Visibility Likelihood (LUVLi) loss. In addition, we release an entirely new labeling of a large face alignment dataset with over 19,000 face images in a full range of head poses. Each face is manually labeled with the ground-truth locations of 68 landmarks, with the additional information of whether each landmark is unoccluded, self-occluded (due to extreme head poses), or externally occluded. Not only does our joint estimation yield accurate estimates of the uncertainty of predicted landmark locations, but it also yields state-of-the-art estimates for the landmark locations themselves on multiple standard face alignment datasets. Our method's estimates of the uncertainty of predicted landmark locations could be used to automatically identify input images on which face alignment fails, which can be critical for downstream tasks.
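
The joint prediction described in the abstract can be illustrated with a small per-landmark negative log-likelihood. The following is only a sketch under stated assumptions, not the authors' implementation: it assumes a 2D Gaussian likelihood for the location, a Bernoulli likelihood for visibility, and that the location term is counted only for visible landmarks. The function name `joint_nll` and all argument names are hypothetical.

```python
import numpy as np

def joint_nll(p, mu, sigma, v, v_hat, eps=1e-8):
    """Illustrative per-landmark loss (assumed form, not the paper's exact loss).

    p, mu  : (N, 2) ground-truth and predicted landmark locations
    sigma  : (N, 2, 2) predicted covariance matrices (symmetric positive definite)
    v      : (N,) ground-truth visibility, 1 = visible, 0 = not visible
    v_hat  : (N,) predicted probability that the landmark is visible
    """
    diff = (p - mu)[..., None]                              # (N, 2, 1)
    # Squared Mahalanobis distance of the ground truth under the prediction.
    maha = np.swapaxes(diff, -1, -2) @ np.linalg.inv(sigma) @ diff
    maha = np.squeeze(maha, axis=(-1, -2))                  # (N,)
    _, logdet = np.linalg.slogdet(sigma)                    # log |sigma|
    # 2D Gaussian negative log-likelihood: larger covariance lowers the
    # penalty for a location error but pays a log-determinant cost.
    loc_nll = 0.5 * (maha + logdet) + np.log(2.0 * np.pi)
    # Bernoulli negative log-likelihood for visibility.
    vis_nll = -(v * np.log(v_hat + eps) + (1.0 - v) * np.log(1.0 - v_hat + eps))
    # Location term contributes only where the landmark is labeled visible.
    return vis_nll + v * loc_nll

# Example: two landmarks, the second one labeled as not visible.
p = np.array([[10.0, 12.0], [30.0, 8.0]])
mu = np.array([[11.0, 12.5], [25.0, 9.0]])
sigma = np.tile(np.eye(2) * 4.0, (2, 1, 1))
print(joint_nll(p, mu, sigma, np.array([1.0, 0.0]), np.array([0.9, 0.2])))
```

In this form, the predicted covariance acts as the uncertainty estimate: a network can reduce the loss on hard landmarks by predicting a larger covariance, which is what makes the uncertainty usable for flagging likely alignment failures.
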
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Facial Recognition and Modelling | WFW (Extra Data) | AUC@10 (inter-ocular) | 57.7 | LUVLi |
| Facial Recognition and Modelling | WFW (Extra Data) | FR@10 (inter-ocular) | 3.12 | LUVLi |
| Facial Recognition and Modelling | WFW (Extra Data) | NME (inter-ocular) | 4.37 | LUVLi |
| Facial Recognition and Modelling | COFW-68 (300WLP) | AUC@7 | 63.4 | LUVLi |
| Facial Recognition and Modelling | COFW-68 (300WLP) | NME (box) | 2.57 | LUVLi |
| Facial Recognition and Modelling | 300W (Common) | NME | 2.76 | LUVLi |
| Facial Recognition and Modelling | Menpo | AUC (box) | 71.9 | LUVLi |
| Facial Recognition and Modelling | Menpo | NME (box) | 2.04 | LUVLi |
| Facial Recognition and Modelling | 300W Split 2 (300W-LP) | AUC@7 (bbox) | 70.2 | LUVLi |
| Facial Recognition and Modelling | 300W Split 2 (300W-LP) | NME (bbox) | 2.1 | LUVLi |
| Facial Recognition and Modelling | AFLW-19 | AUC_box@0.07 (%, Full) | 68 | LUVLi |
| Facial Recognition and Modelling | AFLW-19 | NME_box (%, Full) | 2.28 | LUVLi |
| Facial Recognition and Modelling | AFLW-19 | NME_diag (%, Frontal) | 1.19 | LUVLi |
| Facial Recognition and Modelling | AFLW-19 | NME_diag (%, Full) | 1.39 | LUVLi |
| Facial Recognition and Modelling | COFW-68 | AUC@7 (box) | 60.8 | LUVLi |
| Facial Recognition and Modelling | COFW-68 | NME (box) | 2.75 | LUVLi |
| Facial Recognition and Modelling | MERL-RAV | AUC@7 (box) | 77.08 | LUVLi |
| Facial Recognition and Modelling | MERL-RAV | NME (box) | 1.61 | LUVLi |
| Facial Recognition and Modelling | WFLW | AUC@10 (inter-ocular) | 57.7 | LUVLi |
| Facial Recognition and Modelling | WFLW | FR@10 (inter-ocular) | 3.12 | LUVLi |
| Facial Recognition and Modelling | WFLW | NME (inter-ocular) | 4.37 | LUVLi |
| Facial Recognition and Modelling | 300W Split 2 | AUC@7 (box) | 68.3 | LUVLi |
| Facial Recognition and Modelling | 300W Split 2 | NME (box) | 2.24 | LUVLi |
| Face Reconstruction | 300W Split 2 (300W-LP) | AUC@7 (bbox) | 70.2 | LUVLi |
| Face Reconstruction | 300W Split 2 (300W-LP) | NME (bbox) | 2.1 | LUVLi |
| Face Reconstruction | COFW-68 (300WLP) | AUC@7 | 63.4 | LUVLi |
| Face Reconstruction | COFW-68 (300WLP) | NME (box) | 2.57 | LUVLi |
| Face Reconstruction | MERL-RAV | AUC@7 (box) | 77.08 | LUVLi |
| Face Reconstruction | MERL-RAV | NME (box) | 1.61 | LUVLi |
| Face Reconstruction | COFW-68 | AUC@7 (box) | 60.8 | LUVLi |
| Face Reconstruction | COFW-68 | NME (box) | 2.75 | LUVLi |
| Face Reconstruction | 300W (Common) | NME | 2.76 | LUVLi |
| Face Reconstruction | WFW (Extra Data) | AUC@10 (inter-ocular) | 57.7 | LUVLi |
| Face Reconstruction | WFW (Extra Data) | FR@10 (inter-ocular) | 3.12 | LUVLi |
| Face Reconstruction | WFW (Extra Data) | NME (inter-ocular) | 4.37 | LUVLi |
| Face Reconstruction | 300W Split 2 | AUC@7 (box) | 68.3 | LUVLi |
| Face Reconstruction | 300W Split 2 | NME (box) | 2.24 | LUVLi |
| Face Reconstruction | AFLW-19 | AUC_box@0.07 (%, Full) | 68 | LUVLi |
| Face Reconstruction | AFLW-19 | NME_box (%, Full) | 2.28 | LUVLi |
| Face Reconstruction | AFLW-19 | NME_diag (%, Frontal) | 1.19 | LUVLi |
| Face Reconstruction | AFLW-19 | NME_diag (%, Full) | 1.39 | LUVLi |
| Face Reconstruction | Menpo | AUC (box) | 71.9 | LUVLi |
| Face Reconstruction | Menpo | NME (box) | 2.04 | LUVLi |
| Face Reconstruction | WFLW | AUC@10 (inter-ocular) | 57.7 | LUVLi |
| Face Reconstruction | WFLW | FR@10 (inter-ocular) | 3.12 | LUVLi |
| Face Reconstruction | WFLW | NME (inter-ocular) | 4.37 | LUVLi |
| 3D | 300W Split 2 (300W-LP) | AUC@7 (bbox) | 70.2 | LUVLi |
| 3D | 300W Split 2 (300W-LP) | NME (bbox) | 2.1 | LUVLi |
| 3D | COFW-68 (300WLP) | AUC@7 | 63.4 | LUVLi |
| 3D | COFW-68 (300WLP) | NME (box) | 2.57 | LUVLi |
| 3D | MERL-RAV | AUC@7 (box) | 77.08 | LUVLi |
| 3D | MERL-RAV | NME (box) | 1.61 | LUVLi |
| 3D | COFW-68 | AUC@7 (box) | 60.8 | LUVLi |
| 3D | COFW-68 | NME (box) | 2.75 | LUVLi |
| 3D | 300W (Common) | NME | 2.76 | LUVLi |
| 3D | WFW (Extra Data) | AUC@10 (inter-ocular) | 57.7 | LUVLi |
| 3D | WFW (Extra Data) | FR@10 (inter-ocular) | 3.12 | LUVLi |
| 3D | WFW (Extra Data) | NME (inter-ocular) | 4.37 | LUVLi |
| 3D | 300W Split 2 | AUC@7 (box) | 68.3 | LUVLi |
| 3D | 300W Split 2 | NME (box) | 2.24 | LUVLi |
| 3D | AFLW-19 | AUC_box@0.07 (%, Full) | 68 | LUVLi |
| 3D | AFLW-19 | NME_box (%, Full) | 2.28 | LUVLi |
| 3D | AFLW-19 | NME_diag (%, Frontal) | 1.19 | LUVLi |
| 3D | AFLW-19 | NME_diag (%, Full) | 1.39 | LUVLi |
| 3D | Menpo | AUC (box) | 71.9 | LUVLi |
| 3D | Menpo | NME (box) | 2.04 | LUVLi |
| 3D | WFLW | AUC@10 (inter-ocular) | 57.7 | LUVLi |
| 3D | WFLW | FR@10 (inter-ocular) | 3.12 | LUVLi |
| 3D | WFLW | NME (inter-ocular) | 4.37 | LUVLi |
| 3D Face Modelling | WFW (Extra Data) | AUC@10 (inter-ocular) | 57.7 | LUVLi |
| 3D Face Modelling | WFW (Extra Data) | FR@10 (inter-ocular) | 3.12 | LUVLi |
| 3D Face Modelling | WFW (Extra Data) | NME (inter-ocular) | 4.37 | LUVLi |
| 3D Face Modelling | COFW-68 (300WLP) | AUC@7 | 63.4 | LUVLi |
| 3D Face Modelling | COFW-68 (300WLP) | NME (box) | 2.57 | LUVLi |
| 3D Face Modelling | 300W (Common) | NME | 2.76 | LUVLi |
| 3D Face Modelling | Menpo | AUC (box) | 71.9 | LUVLi |
| 3D Face Modelling | Menpo | NME (box) | 2.04 | LUVLi |
| 3D Face Modelling | 300W Split 2 (300W-LP) | AUC@7 (bbox) | 70.2 | LUVLi |
| 3D Face Modelling | 300W Split 2 (300W-LP) | NME (bbox) | 2.1 | LUVLi |
| 3D Face Modelling | AFLW-19 | AUC_box@0.07 (%, Full) | 68 | LUVLi |
| 3D Face Modelling | AFLW-19 | NME_box (%, Full) | 2.28 | LUVLi |
| 3D Face Modelling | AFLW-19 | NME_diag (%, Frontal) | 1.19 | LUVLi |
| 3D Face Modelling | AFLW-19 | NME_diag (%, Full) | 1.39 | LUVLi |
| 3D Face Modelling | COFW-68 | AUC@7 (box) | 60.8 | LUVLi |
| 3D Face Modelling | COFW-68 | NME (box) | 2.75 | LUVLi |
| 3D Face Modelling | MERL-RAV | AUC@7 (box) | 77.08 | LUVLi |
| 3D Face Modelling | MERL-RAV | NME (box) | 1.61 | LUVLi |
| 3D Face Modelling | WFLW | AUC@10 (inter-ocular) | 57.7 | LUVLi |
| 3D Face Modelling | WFLW | FR@10 (inter-ocular) | 3.12 | LUVLi |
| 3D Face Modelling | WFLW | NME (inter-ocular) | 4.37 | LUVLi |
| 3D Face Modelling | 300W Split 2 | AUC@7 (box) | 68.3 | LUVLi |
| 3D Face Modelling | 300W Split 2 | NME (box) | 2.24 | LUVLi |
| 3D Face Reconstruction | WFW (Extra Data) | AUC@10 (inter-ocular) | 57.7 | LUVLi |
| 3D Face Reconstruction | WFW (Extra Data) | FR@10 (inter-ocular) | 3.12 | LUVLi |
| 3D Face Reconstruction | WFW (Extra Data) | NME (inter-ocular) | 4.37 | LUVLi |
| 3D Face Reconstruction | COFW-68 (300WLP) | AUC@7 | 63.4 | LUVLi |
| 3D Face Reconstruction | COFW-68 (300WLP) | NME (box) | 2.57 | LUVLi |
| 3D Face Reconstruction | 300W (Common) | NME | 2.76 | LUVLi |
| 3D Face Reconstruction | Menpo | AUC (box) | 71.9 | LUVLi |
| 3D Face Reconstruction | Menpo | NME (box) | 2.04 | LUVLi |
| 3D Face Reconstruction | 300W Split 2 (300W-LP) | AUC@7 (bbox) | 70.2 | LUVLi |
| 3D Face Reconstruction | 300W Split 2 (300W-LP) | NME (bbox) | 2.1 | LUVLi |
| 3D Face Reconstruction | AFLW-19 | AUC_box@0.07 (%, Full) | 68 | LUVLi |
| 3D Face Reconstruction | AFLW-19 | NME_box (%, Full) | 2.28 | LUVLi |
| 3D Face Reconstruction | AFLW-19 | NME_diag (%, Frontal) | 1.19 | LUVLi |
| 3D Face Reconstruction | AFLW-19 | NME_diag (%, Full) | 1.39 | LUVLi |
| 3D Face Reconstruction | COFW-68 | AUC@7 (box) | 60.8 | LUVLi |
| 3D Face Reconstruction | COFW-68 | NME (box) | 2.75 | LUVLi |
| 3D Face Reconstruction | MERL-RAV | AUC@7 (box) | 77.08 | LUVLi |
| 3D Face Reconstruction | MERL-RAV | NME (box) | 1.61 | LUVLi |
| 3D Face Reconstruction | WFLW | AUC@10 (inter-ocular) | 57.7 | LUVLi |
| 3D Face Reconstruction | WFLW | FR@10 (inter-ocular) | 3.12 | LUVLi |
| 3D Face Reconstruction | WFLW | NME (inter-ocular) | 4.37 | LUVLi |
| 3D Face Reconstruction | 300W Split 2 | AUC@7 (box) | 68.3 | LUVLi |
| 3D Face Reconstruction | 300W Split 2 | NME (box) | 2.24 | LUVLi |
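
The table reports three kinds of metric: normalized mean error (NME), failure rate (FR), and area under the cumulative error curve (AUC). As a rough guide to how such numbers are commonly computed, here is a minimal sketch. It assumes the normalizer (for example an inter-ocular distance or a bounding-box size) is supplied per image by the caller, and it returns fractions rather than the percentages shown in the table. The function names are hypothetical, and the exact normalization used for each dataset is not specified here.

```python
import numpy as np

def normalized_mean_error(pred, gt, norm):
    """Per-image NME.

    pred, gt : (N, L, 2) predicted and ground-truth landmarks for N images
    norm     : (N,) per-image normalizer (assumed to be supplied by the caller)
    """
    per_landmark = np.linalg.norm(pred - gt, axis=-1)   # (N, L) Euclidean errors
    return per_landmark.mean(axis=1) / norm             # (N,)

def failure_rate(errors, threshold):
    """Fraction of images whose NME exceeds the threshold (e.g. 0.10 for FR@10)."""
    return float(np.mean(errors > threshold))

def auc_ced(errors, threshold, steps=1000):
    """Area under the cumulative error distribution up to the threshold,
    divided by the threshold so that a perfect predictor scores 1.0."""
    xs = np.linspace(0.0, threshold, steps)
    ced = np.array([np.mean(errors <= x) for x in xs])  # fraction of images with error <= x
    area = np.sum(0.5 * (ced[1:] + ced[:-1]) * np.diff(xs))  # trapezoidal rule
    return float(area / threshold)

# Example with synthetic data: 4 images, 68 landmarks each.
rng = np.random.default_rng(0)
gt = rng.uniform(0.0, 200.0, size=(4, 68, 2))
pred = gt + rng.normal(0.0, 3.0, size=gt.shape)
errors = normalized_mean_error(pred, gt, norm=np.full(4, 200.0))
print(errors.mean(), failure_rate(errors, 0.10), auc_ced(errors, 0.07))
```

Lower is better for NME and FR, and higher is better for AUC; multiply the returned values by 100 to compare with percentage-style entries.
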