Erroll Wood, Tadas Baltrusaitis, Charlie Hewitt, Matthew Johnson, Jingjing Shen, Nikola Milosavljevic, Daniel Wilde, Stephan Garbin, Chirag Raman, Jamie Shotton, Toby Sharp, Ivan Stojiljkovic, Tom Cashman, Julien Valentin
Landmarks often play a key role in face analysis, but many aspects of identity or expression cannot be represented by sparse landmarks alone. Thus, in order to reconstruct faces more accurately, landmarks are often combined with additional signals like depth images or techniques like differentiable rendering. Can we keep things simple by just using more landmarks? In answer, we present the first method that accurately predicts 10x as many landmarks as usual, covering the whole head, including the eyes and teeth. This is accomplished using synthetic training data, which guarantees perfect landmark annotations. By fitting a morphable model to these dense landmarks, we achieve state-of-the-art results for monocular 3D face reconstruction in the wild. We show that dense landmarks are an ideal signal for integrating face shape information across frames by demonstrating accurate and expressive facial performance capture in both monocular and multi-view scenarios. This approach is also highly efficient: we can predict dense landmarks and fit our 3D face model at over 150FPS on a single CPU thread. Please see our website: https://microsoft.github.io/DenseLandmarks/.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Facial Recognition and Modelling | 300W | NME_inter-ocular (%, Challenge) | 4.8 | DenseLandmarks (GNLL) |
| Facial Recognition and Modelling | 300W | NME_inter-ocular (%, Common) | 3.03 | DenseLandmarks (GNLL) |
| Facial Recognition and Modelling | Florence | RMSE Cooperative | 1.43 | DenseLandmarks (Multi-view) |
| Facial Recognition and Modelling | Florence | RMSE Indoor | 1.42 | DenseLandmarks (Multi-view) |
| Facial Recognition and Modelling | Florence | RMSE Outdoor | 1.42 | DenseLandmarks (Multi-view) |
| Facial Recognition and Modelling | Florence | RMSE Cooperative | 1.64 | DenseLandmarks (Single-view) |
| Facial Recognition and Modelling | Florence | RMSE Indoor | 1.62 | DenseLandmarks (Single-view) |
| Facial Recognition and Modelling | Florence | RMSE Outdoor | 1.61 | DenseLandmarks (Single-view) |
| Facial Recognition and Modelling | NoW Benchmark | Mean Reconstruction Error (mm) | 1.01 | DenseLandmarks (Multi-view) |
| Facial Recognition and Modelling | NoW Benchmark | Median Reconstruction Error | 0.81 | DenseLandmarks (Multi-view) |
| Facial Recognition and Modelling | NoW Benchmark | Stdev Reconstruction Error (mm) | 0.84 | DenseLandmarks (Multi-view) |
| Facial Recognition and Modelling | NoW Benchmark | Mean Reconstruction Error (mm) | 1.28 | DenseLandmarks (Single-view) |
| Facial Recognition and Modelling | NoW Benchmark | Median Reconstruction Error | 1.02 | DenseLandmarks (Single-view) |
| Facial Recognition and Modelling | NoW Benchmark | Stdev Reconstruction Error (mm) | 1.08 | DenseLandmarks (Single-view) |
| Face Reconstruction | Florence | RMSE Cooperative | 1.43 | DenseLandmarks (Multi-view) |
| Face Reconstruction | Florence | RMSE Indoor | 1.42 | DenseLandmarks (Multi-view) |
| Face Reconstruction | Florence | RMSE Outdoor | 1.42 | DenseLandmarks (Multi-view) |
| Face Reconstruction | Florence | RMSE Cooperative | 1.64 | DenseLandmarks (Single-view) |
| Face Reconstruction | Florence | RMSE Indoor | 1.62 | DenseLandmarks (Single-view) |
| Face Reconstruction | Florence | RMSE Outdoor | 1.61 | DenseLandmarks (Single-view) |
| Face Reconstruction | NoW Benchmark | Mean Reconstruction Error (mm) | 1.01 | DenseLandmarks (Multi-view) |
| Face Reconstruction | NoW Benchmark | Median Reconstruction Error | 0.81 | DenseLandmarks (Multi-view) |
| Face Reconstruction | NoW Benchmark | Stdev Reconstruction Error (mm) | 0.84 | DenseLandmarks (Multi-view) |
| Face Reconstruction | NoW Benchmark | Mean Reconstruction Error (mm) | 1.28 | DenseLandmarks (Single-view) |
| Face Reconstruction | NoW Benchmark | Median Reconstruction Error | 1.02 | DenseLandmarks (Single-view) |
| Face Reconstruction | NoW Benchmark | Stdev Reconstruction Error (mm) | 1.08 | DenseLandmarks (Single-view) |
| Face Reconstruction | 300W | NME_inter-ocular (%, Challenge) | 4.8 | DenseLandmarks (GNLL) |
| Face Reconstruction | 300W | NME_inter-ocular (%, Common) | 3.03 | DenseLandmarks (GNLL) |
| 3D | Florence | RMSE Cooperative | 1.43 | DenseLandmarks (Multi-view) |
| 3D | Florence | RMSE Indoor | 1.42 | DenseLandmarks (Multi-view) |
| 3D | Florence | RMSE Outdoor | 1.42 | DenseLandmarks (Multi-view) |
| 3D | Florence | RMSE Cooperative | 1.64 | DenseLandmarks (Single-view) |
| 3D | Florence | RMSE Indoor | 1.62 | DenseLandmarks (Single-view) |
| 3D | Florence | RMSE Outdoor | 1.61 | DenseLandmarks (Single-view) |
| 3D | NoW Benchmark | Mean Reconstruction Error (mm) | 1.01 | DenseLandmarks (Multi-view) |
| 3D | NoW Benchmark | Median Reconstruction Error | 0.81 | DenseLandmarks (Multi-view) |
| 3D | NoW Benchmark | Stdev Reconstruction Error (mm) | 0.84 | DenseLandmarks (Multi-view) |
| 3D | NoW Benchmark | Mean Reconstruction Error (mm) | 1.28 | DenseLandmarks (Single-view) |
| 3D | NoW Benchmark | Median Reconstruction Error | 1.02 | DenseLandmarks (Single-view) |
| 3D | NoW Benchmark | Stdev Reconstruction Error (mm) | 1.08 | DenseLandmarks (Single-view) |
| 3D | 300W | NME_inter-ocular (%, Challenge) | 4.8 | DenseLandmarks (GNLL) |
| 3D | 300W | NME_inter-ocular (%, Common) | 3.03 | DenseLandmarks (GNLL) |
| 3D Face Modelling | 300W | NME_inter-ocular (%, Challenge) | 4.8 | DenseLandmarks (GNLL) |
| 3D Face Modelling | 300W | NME_inter-ocular (%, Common) | 3.03 | DenseLandmarks (GNLL) |
| 3D Face Modelling | Florence | RMSE Cooperative | 1.43 | DenseLandmarks (Multi-view) |
| 3D Face Modelling | Florence | RMSE Indoor | 1.42 | DenseLandmarks (Multi-view) |
| 3D Face Modelling | Florence | RMSE Outdoor | 1.42 | DenseLandmarks (Multi-view) |
| 3D Face Modelling | Florence | RMSE Cooperative | 1.64 | DenseLandmarks (Single-view) |
| 3D Face Modelling | Florence | RMSE Indoor | 1.62 | DenseLandmarks (Single-view) |
| 3D Face Modelling | Florence | RMSE Outdoor | 1.61 | DenseLandmarks (Single-view) |
| 3D Face Modelling | NoW Benchmark | Mean Reconstruction Error (mm) | 1.01 | DenseLandmarks (Multi-view) |
| 3D Face Modelling | NoW Benchmark | Median Reconstruction Error | 0.81 | DenseLandmarks (Multi-view) |
| 3D Face Modelling | NoW Benchmark | Stdev Reconstruction Error (mm) | 0.84 | DenseLandmarks (Multi-view) |
| 3D Face Modelling | NoW Benchmark | Mean Reconstruction Error (mm) | 1.28 | DenseLandmarks (Single-view) |
| 3D Face Modelling | NoW Benchmark | Median Reconstruction Error | 1.02 | DenseLandmarks (Single-view) |
| 3D Face Modelling | NoW Benchmark | Stdev Reconstruction Error (mm) | 1.08 | DenseLandmarks (Single-view) |
| 3D Face Reconstruction | Florence | RMSE Cooperative | 1.43 | DenseLandmarks (Multi-view) |
| 3D Face Reconstruction | Florence | RMSE Indoor | 1.42 | DenseLandmarks (Multi-view) |
| 3D Face Reconstruction | Florence | RMSE Outdoor | 1.42 | DenseLandmarks (Multi-view) |
| 3D Face Reconstruction | Florence | RMSE Cooperative | 1.64 | DenseLandmarks (Single-view) |
| 3D Face Reconstruction | Florence | RMSE Indoor | 1.62 | DenseLandmarks (Single-view) |
| 3D Face Reconstruction | Florence | RMSE Outdoor | 1.61 | DenseLandmarks (Single-view) |
| 3D Face Reconstruction | NoW Benchmark | Mean Reconstruction Error (mm) | 1.01 | DenseLandmarks (Multi-view) |
| 3D Face Reconstruction | NoW Benchmark | Median Reconstruction Error | 0.81 | DenseLandmarks (Multi-view) |
| 3D Face Reconstruction | NoW Benchmark | Stdev Reconstruction Error (mm) | 0.84 | DenseLandmarks (Multi-view) |
| 3D Face Reconstruction | NoW Benchmark | Mean Reconstruction Error (mm) | 1.28 | DenseLandmarks (Single-view) |
| 3D Face Reconstruction | NoW Benchmark | Median Reconstruction Error | 1.02 | DenseLandmarks (Single-view) |
| 3D Face Reconstruction | NoW Benchmark | Stdev Reconstruction Error (mm) | 1.08 | DenseLandmarks (Single-view) |
| 3D Face Reconstruction | 300W | NME_inter-ocular (%, Challenge) | 4.8 | DenseLandmarks (GNLL) |
| 3D Face Reconstruction | 300W | NME_inter-ocular (%, Common) | 3.03 | DenseLandmarks (GNLL) |