Felix Rosberg, Eren Erdal Aksoy, Fernando Alonso-Fernandez, Cristofer Englund
In this work, we present a new single-stage method for subject-agnostic face swapping and identity transfer, named FaceDancer. We have two major contributions: Adaptive Feature Fusion Attention (AFFA) and Interpreted Feature Similarity Regularization (IFSR). The AFFA module is embedded in the decoder and adaptively learns to fuse attribute features and features conditioned on identity information without requiring any additional facial segmentation process. In IFSR, we leverage the intermediate features in an identity encoder to preserve important attributes such as head pose, facial expression, lighting, and occlusion in the target face, while still transferring the identity of the source face with high fidelity. We conduct extensive quantitative and qualitative experiments on various datasets and show that the proposed FaceDancer outperforms other state-of-the-art networks in terms of identity transfer, while having significantly better pose preservation than most of the previous methods.
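
The fusion idea behind AFFA can be illustrated with a minimal gated-blending sketch. This is not the FaceDancer implementation: the function name `gated_fusion`, the use of flat feature vectors, and the direction of the gate are assumptions made for illustration, and `gate_logits` is passed in by hand here, whereas a real module would presumably predict it with learned layers.

```python
import math


def sigmoid(x):
    """Logistic function mapping a real number into (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def gated_fusion(attr_feats, id_feats, gate_logits):
    """Blend attribute and identity-conditioned features element-wise.

    Hypothetical helper: each gate value m = sigmoid(logit) weights the
    identity-conditioned feature, and (1 - m) weights the attribute feature.
    """
    if not (len(attr_feats) == len(id_feats) == len(gate_logits)):
        raise ValueError("all inputs must have the same length")
    fused = []
    for a, i, g in zip(attr_feats, id_feats, gate_logits):
        m = sigmoid(g)  # gate in (0, 1), assumed to be learned in practice
        fused.append(m * i + (1.0 - m) * a)
    return fused


if __name__ == "__main__":
    # A zero logit gives an equal mix of the two feature sources.
    print(gated_fusion([0.0, 2.0], [1.0, 4.0], [0.0, 0.0]))
```

Because the gate is computed per element, such a blend can favor identity features in some locations and attribute features in others, which is one way a network could avoid relying on an explicit segmentation mask.
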
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Facial Recognition and Modelling | AFLW2000-3D | ID retrieval | 98.5 | FaceDancer (Config A) |
| Facial Recognition and Modelling | AFLW2000-3D | exp embedding L2 | 7.07 | FaceDancer (Config A) |
| Facial Recognition and Modelling | AFLW2000-3D | pose | 14.97 | FaceDancer (Config A) |
| Facial Recognition and Modelling | AFLW2000-3D | ID retrieval | 97.95 | FaceDancer (Config B) |
| Facial Recognition and Modelling | AFLW2000-3D | exp embedding L2 | 5.74 | FaceDancer (Config B) |
| Facial Recognition and Modelling | AFLW2000-3D | pose | 5.86 | FaceDancer (Config B) |
| Facial Recognition and Modelling | AFLW2000-3D | ID retrieval | 97.65 | FaceDancer (Config C) |
| Facial Recognition and Modelling | AFLW2000-3D | exp embedding L2 | 4.13 | FaceDancer (Config C) |
| Facial Recognition and Modelling | AFLW2000-3D | pose | 5.82 | FaceDancer (Config C) |
| Facial Recognition and Modelling | AFLW2000-3D | ID retrieval | 97.1 | FaceDancer (Config D) |
| Facial Recognition and Modelling | AFLW2000-3D | exp embedding L2 | 4.15 | FaceDancer (Config D) |
| Facial Recognition and Modelling | AFLW2000-3D | pose | 5.75 | FaceDancer (Config D) |
| Facial Recognition and Modelling | FaceForensics++ | ID retrieval | 98.84 | FaceDancer (Config C) |
| Facial Recognition and Modelling | FaceForensics++ | exp embedding L2 | 7.97 | FaceDancer (Config C) |
| Facial Recognition and Modelling | FaceForensics++ | pose | 2.04 | FaceDancer (Config C) |
| Facial Recognition and Modelling | FaceForensics++ | ID retrieval | 98.19 | FaceDancer (Config D) |
| Facial Recognition and Modelling | FaceForensics++ | exp embedding L2 | 5.7 | FaceDancer (Config D) |
| Facial Recognition and Modelling | FaceForensics++ | pose | 2.15 | FaceDancer (Config D) |
| Facial Recognition and Modelling | FaceForensics++ | ID retrieval | 98.54 | FaceDancer (Config B) |
| Facial Recognition and Modelling | FaceForensics++ | exp embedding L2 | 8.52 | FaceDancer (Config B) |
| Facial Recognition and Modelling | FaceForensics++ | pose | 2.24 | FaceDancer (Config B) |
| Face Reconstruction | AFLW2000-3D | ID retrieval | 98.5 | FaceDancer (Config A) |
| Face Reconstruction | AFLW2000-3D | exp embedding L2 | 7.07 | FaceDancer (Config A) |
| Face Reconstruction | AFLW2000-3D | pose | 14.97 | FaceDancer (Config A) |
| Face Reconstruction | AFLW2000-3D | ID retrieval | 97.95 | FaceDancer (Config B) |
| Face Reconstruction | AFLW2000-3D | exp embedding L2 | 5.74 | FaceDancer (Config B) |
| Face Reconstruction | AFLW2000-3D | pose | 5.86 | FaceDancer (Config B) |
| Face Reconstruction | AFLW2000-3D | ID retrieval | 97.65 | FaceDancer (Config C) |
| Face Reconstruction | AFLW2000-3D | exp embedding L2 | 4.13 | FaceDancer (Config C) |
| Face Reconstruction | AFLW2000-3D | pose | 5.82 | FaceDancer (Config C) |
| Face Reconstruction | AFLW2000-3D | ID retrieval | 97.1 | FaceDancer (Config D) |
| Face Reconstruction | AFLW2000-3D | exp embedding L2 | 4.15 | FaceDancer (Config D) |
| Face Reconstruction | AFLW2000-3D | pose | 5.75 | FaceDancer (Config D) |
| Face Reconstruction | FaceForensics++ | ID retrieval | 98.84 | FaceDancer (Config C) |
| Face Reconstruction | FaceForensics++ | exp embedding L2 | 7.97 | FaceDancer (Config C) |
| Face Reconstruction | FaceForensics++ | pose | 2.04 | FaceDancer (Config C) |
| Face Reconstruction | FaceForensics++ | ID retrieval | 98.19 | FaceDancer (Config D) |
| Face Reconstruction | FaceForensics++ | exp embedding L2 | 5.7 | FaceDancer (Config D) |
| Face Reconstruction | FaceForensics++ | pose | 2.15 | FaceDancer (Config D) |
| Face Reconstruction | FaceForensics++ | ID retrieval | 98.54 | FaceDancer (Config B) |
| Face Reconstruction | FaceForensics++ | exp embedding L2 | 8.52 | FaceDancer (Config B) |
| Face Reconstruction | FaceForensics++ | pose | 2.24 | FaceDancer (Config B) |
| 3D | AFLW2000-3D | ID retrieval | 98.5 | FaceDancer (Config A) |
| 3D | AFLW2000-3D | exp embedding L2 | 7.07 | FaceDancer (Config A) |
| 3D | AFLW2000-3D | pose | 14.97 | FaceDancer (Config A) |
| 3D | AFLW2000-3D | ID retrieval | 97.95 | FaceDancer (Config B) |
| 3D | AFLW2000-3D | exp embedding L2 | 5.74 | FaceDancer (Config B) |
| 3D | AFLW2000-3D | pose | 5.86 | FaceDancer (Config B) |
| 3D | AFLW2000-3D | ID retrieval | 97.65 | FaceDancer (Config C) |
| 3D | AFLW2000-3D | exp embedding L2 | 4.13 | FaceDancer (Config C) |
| 3D | AFLW2000-3D | pose | 5.82 | FaceDancer (Config C) |
| 3D | AFLW2000-3D | ID retrieval | 97.1 | FaceDancer (Config D) |
| 3D | AFLW2000-3D | exp embedding L2 | 4.15 | FaceDancer (Config D) |
| 3D | AFLW2000-3D | pose | 5.75 | FaceDancer (Config D) |
| 3D | FaceForensics++ | ID retrieval | 98.84 | FaceDancer (Config C) |
| 3D | FaceForensics++ | exp embedding L2 | 7.97 | FaceDancer (Config C) |
| 3D | FaceForensics++ | pose | 2.04 | FaceDancer (Config C) |
| 3D | FaceForensics++ | ID retrieval | 98.19 | FaceDancer (Config D) |
| 3D | FaceForensics++ | exp embedding L2 | 5.7 | FaceDancer (Config D) |
| 3D | FaceForensics++ | pose | 2.15 | FaceDancer (Config D) |
| 3D | FaceForensics++ | ID retrieval | 98.54 | FaceDancer (Config B) |
| 3D | FaceForensics++ | exp embedding L2 | 8.52 | FaceDancer (Config B) |
| 3D | FaceForensics++ | pose | 2.24 | FaceDancer (Config B) |
| 3D Face Modelling | AFLW2000-3D | ID retrieval | 98.5 | FaceDancer (Config A) |
| 3D Face Modelling | AFLW2000-3D | exp embedding L2 | 7.07 | FaceDancer (Config A) |
| 3D Face Modelling | AFLW2000-3D | pose | 14.97 | FaceDancer (Config A) |
| 3D Face Modelling | AFLW2000-3D | ID retrieval | 97.95 | FaceDancer (Config B) |
| 3D Face Modelling | AFLW2000-3D | exp embedding L2 | 5.74 | FaceDancer (Config B) |
| 3D Face Modelling | AFLW2000-3D | pose | 5.86 | FaceDancer (Config B) |
| 3D Face Modelling | AFLW2000-3D | ID retrieval | 97.65 | FaceDancer (Config C) |
| 3D Face Modelling | AFLW2000-3D | exp embedding L2 | 4.13 | FaceDancer (Config C) |
| 3D Face Modelling | AFLW2000-3D | pose | 5.82 | FaceDancer (Config C) |
| 3D Face Modelling | AFLW2000-3D | ID retrieval | 97.1 | FaceDancer (Config D) |
| 3D Face Modelling | AFLW2000-3D | exp embedding L2 | 4.15 | FaceDancer (Config D) |
| 3D Face Modelling | AFLW2000-3D | pose | 5.75 | FaceDancer (Config D) |
| 3D Face Modelling | FaceForensics++ | ID retrieval | 98.84 | FaceDancer (Config C) |
| 3D Face Modelling | FaceForensics++ | exp embedding L2 | 7.97 | FaceDancer (Config C) |
| 3D Face Modelling | FaceForensics++ | pose | 2.04 | FaceDancer (Config C) |
| 3D Face Modelling | FaceForensics++ | ID retrieval | 98.19 | FaceDancer (Config D) |
| 3D Face Modelling | FaceForensics++ | exp embedding L2 | 5.7 | FaceDancer (Config D) |
| 3D Face Modelling | FaceForensics++ | pose | 2.15 | FaceDancer (Config D) |
| 3D Face Modelling | FaceForensics++ | ID retrieval | 98.54 | FaceDancer (Config B) |
| 3D Face Modelling | FaceForensics++ | exp embedding L2 | 8.52 | FaceDancer (Config B) |
| 3D Face Modelling | FaceForensics++ | pose | 2.24 | FaceDancer (Config B) |
| 3D Face Reconstruction | AFLW2000-3D | ID retrieval | 98.5 | FaceDancer (Config A) |
| 3D Face Reconstruction | AFLW2000-3D | exp embedding L2 | 7.07 | FaceDancer (Config A) |
| 3D Face Reconstruction | AFLW2000-3D | pose | 14.97 | FaceDancer (Config A) |
| 3D Face Reconstruction | AFLW2000-3D | ID retrieval | 97.95 | FaceDancer (Config B) |
| 3D Face Reconstruction | AFLW2000-3D | exp embedding L2 | 5.74 | FaceDancer (Config B) |
| 3D Face Reconstruction | AFLW2000-3D | pose | 5.86 | FaceDancer (Config B) |
| 3D Face Reconstruction | AFLW2000-3D | ID retrieval | 97.65 | FaceDancer (Config C) |
| 3D Face Reconstruction | AFLW2000-3D | exp embedding L2 | 4.13 | FaceDancer (Config C) |
| 3D Face Reconstruction | AFLW2000-3D | pose | 5.82 | FaceDancer (Config C) |
| 3D Face Reconstruction | AFLW2000-3D | ID retrieval | 97.1 | FaceDancer (Config D) |
| 3D Face Reconstruction | AFLW2000-3D | exp embedding L2 | 4.15 | FaceDancer (Config D) |
| 3D Face Reconstruction | AFLW2000-3D | pose | 5.75 | FaceDancer (Config D) |
| 3D Face Reconstruction | FaceForensics++ | ID retrieval | 98.84 | FaceDancer (Config C) |
| 3D Face Reconstruction | FaceForensics++ | exp embedding L2 | 7.97 | FaceDancer (Config C) |
| 3D Face Reconstruction | FaceForensics++ | pose | 2.04 | FaceDancer (Config C) |
| 3D Face Reconstruction | FaceForensics++ | ID retrieval | 98.19 | FaceDancer (Config D) |
| 3D Face Reconstruction | FaceForensics++ | exp embedding L2 | 5.7 | FaceDancer (Config D) |
| 3D Face Reconstruction | FaceForensics++ | pose | 2.15 | FaceDancer (Config D) |
| 3D Face Reconstruction | FaceForensics++ | ID retrieval | 98.54 | FaceDancer (Config B) |
| 3D Face Reconstruction | FaceForensics++ | exp embedding L2 | 8.52 | FaceDancer (Config B) |
| 3D Face Reconstruction | FaceForensics++ | pose | 2.24 | FaceDancer (Config B) |
| 10-shot image generation | AFLW2000-3D | ID retrieval | 98.5 | FaceDancer (Config A) |
| 10-shot image generation | AFLW2000-3D | exp embedding L2 | 7.07 | FaceDancer (Config A) |
| 10-shot image generation | AFLW2000-3D | pose | 14.97 | FaceDancer (Config A) |
| 10-shot image generation | AFLW2000-3D | ID retrieval | 97.95 | FaceDancer (Config B) |
| 10-shot image generation | AFLW2000-3D | exp embedding L2 | 5.74 | FaceDancer (Config B) |
| 10-shot image generation | AFLW2000-3D | pose | 5.86 | FaceDancer (Config B) |
| 10-shot image generation | AFLW2000-3D | ID retrieval | 97.65 | FaceDancer (Config C) |
| 10-shot image generation | AFLW2000-3D | exp embedding L2 | 4.13 | FaceDancer (Config C) |
| 10-shot image generation | AFLW2000-3D | pose | 5.82 | FaceDancer (Config C) |
| 10-shot image generation | AFLW2000-3D | ID retrieval | 97.1 | FaceDancer (Config D) |
| 10-shot image generation | AFLW2000-3D | exp embedding L2 | 4.15 | FaceDancer (Config D) |
| 10-shot image generation | AFLW2000-3D | pose | 5.75 | FaceDancer (Config D) |
| 10-shot image generation | FaceForensics++ | ID retrieval | 98.84 | FaceDancer (Config C) |
| 10-shot image generation | FaceForensics++ | exp embedding L2 | 7.97 | FaceDancer (Config C) |
| 10-shot image generation | FaceForensics++ | pose | 2.04 | FaceDancer (Config C) |
| 10-shot image generation | FaceForensics++ | ID retrieval | 98.19 | FaceDancer (Config D) |
| 10-shot image generation | FaceForensics++ | exp embedding L2 | 5.7 | FaceDancer (Config D) |
| 10-shot image generation | FaceForensics++ | pose | 2.15 | FaceDancer (Config D) |
| 10-shot image generation | FaceForensics++ | ID retrieval | 98.54 | FaceDancer (Config B) |
| 10-shot image generation | FaceForensics++ | exp embedding L2 | 8.52 | FaceDancer (Config B) |
| 10-shot image generation | FaceForensics++ | pose | 2.24 | FaceDancer (Config B) |
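
As a rough guide to reading the metrics in the table, the sketch below shows one common way such numbers are computed. It assumes that ID retrieval is the percentage of swapped faces whose nearest gallery identity (by cosine similarity of identity embeddings) is the true source, and that the pose and expression scores are mean L2 distances between target and swapped-face vectors. The function names and toy embeddings are hypothetical, and the exact evaluation protocol behind the reported values may differ.

```python
import math


def cosine_similarity(u, v):
    """Cosine similarity between two non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def id_retrieval_accuracy(swapped, gallery, true_idx):
    """Percentage of swapped-face embeddings whose most similar
    gallery embedding is the true source identity."""
    hits = 0
    for emb, idx in zip(swapped, true_idx):
        best = max(range(len(gallery)),
                   key=lambda j: cosine_similarity(emb, gallery[j]))
        hits += int(best == idx)
    return 100.0 * hits / len(swapped)


def mean_l2(targets, outputs):
    """Mean Euclidean distance between paired vectors (lower is better)."""
    dists = [math.sqrt(sum((a - b) ** 2 for a, b in zip(t, o)))
             for t, o in zip(targets, outputs)]
    return sum(dists) / len(dists)


if __name__ == "__main__":
    gallery = [[1.0, 0.0], [0.0, 1.0]]           # toy source embeddings
    swapped = [[0.9, 0.1], [0.2, 0.8], [0.7, 0.3]]
    print(id_retrieval_accuracy(swapped, gallery, [0, 1, 1]))
    print(mean_l2([[0.0, 0.0], [3.0, 4.0]], [[0.0, 0.0], [0.0, 0.0]]))
```

Under this reading, a higher ID retrieval value indicates stronger identity transfer, while lower pose and expression distances indicate better preservation of the target's attributes.
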