Few-Shot Adversarial Learning of Realistic Neural Talking Head Models

Egor Zakharov, Aliaksandra Shysheya, Egor Burkov, Victor Lempitsky

2019-05-20ICCV 2019 10Meta-Learning Talking Head Generation One-Shot Learning

Abstract

Several recent works have shown how highly realistic human head images can be obtained by training convolutional neural networks to generate them. In order to create a personalized talking head model, these works require training on a large dataset of images of a single person. However, in many practical scenarios, such personalized talking head models need to be learned from a few image views of a person, potentially even a single image. Here, we present a system with such few-shot capability. It performs lengthy meta-learning on a large dataset of videos, and after that is able to frame few- and one-shot learning of neural talking head models of previously unseen people as adversarial training problems with high capacity generators and discriminators. Crucially, the system is able to initialize the parameters of both the generator and the discriminator in a person-specific way, so that training can be based on just a few images and done quickly, despite the need to tune tens of millions of parameters. We show that such an approach is able to learn highly realistic and personalized talking head models of new people and even portrait paintings.

Results

Task	Dataset	Metric	Value	Model
Facial Recognition and Modelling	VoxCeleb2 - 8-shot learning	FID	42.2	Few-shot Adversarial Model
Facial Recognition and Modelling	VoxCeleb2 - 1-shot learning	FID	48.5	Few-shot Adversarial Model
Facial Recognition and Modelling	VoxCeleb1 - 8-shot learning	FID	38	Few-shot Adversarial Model
Facial Recognition and Modelling	VoxCeleb1 - 32-shot learning	FID	29.5	Few-shot Adversarial Model
Facial Recognition and Modelling	VoxCeleb1 - 1-shot learning	FID	43	Few-shot Adversarial Model
Facial Recognition and Modelling	VoxCeleb2 - 32-shot learning	FID	30.6	Few-shot Adversarial Model
Image Generation	VoxCeleb2 - 8-shot learning	FID	42.2	Few-shot Adversarial Model
Image Generation	VoxCeleb2 - 1-shot learning	FID	48.5	Few-shot Adversarial Model
Image Generation	VoxCeleb1 - 8-shot learning	FID	38	Few-shot Adversarial Model
Image Generation	VoxCeleb1 - 32-shot learning	FID	29.5	Few-shot Adversarial Model
Image Generation	VoxCeleb1 - 1-shot learning	FID	43	Few-shot Adversarial Model
Image Generation	VoxCeleb2 - 32-shot learning	FID	30.6	Few-shot Adversarial Model
Talking Head Generation	VoxCeleb2 - 8-shot learning	FID	42.2	Few-shot Adversarial Model
Talking Head Generation	VoxCeleb2 - 1-shot learning	FID	48.5	Few-shot Adversarial Model
Talking Head Generation	VoxCeleb1 - 8-shot learning	FID	38	Few-shot Adversarial Model
Talking Head Generation	VoxCeleb1 - 32-shot learning	FID	29.5	Few-shot Adversarial Model
Talking Head Generation	VoxCeleb1 - 1-shot learning	FID	43	Few-shot Adversarial Model
Talking Head Generation	VoxCeleb2 - 32-shot learning	FID	30.6	Few-shot Adversarial Model
Face Generation	VoxCeleb2 - 8-shot learning	FID	42.2	Few-shot Adversarial Model
Face Generation	VoxCeleb2 - 1-shot learning	FID	48.5	Few-shot Adversarial Model
Face Generation	VoxCeleb1 - 8-shot learning	FID	38	Few-shot Adversarial Model
Face Generation	VoxCeleb1 - 32-shot learning	FID	29.5	Few-shot Adversarial Model
Face Generation	VoxCeleb1 - 1-shot learning	FID	43	Few-shot Adversarial Model
Face Generation	VoxCeleb2 - 32-shot learning	FID	30.6	Few-shot Adversarial Model
Face Reconstruction	VoxCeleb2 - 8-shot learning	FID	42.2	Few-shot Adversarial Model
Face Reconstruction	VoxCeleb2 - 1-shot learning	FID	48.5	Few-shot Adversarial Model
Face Reconstruction	VoxCeleb1 - 8-shot learning	FID	38	Few-shot Adversarial Model
Face Reconstruction	VoxCeleb1 - 32-shot learning	FID	29.5	Few-shot Adversarial Model
Face Reconstruction	VoxCeleb1 - 1-shot learning	FID	43	Few-shot Adversarial Model
Face Reconstruction	VoxCeleb2 - 32-shot learning	FID	30.6	Few-shot Adversarial Model
3D	VoxCeleb2 - 8-shot learning	FID	42.2	Few-shot Adversarial Model
3D	VoxCeleb2 - 1-shot learning	FID	48.5	Few-shot Adversarial Model
3D	VoxCeleb1 - 8-shot learning	FID	38	Few-shot Adversarial Model
3D	VoxCeleb1 - 32-shot learning	FID	29.5	Few-shot Adversarial Model
3D	VoxCeleb1 - 1-shot learning	FID	43	Few-shot Adversarial Model
3D	VoxCeleb2 - 32-shot learning	FID	30.6	Few-shot Adversarial Model
3D Face Modelling	VoxCeleb2 - 8-shot learning	FID	42.2	Few-shot Adversarial Model
3D Face Modelling	VoxCeleb2 - 1-shot learning	FID	48.5	Few-shot Adversarial Model
3D Face Modelling	VoxCeleb1 - 8-shot learning	FID	38	Few-shot Adversarial Model
3D Face Modelling	VoxCeleb1 - 32-shot learning	FID	29.5	Few-shot Adversarial Model
3D Face Modelling	VoxCeleb1 - 1-shot learning	FID	43	Few-shot Adversarial Model
3D Face Modelling	VoxCeleb2 - 32-shot learning	FID	30.6	Few-shot Adversarial Model
3D Face Reconstruction	VoxCeleb2 - 8-shot learning	FID	42.2	Few-shot Adversarial Model
3D Face Reconstruction	VoxCeleb2 - 1-shot learning	FID	48.5	Few-shot Adversarial Model
3D Face Reconstruction	VoxCeleb1 - 8-shot learning	FID	38	Few-shot Adversarial Model
3D Face Reconstruction	VoxCeleb1 - 32-shot learning	FID	29.5	Few-shot Adversarial Model
3D Face Reconstruction	VoxCeleb1 - 1-shot learning	FID	43	Few-shot Adversarial Model
3D Face Reconstruction	VoxCeleb2 - 32-shot learning	FID	30.6	Few-shot Adversarial Model
10-shot image generation	VoxCeleb2 - 8-shot learning	FID	42.2	Few-shot Adversarial Model
10-shot image generation	VoxCeleb2 - 1-shot learning	FID	48.5	Few-shot Adversarial Model
10-shot image generation	VoxCeleb1 - 8-shot learning	FID	38	Few-shot Adversarial Model
10-shot image generation	VoxCeleb1 - 32-shot learning	FID	29.5	Few-shot Adversarial Model
10-shot image generation	VoxCeleb1 - 1-shot learning	FID	43	Few-shot Adversarial Model
10-shot image generation	VoxCeleb2 - 32-shot learning	FID	30.6	Few-shot Adversarial Model

Few-Shot Adversarial Learning of Realistic Neural Talking Head Models

Abstract

Results

Related Papers

Few-Shot Adversarial Learning of Realistic Neural Talking Head Models

Abstract

Results

Related Papers