Unsupervised Learning of Object Landmarks through Conditional Image Generation

Tomas Jakab, Ankush Gupta, Hakan Bilen, Andrea Vedaldi

2018-06-20NeurIPS 2018 12Unsupervised Facial Landmark Detection Image Generation Conditional Image Generation

Abstract

We propose a method for learning landmark detectors for visual objects (such as the eyes and the nose in a face) without any manual supervision. We cast this as the problem of generating images that combine the appearance of the object as seen in a first example image with the geometry of the object as seen in a second example image, where the two examples differ by a viewpoint change and/or an object deformation. In order to factorize appearance and geometry, we introduce a tight bottleneck in the geometry-extraction process that selects and distils geometry-related features. Compared to standard image generation problems, which often use generative adversarial networks, our generation task is conditioned on both appearance and geometry and thus is significantly less ambiguous, to the point that adopting a simple perceptual loss formulation is sufficient. We demonstrate that our approach can learn object landmarks from synthetic image deformations or videos, all without manual supervision, while outperforming state-of-the-art unsupervised landmark detectors. We further show that our method is applicable to a large variety of datasets - faces, people, 3D objects, and digits - without any modifications.

Results

Task	Dataset	Metric	Value	Model
Facial Recognition and Modelling	MAFL	NME	2.54	Conditional Image Generation
Facial Recognition and Modelling	MAFL Unaligned	NME	8.74	IMM
Facial Recognition and Modelling	AFLW (Zhang CVPR 2018 crops)	NME	6.31	Conditional Image Generation
Facial Landmark Detection	MAFL	NME	2.54	Conditional Image Generation
Facial Landmark Detection	MAFL Unaligned	NME	8.74	IMM
Facial Landmark Detection	AFLW (Zhang CVPR 2018 crops)	NME	6.31	Conditional Image Generation
Face Reconstruction	MAFL	NME	2.54	Conditional Image Generation
Face Reconstruction	MAFL Unaligned	NME	8.74	IMM
Face Reconstruction	AFLW (Zhang CVPR 2018 crops)	NME	6.31	Conditional Image Generation
3D	MAFL	NME	2.54	Conditional Image Generation
3D	MAFL Unaligned	NME	8.74	IMM
3D	AFLW (Zhang CVPR 2018 crops)	NME	6.31	Conditional Image Generation
3D Face Modelling	MAFL	NME	2.54	Conditional Image Generation
3D Face Modelling	MAFL Unaligned	NME	8.74	IMM
3D Face Modelling	AFLW (Zhang CVPR 2018 crops)	NME	6.31	Conditional Image Generation
3D Face Reconstruction	MAFL	NME	2.54	Conditional Image Generation
3D Face Reconstruction	MAFL Unaligned	NME	8.74	IMM
3D Face Reconstruction	AFLW (Zhang CVPR 2018 crops)	NME	6.31	Conditional Image Generation

Unsupervised Learning of Object Landmarks through Conditional Image Generation

Abstract

Results

Related Papers

Unsupervised Learning of Object Landmarks through Conditional Image Generation

Abstract

Results

Related Papers