
Learning from Simulated and Unsupervised Images through Adversarial Training

Ashish Shrivastava, Tomas Pfister, Oncel Tuzel, Josh Susskind, Wenda Wang, Russ Webb

2016-12-22 · CVPR 2017
Tasks: Pose Estimation, Gaze Estimation, Hand Pose Estimation, Image-to-Image Translation, Domain Adaptation
Links: Paper · PDF · Code

Abstract

With recent progress in graphics, it has become more tractable to train models on synthetic images, potentially avoiding the need for expensive annotations. However, learning from synthetic images may not achieve the desired performance due to a gap between synthetic and real image distributions. To reduce this gap, we propose Simulated+Unsupervised (S+U) learning, where the task is to learn a model to improve the realism of a simulator's output using unlabeled real data, while preserving the annotation information from the simulator. We develop a method for S+U learning that uses an adversarial network similar to Generative Adversarial Networks (GANs), but with synthetic images as inputs instead of random vectors. We make several key modifications to the standard GAN algorithm to preserve annotations, avoid artifacts, and stabilize training: (i) a 'self-regularization' term, (ii) a local adversarial loss, and (iii) updating the discriminator using a history of refined images. We show that this enables generation of highly realistic images, which we demonstrate both qualitatively and with a user study. We quantitatively evaluate the generated images by training models for gaze estimation and hand pose estimation. We show a significant improvement over using synthetic images, and achieve state-of-the-art results on the MPIIGaze dataset without any labeled real data.
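
The three modifications listed in the abstract map directly onto a training step. Below is a minimal PyTorch sketch for illustration only: the module and function names, the patch-logit output shape, the `lam` weight, and the buffer capacity are assumptions, not the paper's released code.

```python
# Minimal sketch of the three SimGAN training modifications.
# All names, the lambda weight, and the buffer capacity are
# illustrative assumptions, not the paper's code.
import random
import torch
import torch.nn.functional as F

class ImageHistoryBuffer:
    """(iii) Pool of past refined images; the discriminator is updated
    on a mix of current and historical refiner outputs, so it keeps
    penalizing artifacts the refiner produced earlier in training."""
    def __init__(self, capacity=512):
        self.capacity = capacity
        self.pool = []

    def sample_and_store(self, refined):
        batch = refined.detach().clone()
        half = batch.size(0) // 2  # replace half the batch with history
        for i in range(half):
            if len(self.pool) < self.capacity:
                self.pool.append(batch[i].clone())
            else:
                j = random.randrange(self.capacity)
                old = self.pool[j].clone()
                self.pool[j] = batch[i].clone()
                batch[i] = old
        return batch

def refiner_loss(discriminator, refined, synthetic, lam=0.01):
    # (ii) Local adversarial loss: the discriminator emits a grid of
    # per-patch real/fake logits (e.g. N x 1 x H' x W') rather than a
    # single scalar, so every local patch is pushed toward realism.
    local_logits = discriminator(refined)
    adv = F.binary_cross_entropy_with_logits(
        local_logits, torch.ones_like(local_logits))
    # (i) Self-regularization: keep the refined image close to the
    # synthetic input so its annotations (e.g. gaze direction) stay valid.
    self_reg = F.l1_loss(refined, synthetic)
    return adv + lam * self_reg
```

In the discriminator step, the fake batch would come from `buffer.sample_and_store(refiner(synthetic))` rather than directly from the refiner, which is modification (iii) above.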

Results

Task | Dataset | Metric | Value | Model
Image-to-Image Translation | Cityscapes Labels-to-Photo | Class IOU | 0.04 | SimGAN
Image-to-Image Translation | Cityscapes Photo-to-Labels | Class IOU | 0.07 | SimGAN
Image Generation | Cityscapes Labels-to-Photo | Class IOU | 0.04 | SimGAN
Image Generation | Cityscapes Photo-to-Labels | Class IOU | 0.07 | SimGAN
1 Image, 2*2 Stitching | Cityscapes Labels-to-Photo | Class IOU | 0.04 | SimGAN
1 Image, 2*2 Stitching | Cityscapes Photo-to-Labels | Class IOU | 0.07 | SimGAN
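
For context, the Class IOU metric reported above is the per-class intersection-over-union between predicted and ground-truth label maps, averaged over classes. A minimal NumPy sketch, where the integer label-map inputs and the skipping of absent classes are assumptions:

```python
import numpy as np

def class_iou(pred, target, num_classes):
    """pred, target: integer label maps of identical shape."""
    ious = []
    for c in range(num_classes):
        inter = np.logical_and(pred == c, target == c).sum()
        union = np.logical_or(pred == c, target == c).sum()
        if union > 0:  # skip classes absent from both maps
            ious.append(inter / union)
    return float(np.mean(ious))

# Example: class 0 -> 1.0, class 1 -> 0.5, class 2 -> 0.5; mean ~0.67
pred = np.array([[0, 1], [1, 2]])
target = np.array([[0, 1], [2, 2]])
print(class_iou(pred, target, num_classes=3))
```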

Related Papers

$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning (2025-07-17)
Revisiting Reliability in the Reasoning-based Pose Estimation Benchmark (2025-07-17)
DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model (2025-07-17)
From Neck to Head: Bio-Impedance Sensing for Head Pose Estimation (2025-07-17)
AthleticsPose: Authentic Sports Motion Dataset on Athletic Field and Evaluation of Monocular 3D Pose Estimation Ability (2025-07-17)
A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique (2025-07-17)
SpatialTrackerV2: 3D Point Tracking Made Easy (2025-07-16)
SGLoc: Semantic Localization System for Camera Pose Estimation from 3D Gaussian Splatting Representation (2025-07-16)