Object-Centric Image Generation from Layouts

Tristan Sylvain, Pengchuan Zhang, Yoshua Bengio, R. Devon Hjelm, Shikhar Sharma

2020-03-16

Layout-to-Image Generation, Image Generation

Abstract

Despite recent impressive results on single-object and single-domain image generation, the generation of complex scenes with multiple objects remains challenging. In this paper, we start with the idea that a model must be able to understand individual objects and relationships between objects in order to generate complex scenes well. Our layout-to-image-generation method, which we call Object-Centric Generative Adversarial Network (or OC-GAN), relies on a novel Scene-Graph Similarity Module (SGSM). The SGSM learns representations of the spatial relationships between objects in the scene, which lead to our model's improved layout-fidelity. We also propose changes to the conditioning mechanism of the generator that enhance its object instance-awareness. Apart from improving image quality, our contributions mitigate two failure modes in previous approaches: (1) spurious objects being generated without corresponding bounding boxes in the layout, and (2) overlapping bounding boxes in the layout leading to merged objects in images. Extensive quantitative evaluation and ablation studies demonstrate the impact of our contributions, with our model outperforming previous state-of-the-art approaches on both the COCO-Stuff and Visual Genome datasets. Finally, we address an important limitation of evaluation metrics used in previous works by introducing SceneFID, an object-centric adaptation of the popular Fréchet Inception Distance metric that is better suited for multi-object images.
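The abstract describes SceneFID only as an object-centric adaptation of the Fréchet Inception Distance and gives no computation details. As a rough illustration, the sketch below assumes the metric is obtained by cropping each object along its layout bounding box and computing the usual Fréchet distance over features of those crops. The helper names `crop_objects` and `frechet_distance`, the pixel-coordinate box format, and the use of an external feature extractor (not shown) are assumptions made for this example, not details taken from the paper.

```python
import numpy as np


def crop_objects(image, boxes):
    """Crop each box (x0, y0, x1, y1), in pixels, from an H x W x C image.

    Assumption: boxes are integer pixel coordinates; the paper's exact
    cropping and resizing procedure is not given in the abstract.
    """
    return [image[y0:y1, x0:x1] for (x0, y0, x1, y1) in boxes]


def frechet_distance(feats_a, feats_b):
    """Frechet distance between Gaussians fitted to two feature sets.

    feats_a, feats_b: arrays of shape (n_samples, n_features), e.g. features
    of real and generated object crops from some pretrained network.
    """
    mu_a, mu_b = feats_a.mean(axis=0), feats_b.mean(axis=0)
    cov_a = np.cov(feats_a, rowvar=False)
    cov_b = np.cov(feats_b, rowvar=False)

    # Tr(sqrt(cov_a @ cov_b)) is the sum of square roots of the eigenvalues
    # of the product, which are real and non-negative for PSD covariances.
    # Clip tiny negative values that arise from floating-point error.
    eigvals = np.linalg.eigvals(cov_a @ cov_b)
    trace_sqrt = np.sqrt(np.clip(eigvals.real, 0.0, None)).sum()

    diff = mu_a - mu_b
    return float(diff @ diff + np.trace(cov_a) + np.trace(cov_b) - 2.0 * trace_sqrt)
```

Under these assumptions, an object-level score would come from calling `crop_objects` on real and generated images with their layouts, embedding the crops with a fixed feature extractor, and passing the two feature arrays to `frechet_distance`. Lower values indicate that the generated objects are statistically closer to the real ones.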

Results

Task	Dataset	Metric	Value	Model
Image Generation	COCO-Stuff 128x128	FID	36.31	OC-GAN
Image Generation	COCO-Stuff 128x128	Inception Score	14.6	OC-GAN
Image Generation	COCO-Stuff 128x128	SceneFID	16.76	OC-GAN
Image Generation	COCO-Stuff 64x64	FID	29.57	OC-GAN
Image Generation	COCO-Stuff 64x64	Inception Score	10.8	OC-GAN
Image Generation	Visual Genome 64x64	FID	20.27	OC-GAN
Image Generation	Visual Genome 64x64	Inception Score	9.3	OC-GAN
Image Generation	COCO-Stuff 256x256	FID	41.65	OC-GAN
Image Generation	COCO-Stuff 256x256	Inception Score	17.8	OC-GAN
Image Generation	Visual Genome 128x128	FID	28.26	OC-GAN
Image Generation	Visual Genome 128x128	Inception Score	12.3	OC-GAN
Image Generation	Visual Genome 128x128	SceneFID	9.63	OC-GAN
Image Generation	Visual Genome 256x256	FID	40.85	OC-GAN
Image Generation	Visual Genome 256x256	Inception Score	14.7	OC-GAN
