Photographic Image Synthesis with Cascaded Refinement Networks

Qifeng Chen, Vladlen Koltun

2017-07-28ICCV 2017 10Image Generation Image-to-Image Translation

Abstract

We present an approach to synthesizing photographic images conditioned on semantic layouts. Given a semantic label map, our approach produces an image with photographic appearance that conforms to the input layout. The approach thus functions as a rendering engine that takes a two-dimensional semantic specification of the scene and produces a corresponding photographic image. Unlike recent and contemporaneous work, our approach does not rely on adversarial training. We show that photographic images can be synthesized from semantic layouts by a single feedforward network with appropriate structure, trained end-to-end with a direct regression objective. The presented approach scales seamlessly to high resolutions; we demonstrate this by synthesizing photographic images at 2-megapixel resolution, the full resolution of our training data. Extensive perceptual experiments on datasets of outdoor and indoor scenes demonstrate that images synthesized by the presented approach are considerably more realistic than alternative approaches. The results are shown in the supplementary video at https://youtu.be/0fhUJT21-bs

Results

Task	Dataset	Metric	Value	Model
Image-to-Image Translation	COCO-Stuff Labels-to-Photos	FID	70.4	CRN
Image-to-Image Translation	COCO-Stuff Labels-to-Photos	mIoU	23.7	CRN
Image-to-Image Translation	Cityscapes Labels-to-Photo	FID	104.7	CRN
Image-to-Image Translation	Cityscapes Labels-to-Photo	mIoU	52.4	CRN
Image-to-Image Translation	ADE20K Labels-to-Photos	FID	73.3	CRN
Image-to-Image Translation	ADE20K Labels-to-Photos	mIoU	22.4	CRN
Image-to-Image Translation	ADE20K-Outdoor Labels-to-Photos	FID	99	CRN
Image-to-Image Translation	ADE20K-Outdoor Labels-to-Photos	mIoU	16.5	CRN
Image Generation	COCO-Stuff Labels-to-Photos	FID	70.4	CRN
Image Generation	COCO-Stuff Labels-to-Photos	mIoU	23.7	CRN
Image Generation	Cityscapes Labels-to-Photo	FID	104.7	CRN
Image Generation	Cityscapes Labels-to-Photo	mIoU	52.4	CRN
Image Generation	ADE20K Labels-to-Photos	FID	73.3	CRN
Image Generation	ADE20K Labels-to-Photos	mIoU	22.4	CRN
Image Generation	ADE20K-Outdoor Labels-to-Photos	FID	99	CRN
Image Generation	ADE20K-Outdoor Labels-to-Photos	mIoU	16.5	CRN
1 Image, 2*2 Stitching	COCO-Stuff Labels-to-Photos	FID	70.4	CRN
1 Image, 2*2 Stitching	COCO-Stuff Labels-to-Photos	mIoU	23.7	CRN
1 Image, 2*2 Stitching	Cityscapes Labels-to-Photo	FID	104.7	CRN
1 Image, 2*2 Stitching	Cityscapes Labels-to-Photo	mIoU	52.4	CRN
1 Image, 2*2 Stitching	ADE20K Labels-to-Photos	FID	73.3	CRN
1 Image, 2*2 Stitching	ADE20K Labels-to-Photos	mIoU	22.4	CRN
1 Image, 2*2 Stitching	ADE20K-Outdoor Labels-to-Photos	FID	99	CRN
1 Image, 2*2 Stitching	ADE20K-Outdoor Labels-to-Photos	mIoU	16.5	CRN

Photographic Image Synthesis with Cascaded Refinement Networks

Abstract

Results

Related Papers

Photographic Image Synthesis with Cascaded Refinement Networks

Abstract

Results

Related Papers