Dual Pyramid Generative Adversarial Networks for Semantic Image Synthesis

Shijie Li, Ming-Ming Cheng, Juergen Gall

2022-10-08Image Generation Image-to-Image Translation

Abstract

The goal of semantic image synthesis is to generate photo-realistic images from semantic label maps. It is highly relevant for tasks like content generation and image editing. Current state-of-the-art approaches, however, still struggle to generate realistic objects in images at various scales. In particular, small objects tend to fade away and large objects are often generated as collages of patches. In order to address this issue, we propose a Dual Pyramid Generative Adversarial Network (DP-GAN) that learns the conditioning of spatially-adaptive normalization blocks at all scales jointly, such that scale information is bi-directionally used, and it unifies supervision at different scales. Our qualitative and quantitative results show that the proposed approach generates images where small and large objects look more realistic compared to images generated by state-of-the-art methods.

Results

Task	Dataset	Metric	Value	Model
Image-to-Image Translation	Cityscapes Labels-to-Photo	FID	44.1	DP-GAN
Image-to-Image Translation	Cityscapes Labels-to-Photo	mIoU	73.6	DP-GAN
Image-to-Image Translation	ADE20K Labels-to-Photos	FID	26.1	DP-GAN
Image-to-Image Translation	ADE20K Labels-to-Photos	mIoU	52.7	DP-GAN
Image-to-Image Translation	ADE20K-Outdoor Labels-to-Photos	FID	45.8	DP-GAN
Image-to-Image Translation	ADE20K-Outdoor Labels-to-Photos	mIoU	40.4	DP-GAN
Image Generation	Cityscapes Labels-to-Photo	FID	44.1	DP-GAN
Image Generation	Cityscapes Labels-to-Photo	mIoU	73.6	DP-GAN
Image Generation	ADE20K Labels-to-Photos	FID	26.1	DP-GAN
Image Generation	ADE20K Labels-to-Photos	mIoU	52.7	DP-GAN
Image Generation	ADE20K-Outdoor Labels-to-Photos	FID	45.8	DP-GAN
Image Generation	ADE20K-Outdoor Labels-to-Photos	mIoU	40.4	DP-GAN
1 Image, 2*2 Stitching	Cityscapes Labels-to-Photo	FID	44.1	DP-GAN
1 Image, 2*2 Stitching	Cityscapes Labels-to-Photo	mIoU	73.6	DP-GAN
1 Image, 2*2 Stitching	ADE20K Labels-to-Photos	FID	26.1	DP-GAN
1 Image, 2*2 Stitching	ADE20K Labels-to-Photos	mIoU	52.7	DP-GAN
1 Image, 2*2 Stitching	ADE20K-Outdoor Labels-to-Photos	FID	45.8	DP-GAN
1 Image, 2*2 Stitching	ADE20K-Outdoor Labels-to-Photos	mIoU	40.4	DP-GAN

Dual Pyramid Generative Adversarial Networks for Semantic Image Synthesis

Abstract

Results

Related Papers

Dual Pyramid Generative Adversarial Networks for Semantic Image Synthesis

Abstract

Results

Related Papers