Multi-Channel Attention Selection GAN with Cascaded Semantic Guidance for Cross-View Image Translation

Hao Tang, Dan Xu, Nicu Sebe, Yanzhi Wang, Jason J. Corso, Yan Yan

2019-04-15CVPR 2019 6Translation Cross-View Image-to-Image Translation Image-to-Image Translation

Abstract

Cross-view image translation is challenging because it involves images with drastically different views and severe deformation. In this paper, we propose a novel approach named Multi-Channel Attention SelectionGAN (SelectionGAN) that makes it possible to generate images of natural scenes in arbitrary viewpoints, based on an image of the scene and a novel semantic map. The proposed SelectionGAN explicitly utilizes the semantic information and consists of two stages. In the first stage, the condition image and the target semantic map are fed into a cycled semantic-guided generation network to produce initial coarse results. In the second stage, we refine the initial results by using a multi-channel attention selection mechanism. Moreover, uncertainty maps automatically learned from attentions are used to guide the pixel loss for better network optimization. Extensive experiments on Dayton, CVUSA and Ego2Top datasets show that our model is able to generate significantly better results than the state-of-the-art methods. The source code, data and trained models are available at https://github.com/Ha0Tang/SelectionGAN.

Results

Task	Dataset	Metric	Value	Model
Image-to-Image Translation	Dayton (64x64) - ground-to-aerial	SSIM	0.5118	SelectionGAN
Image-to-Image Translation	cvusa	SSIM	0.5323	SelectionGAN
Image-to-Image Translation	Dayton (64×64) - aerial-to-ground	SSIM	0.6865	SelectionGAN
Image-to-Image Translation	Ego2Top	SSIM	0.6024	SelectionGAN
Image-to-Image Translation	Dayton (256×256) - ground-to-aerial	SSIM	0.3284	SelectionGAN
Image-to-Image Translation	Dayton (256×256) - aerial-to-ground	SSIM	0.5938	SelectionGAN
Image Generation	Dayton (64x64) - ground-to-aerial	SSIM	0.5118	SelectionGAN
Image Generation	cvusa	SSIM	0.5323	SelectionGAN
Image Generation	Dayton (64×64) - aerial-to-ground	SSIM	0.6865	SelectionGAN
Image Generation	Ego2Top	SSIM	0.6024	SelectionGAN
Image Generation	Dayton (256×256) - ground-to-aerial	SSIM	0.3284	SelectionGAN
Image Generation	Dayton (256×256) - aerial-to-ground	SSIM	0.5938	SelectionGAN
1 Image, 2*2 Stitching	Dayton (64x64) - ground-to-aerial	SSIM	0.5118	SelectionGAN
1 Image, 2*2 Stitching	cvusa	SSIM	0.5323	SelectionGAN
1 Image, 2*2 Stitching	Dayton (64×64) - aerial-to-ground	SSIM	0.6865	SelectionGAN
1 Image, 2*2 Stitching	Ego2Top	SSIM	0.6024	SelectionGAN
1 Image, 2*2 Stitching	Dayton (256×256) - ground-to-aerial	SSIM	0.3284	SelectionGAN
1 Image, 2*2 Stitching	Dayton (256×256) - aerial-to-ground	SSIM	0.5938	SelectionGAN

Multi-Channel Attention Selection GAN with Cascaded Semantic Guidance for Cross-View Image Translation

Abstract

Results

Related Papers

Multi-Channel Attention Selection GAN with Cascaded Semantic Guidance for Cross-View Image Translation

Abstract

Results

Related Papers