TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Multi-Channel Attention Selection GAN with Cascaded Semant...

Multi-Channel Attention Selection GAN with Cascaded Semantic Guidance for Cross-View Image Translation

Hao Tang, Dan Xu, Nicu Sebe, Yanzhi Wang, Jason J. Corso, Yan Yan

2019-04-15CVPR 2019 6TranslationCross-View Image-to-Image TranslationImage-to-Image Translation
PaperPDFCodeCodeCode

Abstract

Cross-view image translation is challenging because it involves images with drastically different views and severe deformation. In this paper, we propose a novel approach named Multi-Channel Attention SelectionGAN (SelectionGAN) that makes it possible to generate images of natural scenes in arbitrary viewpoints, based on an image of the scene and a novel semantic map. The proposed SelectionGAN explicitly utilizes the semantic information and consists of two stages. In the first stage, the condition image and the target semantic map are fed into a cycled semantic-guided generation network to produce initial coarse results. In the second stage, we refine the initial results by using a multi-channel attention selection mechanism. Moreover, uncertainty maps automatically learned from attentions are used to guide the pixel loss for better network optimization. Extensive experiments on Dayton, CVUSA and Ego2Top datasets show that our model is able to generate significantly better results than the state-of-the-art methods. The source code, data and trained models are available at https://github.com/Ha0Tang/SelectionGAN.

Results

TaskDatasetMetricValueModel
Image-to-Image TranslationDayton (64x64) - ground-to-aerialSSIM0.5118SelectionGAN
Image-to-Image TranslationcvusaSSIM0.5323SelectionGAN
Image-to-Image TranslationDayton (64×64) - aerial-to-groundSSIM0.6865SelectionGAN
Image-to-Image TranslationEgo2TopSSIM0.6024SelectionGAN
Image-to-Image TranslationDayton (256×256) - ground-to-aerialSSIM0.3284SelectionGAN
Image-to-Image TranslationDayton (256×256) - aerial-to-groundSSIM0.5938SelectionGAN
Image GenerationDayton (64x64) - ground-to-aerialSSIM0.5118SelectionGAN
Image GenerationcvusaSSIM0.5323SelectionGAN
Image GenerationDayton (64×64) - aerial-to-groundSSIM0.6865SelectionGAN
Image GenerationEgo2TopSSIM0.6024SelectionGAN
Image GenerationDayton (256×256) - ground-to-aerialSSIM0.3284SelectionGAN
Image GenerationDayton (256×256) - aerial-to-groundSSIM0.5938SelectionGAN
1 Image, 2*2 StitchingDayton (64x64) - ground-to-aerialSSIM0.5118SelectionGAN
1 Image, 2*2 StitchingcvusaSSIM0.5323SelectionGAN
1 Image, 2*2 StitchingDayton (64×64) - aerial-to-groundSSIM0.6865SelectionGAN
1 Image, 2*2 StitchingEgo2TopSSIM0.6024SelectionGAN
1 Image, 2*2 StitchingDayton (256×256) - ground-to-aerialSSIM0.3284SelectionGAN
1 Image, 2*2 StitchingDayton (256×256) - aerial-to-groundSSIM0.5938SelectionGAN

Related Papers

A Translation of Probabilistic Event Calculus into Markov Decision Processes2025-07-17Function-to-Style Guidance of LLMs for Code Translation2025-07-15Speak2Sign3D: A Multi-modal Pipeline for English Speech to American Sign Language Animation2025-07-09Pun Intended: Multi-Agent Translation of Wordplay with Contrastive Learning and Phonetic-Semantic Embeddings2025-07-09Unconditional Diffusion for Generative Sequential Recommendation2025-07-08GRAFT: A Graph-based Flow-aware Agentic Framework for Document-level Machine Translation2025-07-04TransLaw: Benchmarking Large Language Models in Multi-Agent Simulation of the Collaborative Translation2025-07-01CycleVAR: Repurposing Autoregressive Model for Unsupervised One-Step Image Translation2025-06-29