TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/In2I : Unsupervised Multi-Image-to-Image Translation Using...

In2I : Unsupervised Multi-Image-to-Image Translation Using Generative Adversarial Networks

Pramuditha Perera, Mahdi Abavisani, Vishal M. Patel

2017-11-26Multimodal Unsupervised Image-To-Image TranslationUnsupervised Image-To-Image TranslationTranslationImage-to-Image Translation
PaperPDFCode

Abstract

In unsupervised image-to-image translation, the goal is to learn the mapping between an input image and an output image using a set of unpaired training images. In this paper, we propose an extension of the unsupervised image-to-image translation problem to multiple input setting. Given a set of paired images from multiple modalities, a transformation is learned to translate the input into a specified domain. For this purpose, we introduce a Generative Adversarial Network (GAN) based framework along with a multi-modal generator structure and a new loss term, latent consistency loss. Through various experiments we show that leveraging multiple inputs generally improves the visual quality of the translated images. Moreover, we show that the proposed method outperforms current state-of-the-art unsupervised image-to-image translation methods.

Results

TaskDatasetMetricValueModel
Image-to-Image TranslationFreiburg Forest DatasetPSNR21.65In2I
Image-to-Image TranslationEPFL NIR-VISPSNR23.11In2I
Image GenerationFreiburg Forest DatasetPSNR21.65In2I
Image GenerationEPFL NIR-VISPSNR23.11In2I
Unsupervised Image-To-Image TranslationFreiburg Forest DatasetPSNR21.65In2I
1 Image, 2*2 StitchingFreiburg Forest DatasetPSNR21.65In2I
1 Image, 2*2 StitchingEPFL NIR-VISPSNR23.11In2I

Related Papers

A Translation of Probabilistic Event Calculus into Markov Decision Processes2025-07-17Function-to-Style Guidance of LLMs for Code Translation2025-07-15Speak2Sign3D: A Multi-modal Pipeline for English Speech to American Sign Language Animation2025-07-09Pun Intended: Multi-Agent Translation of Wordplay with Contrastive Learning and Phonetic-Semantic Embeddings2025-07-09Unconditional Diffusion for Generative Sequential Recommendation2025-07-08GRAFT: A Graph-based Flow-aware Agentic Framework for Document-level Machine Translation2025-07-04TransLaw: Benchmarking Large Language Models in Multi-Agent Simulation of the Collaborative Translation2025-07-01CycleVAR: Repurposing Autoregressive Model for Unsupervised One-Step Image Translation2025-06-29