TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Guided Image-to-Image Translation with Bi-Directional Feat...

Guided Image-to-Image Translation with Bi-Directional Feature Transformation

Badour AlBahar, Jia-Bin Huang

2019-10-24ICCV 2019 10Pose TransferTranslationImage-to-Image Translation
PaperPDFCode(official)

Abstract

We address the problem of guided image-to-image translation where we translate an input image into another while respecting the constraints provided by an external, user-provided guidance image. Various conditioning methods for leveraging the given guidance image have been explored, including input concatenation , feature concatenation, and conditional affine transformation of feature activations. All these conditioning mechanisms, however, are uni-directional, i.e., no information flow from the input image back to the guidance. To better utilize the constraints of the guidance image, we present a bi-directional feature transformation (bFT) scheme. We show that our bFT scheme outperforms other conditioning schemes and has comparable results to state-of-the-art methods on different tasks.

Results

TaskDatasetMetricValueModel
Image GenerationDeep-FashionFID12.266bFT
Image GenerationDeep-FashionIS3.22bFT
Image GenerationDeep-FashionSSIM0.767bFT
Image ReconstructionEdge-to-ClothesFID58.4bFT
Image ReconstructionEdge-to-ClothesLPIPS0.1bFT
Image ReconstructionEdge-to-HandbagsFID74.9bFT
Image ReconstructionEdge-to-HandbagsLPIPS0.2bFT
Image ReconstructionEdge-to-ShoesFID121.2bFT
Image ReconstructionEdge-to-ShoesLPIPS0.1bFT

Related Papers

A Translation of Probabilistic Event Calculus into Markov Decision Processes2025-07-17Function-to-Style Guidance of LLMs for Code Translation2025-07-15Speak2Sign3D: A Multi-modal Pipeline for English Speech to American Sign Language Animation2025-07-09Pun Intended: Multi-Agent Translation of Wordplay with Contrastive Learning and Phonetic-Semantic Embeddings2025-07-09Unconditional Diffusion for Generative Sequential Recommendation2025-07-08GRAFT: A Graph-based Flow-aware Agentic Framework for Document-level Machine Translation2025-07-04TransLaw: Benchmarking Large Language Models in Multi-Agent Simulation of the Collaborative Translation2025-07-01CycleVAR: Repurposing Autoregressive Model for Unsupervised One-Step Image Translation2025-06-29