Badour AlBahar, Jia-Bin Huang
We address the problem of guided image-to-image translation, where we translate an input image into another while respecting the constraints provided by an external, user-provided guidance image. Various conditioning methods for leveraging the given guidance image have been explored, including input concatenation, feature concatenation, and conditional affine transformation of feature activations. All these conditioning mechanisms, however, are uni-directional: information flows only from the guidance to the input image, with no feedback from the input image to the guidance. To better utilize the constraints of the guidance image, we present a bi-directional feature transformation (bFT) scheme. We show that our bFT scheme outperforms other conditioning schemes and achieves results comparable to state-of-the-art methods on different tasks.
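To make the difference between uni- and bi-directional conditioning concrete, here is a minimal NumPy sketch of one bi-directional feature transformation step. All names (`affine_params`, `bft_layer`) and the per-channel, globally pooled parameter generator are illustrative assumptions, not the paper's architecture; the actual method may use learned convolutions and spatially varying scale/shift parameters.

```python
import numpy as np

rng = np.random.default_rng(0)

def affine_params(feat, out_ch):
    # Hypothetical parameter generator: global average pooling followed by
    # a random linear map, standing in for a learned network that emits
    # per-channel scale (gamma) and shift (beta) parameters.
    W = rng.standard_normal((feat.shape[0], 2 * out_ch)) * 0.1
    p = feat.mean(axis=(1, 2)) @ W
    gamma, beta = p[:out_ch], p[out_ch:]
    return gamma[:, None, None], beta[:, None, None]

def bft_layer(x, g):
    """One bi-directional feature transformation step (sketch).

    x: input-branch features,    shape (C, H, W)
    g: guidance-branch features, shape (C, H, W)

    Each branch modulates the other with an affine transformation.
    Uni-directional conditioning (e.g., FiLM-style) would compute only
    the guidance -> input direction and leave g unchanged.
    """
    gx, bx = affine_params(g, x.shape[0])   # guidance -> input
    gg, bg = affine_params(x, g.shape[0])   # input -> guidance
    x_new = (1 + gx) * x + bx               # modulate input features
    g_new = (1 + gg) * g + bg               # modulate guidance features
    return x_new, g_new

x = rng.standard_normal((8, 16, 16))
g = rng.standard_normal((8, 16, 16))
x2, g2 = bft_layer(x, g)
```

The key point the sketch illustrates is that `g_new` depends on `x`, so the guidance representation is updated by the input image at every layer, rather than being a fixed one-way conditioning signal.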
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Image Generation | Deep-Fashion | FID | 12.266 | bFT |
| Image Generation | Deep-Fashion | IS | 3.22 | bFT |
| Image Generation | Deep-Fashion | SSIM | 0.767 | bFT |
| Image Reconstruction | Edge-to-Clothes | FID | 58.4 | bFT |
| Image Reconstruction | Edge-to-Clothes | LPIPS | 0.1 | bFT |
| Image Reconstruction | Edge-to-Handbags | FID | 74.9 | bFT |
| Image Reconstruction | Edge-to-Handbags | LPIPS | 0.2 | bFT |
| Image Reconstruction | Edge-to-Shoes | FID | 121.2 | bFT |
| Image Reconstruction | Edge-to-Shoes | LPIPS | 0.1 | bFT |