
Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.


TR0N: Translator Networks for 0-Shot Plug-and-Play Conditional Generation

Zhaoyan Liu, Noel Vouitsis, Satya Krishna Gorti, Jimmy Ba, Gabriel Loaiza-Ganem

2023-04-26 | Text-to-Image Generation | Image Generation

Abstract

We propose TR0N, a highly general framework to turn pre-trained unconditional generative models, such as GANs and VAEs, into conditional models. The conditioning can be highly arbitrary, and requires only a pre-trained auxiliary model. For example, we show how to turn unconditional models into class-conditional ones with the help of a classifier, and also into text-to-image models by leveraging CLIP. TR0N learns a lightweight stochastic mapping which "translates" between the space of conditions and the latent space of the generative model, in such a way that the generated latent corresponds to a data sample satisfying the desired condition. The translated latent samples are then further improved upon through Langevin dynamics, enabling us to obtain higher-quality data samples. TR0N requires no training data nor fine-tuning, yet can achieve a zero-shot FID of 10.9 on MS-COCO, outperforming competing alternatives not only on this metric, but also in sampling speed -- all while retaining a much higher level of generality. Our code is available at https://github.com/layer6ai-labs/tr0n.
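The two-stage idea in the abstract (a stochastic translator proposes a latent for a given condition, then Langevin dynamics refines it) can be illustrated with a toy sketch. This is not the authors' implementation. Everything here is an assumption made for illustration: the fixed linear "generator" `W`, the identity auxiliary model, the squared-error energy, the pseudo-inverse-plus-noise `translate` stand-in for the learned translator, and the step size and inverse temperature `beta`. The real method uses pre-trained neural networks such as StyleGAN-XL and CLIP.

```python
import numpy as np

# Toy "generator": a fixed linear map from a 4-d latent to a 3-d sample.
# (Assumption: stands in for a pre-trained GAN/VAE decoder.)
W = np.array([[1.0, 0.5, 0.0, 0.0],
              [0.0, 1.0, 0.5, 0.0],
              [0.0, 0.0, 1.0, 0.5]])

def energy(z, c, W):
    """Squared-error energy between the generated sample W @ z and condition c.
    (Assumption: the auxiliary model is the identity.)"""
    r = W @ z - c
    return 0.5 * float(r @ r)

def energy_grad(z, c, W):
    """Gradient of the energy with respect to the latent z."""
    return W.T @ (W @ z - c)

def translate(c, W, rng, noise_std=0.5):
    """Hypothetical stochastic translator: a rough latent guess for c.
    Here it is a pseudo-inverse solution plus Gaussian noise, not a learned network."""
    return np.linalg.pinv(W) @ c + noise_std * rng.standard_normal(W.shape[1])

def langevin_refine(z, c, W, rng, steps=200, step_size=0.5, beta=1e4):
    """Unadjusted Langevin dynamics on the latent:
    a gradient step on the energy plus Gaussian noise scaled by 1/beta."""
    for _ in range(steps):
        noise = rng.standard_normal(z.shape)
        z = z - 0.5 * step_size * energy_grad(z, c, W) \
              + np.sqrt(step_size / beta) * noise
    return z

rng = np.random.default_rng(0)
c = np.array([1.0, -2.0, 0.5])          # the desired condition
z0 = translate(c, W, rng)                # stage 1: translated latent
z1 = langevin_refine(z0, c, W, rng)      # stage 2: Langevin refinement
print("energy before refinement:", energy(z0, c, W))
print("energy after refinement: ", energy(z1, c, W))
```

In this toy setting the refined latent should have a much lower energy than the translator's initial guess, which mirrors the abstract's claim that Langevin dynamics improves the translated samples.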

Results

Task | Dataset | Metric | Value | Model
Image Generation | COCO (Common Objects in Context) | FID | 10.9 | TR0N (StyleGAN-XL, LAION2BCLIP, BLIP-2, zero-shot)
Text-to-Image Generation | COCO (Common Objects in Context) | FID | 10.9 | TR0N (StyleGAN-XL, LAION2BCLIP, BLIP-2, zero-shot)
10-shot image generation | COCO (Common Objects in Context) | FID | 10.9 | TR0N (StyleGAN-XL, LAION2BCLIP, BLIP-2, zero-shot)
1 Image, 2*2 Stitchi | COCO (Common Objects in Context) | FID | 10.9 | TR0N (StyleGAN-XL, LAION2BCLIP, BLIP-2, zero-shot)
