DALL·E 2

Computer Vision · Introduced 2022 · 2 papers

Description

DALL·E 2 is a generative text-to-image model made up of two main components: a prior that generates a CLIP image embedding given a text caption, and a decoder that generates an image conditioned on the image embedding.
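The two-stage data flow described above can be sketched roughly as follows. This is a toy illustration only, not the actual DALL·E 2 implementation: `encode_text`, `prior`, `decoder`, and `generate` are hypothetical names, the embedding size and image shape are arbitrary, and the random perturbation and random linear map stand in for the learned prior and decoder. The explicit text-encoding step is also an assumption added to make the sketch self-contained.

```python
import numpy as np

EMBED_DIM = 8            # assumed toy embedding size (real CLIP embeddings are much larger)
IMAGE_SHAPE = (4, 4, 3)  # assumed toy image shape: height, width, channels


def encode_text(caption: str, dim: int = EMBED_DIM) -> np.ndarray:
    """Hypothetical stand-in for a text encoder: a deterministic unit vector per caption."""
    seed = sum(caption.encode("utf-8"))
    rng = np.random.default_rng(seed)
    vec = rng.standard_normal(dim)
    return vec / np.linalg.norm(vec)


def prior(text_embedding: np.ndarray, rng: np.random.Generator) -> np.ndarray:
    """Toy prior: samples an image embedding conditioned on the text embedding.

    A small random perturbation replaces the learned generative prior.
    """
    sample = text_embedding + 0.1 * rng.standard_normal(text_embedding.shape)
    return sample / np.linalg.norm(sample)


def decoder(image_embedding: np.ndarray, rng: np.random.Generator) -> np.ndarray:
    """Toy decoder: maps an image embedding to pixel values in [0, 1].

    A random linear map followed by a sigmoid replaces the learned decoder.
    """
    n_pixels = int(np.prod(IMAGE_SHAPE))
    weights = rng.standard_normal((n_pixels, image_embedding.size))
    pixels = 1.0 / (1.0 + np.exp(-(weights @ image_embedding)))
    return pixels.reshape(IMAGE_SHAPE)


def generate(caption: str, seed: int = 0) -> np.ndarray:
    """Caption -> text embedding -> (prior) image embedding -> (decoder) image."""
    rng = np.random.default_rng(seed)
    text_embedding = encode_text(caption)
    image_embedding = prior(text_embedding, rng)
    return decoder(image_embedding, rng)


if __name__ == "__main__":
    image = generate("a corgi playing a trumpet")
    print(image.shape)
```

The point of the sketch is the separation of concerns: the prior works entirely in embedding space, and the decoder only ever sees the image embedding it is conditioned on.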

Papers Using This Method

A Comprehensive Survey of AI-Generated Content (AIGC): A History of Generative AI from GAN to ChatGPT (2023-03-07)
Hierarchical Text-Conditional Image Generation with CLIP Latents (2022-04-13)