DALL·E 2

Computer VisionIntroduced 20002 papers

Description

DALL·E 2 is a generative text-to-image model made up of two main components: a prior that generates a CLIP image embedding given a text caption, and a decoder that generates an image conditioned on the image embedding.

Papers Using This Method