DALL·E

OpenAI introduces DALL·E, a 12-billion-parameter model that generates images from text descriptions, helping launch the modern era of text-to-image AI.

Model Release

What Happened

OpenAI revealed DALL·E, a 12-billion-parameter version of GPT-3 modified to generate images from text captions. Its name is a portmanteau of Salvador Dalí and Pixar's WALL·E. The model could create plausible images from natural language descriptions, including surreal combinations like "an armchair in the shape of an avocado."

Why It Matters

DALL·E demonstrated that the same autoregressive Transformer approach used for text could extend to image generation, showing that the architecture behind large language models could bridge modalities. It captured the public imagination and sparked intense interest in text-to-image AI, paving the way for DALL·E 2, Midjourney, Stable Diffusion, and the broader generative AI art movement.

Technical Details