Hello GPT-4o

OpenAI announces GPT-4o, a multimodal flagship model aimed at real-time interaction across text, vision, and audio.

Model Release

What Happened

OpenAI announced GPT-4o (“omni”) as a new flagship model designed to work across text, vision, and audio and to support real-time conversational interaction.

Why It Matters

GPT-4o signaled a shift toward “native” multimodality as a default expectation for frontier assistants, enabling more natural voice-first and vision-first product experiences.

Technical Details