Agentic Vision in Gemini 3 Flash

Google introduces Agentic Vision in Gemini 3 Flash, enabling step-by-step visual investigation grounded in code execution.

Tool

What Happened

Google announced Agentic Vision as a capability in Gemini 3 Flash that treats vision understanding as an active, tool-augmented investigation.

Why It Matters

This is an explicit “vision + tools” milestone: instead of one-shot image understanding, the model can plan, zoom in on and inspect parts of an image, and ground its answers via code execution. That is an important direction for document analysis, forensics, and agentic UX.

Technical Details