Google Unveils Gemini 3.5 Flash, Pro Coming Soon

Google announced the Gemini 3.5 family with Gemini 3.5 Flash released publicly and Gemini 3.5 Pro used internally and expected next month. Flash targets agent workflows and complex, long-horizon programming tasks and is integrated across consumer apps, developer platforms, and enterprise tools. It offers massive context and output windows and a new Interactions API for server-side history, but comes with significantly higher pricing—roughly 3x–6x previous Flash rates—pushing costs near Gemini 3.1 Pro. The rollout highlights Google’s tiered model strategy, aiming to balance advanced capabilities with differentiated offerings for developers and enterprises amid rising LLM costs.

Latest Changes

Gemini 3.5 Flash launched publicly and deployed across consumer, developer, and enterprise products

Gemini 3.5 Pro is in internal use and expected to be released next month

Flash targets agent workflows and long-horizon programming with massive context and output windows

New Interactions API introduced for server-side history and multimodal interactions

Flash pricing is roughly 3x–6x previous Flash rates, approaching Gemini 3.1 Pro costs

Timeline

2026-05-19 — Google unveiled the Gemini 3.5 model family and debuted Gemini 3.5 Flash at Google I/O

2026-05-19 — Gemini 3.5 Flash made generally available across consumer apps and developer platforms

2026-05-20 — Reports confirmed Gemini 3.5 Pro is used internally and slated for public release next month

2026-05-21 — Coverage emphasized Google plans to use Flash broadly across its ecosystem despite higher pricing

Recent News (4)

Gemini 3.5 Flash: more expensive, but Google plan to use it for everything

Google announced Gemini 3.5 Flash at Google I/O and has deployed it broadly across consumer, developer, and enterprise products, including the Gemini app, AI Mode in Search, Google Antigravity, Gemini API in AI Studio and Android Studio, and Gemini Enterprise Agent Platform. The model (gemini-3.5-flash) supports a January 2025 knowledge cutoff, 1,048,576 input tokens and 65,536 output tokens, but lacks computer use. Google also introduced a beta Interactions API for server-side history management. Crucially, pricing jumped substantially—about 3x–6x versus earlier Flash variants ($1.50/million input, $9/million output), nearing Gemini 3.1 Pro rates—part of a wider industry trend of higher costs for top-tier models. Benchmarks show much higher run costs compared with prior Geminis and competitors.

src_agent-collectrss-simonwillison4h ago

@Balder13946731: Gemini 3.5 Flash, 仅仅是快速的版本就与深度思考的Opus 4.7 和 GPT5.5相当，那么毫无疑问 Gemini 3.5 Pro将强得可怕。 Antigravity和操作系统以及整个Google生态深度整合，无形之中与

src_sopilot @Balder13946731 (0粉)1d ago

谷歌正式发布Gemini 3.5：首发3.5 Flash，3.5 Pro下月推出

Google on May 19 unveiled the Gemini 3.5 model family, debuting Gemini 3.5 Flash as the first public release and announcing that a higher-tier Gemini 3.5 Pro is in internal use and slated for public release next month. Gemini 3.5 Flash is positioned to excel in agent-based workflows and programming, focusing on tackling complex, long-horizon tasks that deliver practical application value. The staged rollout signals Google’s push to iterate on its large language model lineup with differentiated tiers for performance and capabilities, intensifying competition in advanced AI models and developer tools.

Google Unveils Gemini 3.5 Flash, Pro Coming Soon

Why It Matters

Latest Changes

Timeline

What to Watch

Recent News (4)