llm-gemini Adds Gemini-3.5-Flash and Streaming — Topic | TechScan AI — Tech & AI News

llm-gemini Adds Gemini-3.5-Flash and Streaming

Simon Willison released two updates to the open-source llm-gemini plugin that streamline developer access to Google’s latest Gemini models. Version 0.32 adds explicit support for the higher-performance gemini-3.5-flash model and includes a demo showing the upgraded capabilities. The subsequent 0.32a0 alpha release expands compatibility with the llm framework (llm>=0.32a0) and introduces streaming of reasoning tokens, improving responsiveness and observability for applications. Together these releases signal active ecosystem development around model integrations and offer an easy path for developers and tools to experiment with and deploy Gemini models within existing llm-based workflows.

1.9

Rising

News Items

Articles

Sources

First Seen

2026-05-20 00:03:22

7-Day Trend

05-20

05-21

Source Breakdown

AI Blogs (2)agent-collect (2)

Key Entities

Simon WillisonGooglegemini-3.5-flash(Google)OpenAIGPT-4o(OpenAI)Google Gemini(Google)

Why It Matters

These updates make it easier for developers to access higher-performance Gemini models and integrate them into llm-based tooling. Streaming reasoning tokens improves responsiveness and debugging for applications using Gemini models.

Latest Changes

Added explicit support for gemini-3.5-flash in llm-gemini 0.32
Introduced streaming of reasoning tokens in llm-gemini 0.32a0
0.32a0 requires llm framework version llm>=0.32a0 alpha

Timeline

2026-05-19 — llm-gemini development referenced around Gemini integration work by Simon Willison
2026-05-20 — llm-gemini 0.32 announced adding support for gemini-3.5-flash
2026-05-20 — llm-gemini 0.32a0 announced with streaming of reasoning tokens and llm compatibility notes
2026-05-21 — Repeated announcements on 0.32 and 0.32a0 highlighting model support and streaming features

What to Watch

Adoption of gemini-3.5-flash in llm workflows and demos
Updates to the llm framework that affect plugin compatibility and streaming APIs

Dossier last updated: 2026-05-21 09:16:45

Recent News (4)

llm-gemini 0.32a0

Simon Willison announced llm-gemini 0.32a0, a new plugin that integrates Google’s Gemini family of models into the llm framework, released 19 May 2026. The plugin requires llm>=0.32a0 alpha and introduces the ability to stream reasoning tokens, enabling finer-grained output as models generate chain-of-thought or intermediate steps. This matters for developers and researchers building applications that need real-time token-level reasoning or observability from Gemini models, and reflects ongoing ecosystem work to support major LLM providers within open ML tooling. The post links this release to other recent LLM developments and targets practitioners using the llm library and Google’s Gemini models.

src_agent-collectrss-simonwillison4h ago

llm-gemini 0.32

Simon Willison released llm-gemini 0.32, an update to his LLM plugin that adds support for Google’s new gemini-3.5-flash model. The short post points readers to Willison’s separate notes on Gemini 3.5 Flash and mentions a pelican image generated using the updated plugin, highlighting practical experimentation. This matters for developers and researchers who use third-party integrators to access Gemini models, as plugin updates streamline adoption of new model variants and features. The release continues to track rapid LLM iteration and ecosystem tooling that connect open tooling to major model providers, affecting developer workflows and integration choices.

src_agent-collectrss-simonwillison4h ago

llm-gemini 0.32a0