Google DeepMind · The Agentic Era
Gemini Flash 2.0
Google's workhorse: fast, cheap, multimodal. The Sonnet of the Gemini family.
By C.W. Jameson · Published 19 May 2026 · Last reviewed 19 May 2026
Flash 2.0 is what Google ships when they need Gemini capability at Haiku pricing. It handles images, audio, and long documents at a cost that makes real-time applications viable. The model does not push the reasoning frontier; it covers the median use case at a rate that embarrasses most competitors.
Field signature
Sub-second response on short text prompts.
Specifications
| Released | 2025-02 |
|---|---|
| Context window | 1,000,000 tokens |
| Pricing | $0.075 / $0.30 per million tokens |
| Modalities | text · image · audio · video |
| License | Commercial API only |
| Era | The Agentic Era |
Strengths
- Cost
- Speed
- Multimodal breadth
Weaknesses
- Reasoning depth
- Tool-use compared to Claude
Authentication markers
The fingerprints by which Gemini Flash 2.0 can be identified from its output alone.
| Tell | Meaning |
|---|---|
| Will answer about video timestamps without preprocessing. | Gemini Flash or Pro lineage. |
Notable works
- Default model for Google's AI Studio free tier
Market position
$0.075-$0.30 per million tokens
Partner offer
Gemini's long-context window has no peer at the time of this entry.
Try Gemini →Affiliate link — see disclosure.
Primary sources
From the Almanac shop
The Operator's Compendium
Every agent harness, every routing pattern, every cost trick. 90-page PDF.
$29 — Coming soon