Gemini 3.5 Flash is live. Google pushed it into the Gemini API, Google AI Studio, and Vertex AI simultaneously with the I/O 2026 keynote announcement on May 19, no waitlist, no preview tier. That’s a meaningful distribution move. Developers can start building with it today.
One benchmark number is independently confirmable from Google’s own blog: 78% on SWE-bench Verified. Google frames that as outperforming prior flagship models on coding benchmarks. SWE-bench Verified is a respected software engineering evaluation, not a trivial claim. The DeepMind technical page adds two more confirmed figures: a 42% improvement over Flash 3 on Google’s long-range, multi-turn cyber benchmark, and a 72% reduction in token usage on that same test.
The catch is that all of these are self-reported benchmarks. Google evaluated its own model. Independent validation from Epoch AI or comparable third parties isn’t available yet. The Wire’s original reporting included four additional benchmark scores, Terminal-Bench 2.1, GDPval-AA, MCP Atlas, CharXiv, that couldn’t be verified against accessible source content. Those numbers don’t appear here. The 78% SWE-bench figure and the cyber benchmark improvements are what the record supports.
Don’t expect a clear price card yet. Google states Gemini 3.5 Flash runs at less than half the cost of comparable models, but exact token pricing wasn’t fully published at announcement time. For teams comparing inference costs against GPT-4o class models, the “less than half” framing is useful directional signal, not a procurement number.
Disputed Claim
Why this matters for practitioners
Gemini 3.5 Flash is positioned in the tier below Gemini 4.0, faster, cheaper, optimized for coding and agentic workflows rather than frontier reasoning tasks. That use-case split matters for teams building production pipelines. If your workload is code generation, multi-step agent execution, or high-volume API calls where cost scales, this is the model tier to evaluate. If you need the strongest available reasoning, Gemini 4.0 is the relevant comparison.
Google also announced Antigravity 2.0, a standalone desktop application for multi-agent workflow orchestration, now globally available. TechCrunch’s coverage confirmed the orchestration-multiple-agents framing independently. The two launches are connected, a fast, cost-efficient API model paired with a desktop orchestration layer designed to coordinate it.
Context
Google reported the Gemini app has surpassed 900 million monthly active users, according to the company. That’s a vendor-disclosed figure without independent verification, but the scale underscores why the Gemini 3.5 Flash pricing story matters beyond enterprise developers, it’s Google’s bet on making capable AI cheap enough to sustain mass-market deployment.
What to Watch
What to watch
Epoch AI or a comparable independent evaluation body publishing benchmark results for Gemini 3.5 Flash is the trigger that either validates Google’s 78% SWE-bench claim or contextualizes it. Watch also for token pricing documentation on the Gemini API pricing page, Google typically publishes this within days of a model launch. If pricing lands below $0.10/1M tokens for input at standard tier, that’s a meaningful undercutting of current mid-tier pricing.
TJS synthesis
Gemini 3.5 Flash’s immediate API availability makes it the most actionable announcement from I/O 2026 for developers. The 78% SWE-bench Verified score is real data, treat it as a floor until independent evaluation reports. Wait for Epoch AI benchmarks before migrating production agentic coding pipelines, but start your evaluation environment now.