Continuing to bring you our latest models, with an improved Gemini 2.5 Flash and Flash-Lite release

Date:

Share post:

Today, we are releasing updated versions of Gemini 2.5 Flash and 2.5 Flash-Lite, available on Google AI Studio and Vertex AI, aimed at continuing to deliver better quality while also improving the efficiency.

Improvements in quality and speed for Gemini 2.5 Flash and 2.5 Flash Lite preview models compared to the current stable models

Output token efficiency

50% reduction in output tokens (hence costs) for Gemini 2.5 Flash-Lite and 24% reduction for Gemini 2.5 Flash

Updated Gemini 2.5 Flash-Lite

The latest version of Gemini 2.5 Flash-Lite was trained and built based on three key themes:

  • Better instruction following: The model is significantly better at following complex instructions and system prompts.
  • Reduced verbosity: It now produces more concise answers, a key factor in reducing token costs and latency for high-throughput applications (see charts above).
  • Stronger multimodal & translation capabilities: This update features more accurate audio transcription, better image understanding, and improved translation quality.

You can start testing this version today using the following model string: gemini-2.5-flash-lite-preview-09-2025.


Updated Gemini 2.5 Flash

This latest 2.5 Flash model comes with improvements in two key areas we heard consistent feedback on:

  • Better agentic tool use: We’ve improved how the model uses tools, leading to better performance in more complex, agentic and multi-step applications. This model shows noticeable improvements on key agentic benchmarks, including a 5% gain on SWE-Bench Verified, compared to our last release (48.9% → 54%).
  • More efficient: With thinking on, the model is now significantly more cost-efficient—achieving higher quality outputs while using fewer tokens, reducing latency and cost (see charts above).

We’re already seeing positive feedback from early testers. As Yichao ‘Peak’ Ji, Co-Founder & Chief Scientist at Manus, an autonomous AI agent, noted: “The new Gemini 2.5 Flash model offers a remarkable blend of speed and intelligence. Our evaluation on internal benchmarks revealed a 15% leap in performance for long-horizon agentic tasks. Its outstanding cost-efficiency enables Manus to scale to unprecedented levels—advancing our mission to Extend Human Reach.”

You can start testing this preview version today by using the following model string: gemini-2.5-flash-preview-09-2025.


Start building with Gemini

Over the last year, we’ve learned that shipping preview versions of our models allows you to test our latest improvements and innovations, provide feedback, and build production-ready experiences with the best of Gemini. Today’s releases are not intended to graduate to a new, stable version but will help us shape our future stable releases, and allow us to continue iterating and bring you the best of Gemini.

To make it even easier to access our latest models while also reducing the need to keep track of long model string names, we are also introducing a -latest alias for each model family. This alias always points to our most recent model versions, allowing you to experiment with new features without needing to update your code for each release. You can access the new previews using:

To ensure you have time to test new models, we will always provide a 2-week notice (via email) before we make updates or deprecate a specific version behind -latest. These are just model aliases so the rate limits, cost, and features available may fluctuate between releases.

For applications that require more stability, continue to use gemini-2.5-flash and gemini-2.5-flash-lite.

We continue to push the frontier of what is possible with Gemini and this release is just another step in that direction. We will have more to share soon, but in the meantime, happy building!

Source link

spot_img

Related articles

Oracle Rushes Patch for CVE-2025-61882 After Cl0p Exploited It in Data Theft Attacks

Oct 06, 2025Ravie LakshmananVulnerability / Threat Intelligence Oracle has released an emergency update to address a critical security flaw...

Podcast #838 – Snapdragon X2 Elite Extreme, Another Unlikely Intel Investor, CRT Revivals, Brave Security, EA Buyout + MORE!

Matching up the audio this week for a change of pace!  That Snapdragon X2 Elite Extreme sometimes compares...

Level Up Your Dev Game: The Jules API is Here!

Ready to help turbocharge your workflow? We’re excited to introduce the Jules API—a new...