ARTICLE AD BOX
Google has expanded its family of
Gemini
2.5 of hybrid reasoning
AI models
. The company said that its
Gemini 2.5 Pro
and
Gemini 2.5 Flash
models are now generally available. Further, it released a preview of the new 2.5 Flash-Lite model which it claims is its most cost-efficient and fastest model yet."We designed Gemini 2.5 to be a family of hybrid reasoning models that provide amazing performance, while also being at the Pareto Frontier of cost and speed," Google stated in its announcement.
General availability of Gemini 2.5 Pro and Gemini 2.5 Flash models
The generally available versions of Gemini 2.5 Flash and 2.5 Pro are now ready for production applications, a move Google attributes to valuable developer feedback gathered over recent weeks. Adding to the lineup, Google has introduced a preview of Gemini 2.5 Flash-Lite, touted as its most cost-efficient and fastest 2.5 model to date.
"Gemini 2.5 Pro + 2.5 Flash are now stable and generally available. Plus, get a preview of Gemini 2.5 Flash-Lite, our fastest + most cost-efficient 2.5 model yet," Google CEO Sundar Pichai said in a post on X."Exciting steps as we expand our 2.5 series of hybrid reasoning models that deliver amazing performance at the Pareto frontier of cost and speed," he added.
Google says that this new version is designed to excel in high-volume, latency-sensitive tasks like translation and classification, offering lower latency than its predecessors, 2.0 Flash-Lite and 2.0 Flash, across a wide range of prompts.Despite its enhanced efficiency, 2.5 Flash-Lite retains the core capabilities that define the Gemini 2.5 family. These include the ability to adjust computational "thinking" based on budget, integrate with tools such as Google Search and code execution, support multimodal input (processing various data types), and offer a substantial 1-million-token context length, the company says. According to Google, the model also demonstrates "all-around higher quality" than 2.0 Flash-Lite across benchmarks in coding, math, science, reasoning, and multimodal tasks.Developers can access the preview of Gemini 2.5 Flash-Lite through
Google AI Studio
and Vertex AI, alongside the newly stable versions of 2.5 Flash and Pro. Both 2.5 Flash and Pro are also now accessible directly within the Gemini app. Furthermore, custom versions of 2.5 Flash-Lite and Flash have been integrated into Google Search.