Home OpenAI Google Unveils Gemini 2.5 Flash in Preview through the Gemini API via Google AI Studio and Vertex AI.
OpenAI

Google Unveils Gemini 2.5 Flash in Preview through the Gemini API via Google AI Studio and Vertex AI.

Share
Google Unveils Gemini 2.5 Flash in Preview through the Gemini API via Google AI Studio and Vertex AI.
Share


Google has introduced Gemini 2.5 Flash, an early-preview AI model accessible via the Gemini API through Google AI Studio and Vertex AI. This model builds upon the foundation of Gemini 2.0 Flash, offering enhanced reasoning capabilities while maintaining a focus on speed and cost-efficiency.​

Hybrid Reasoning with Adjustable Thinking Budgets

A key feature of Gemini 2.5 Flash is its hybrid reasoning capability, allowing developers to enable or disable the model’s “thinking” process. This process involves the model reasoning through its thoughts before generating a response, which can be beneficial for complex tasks requiring multiple steps of reasoning, such as solving math problems or analyzing research questions.​

To provide flexibility, developers can set a “thinking budget” that controls the maximum number of tokens the model can generate during its thinking phase. A higher budget permits more extensive reasoning, potentially improving the quality of responses for complex prompts. Importantly, the model does not use the full budget if the prompt does not necessitate it, ensuring efficiency for simpler tasks.​

Performance and Cost Considerations

Gemini 2.5 Flash maintains the fast speeds of its predecessor, Gemini 2.0 Flash, even when the thinking process is disabled. This design allows developers to optimize for latency and cost when high-level reasoning is unnecessary. By adjusting the thinking budget, developers can find the appropriate balance between response quality, cost, and latency to suit their specific application needs.​

Integration and Accessibility

The model is currently available in preview through Google AI Studio and Vertex AI. Developers can experiment with Gemini 2.5 Flash by accessing it via the Gemini API, enabling them to build and test applications that leverage the model’s hybrid reasoning capabilities.​


Check out the Technical details. Also, don’t forget to follow us on Twitter and join our Telegram Channel and LinkedIn Group. Don’t Forget to join our 90k+ ML SubReddit.

🔥 [Register Now] miniCON Virtual Conference on AGENTIC AI: FREE REGISTRATION + Certificate of Attendance + 4 Hour Short Event (May 21, 9 am- 1 pm PST) + Hands on Workshop


Sana Hassan, a consulting intern at Marktechpost and dual-degree student at IIT Madras, is passionate about applying technology and AI to address real-world challenges. With a keen interest in solving practical problems, he brings a fresh perspective to the intersection of AI and real-life solutions.



Source link

Share

Leave a comment

Leave a Reply

Your email address will not be published. Required fields are marked *

By submitting this form, you are consenting to receive marketing emails and alerts from: techaireports.com. You can revoke your consent to receive emails at any time by using the Unsubscribe link, found at the bottom of every email.

Latest Posts

Related Articles
SEA-LION v4: Multimodal Language Modeling for Southeast Asia
OpenAI

SEA-LION v4: Multimodal Language Modeling for Southeast Asia

AI Singapore (AISG) has released SEA-LION v4, an open-source multimodal language model...

How to Implement the LLM Arena-as-a-Judge Approach to Evaluate Large Language Model Outputs
OpenAI

How to Implement the LLM Arena-as-a-Judge Approach to Evaluate Large Language Model Outputs

In this tutorial, we will explore how to implement the LLM Arena-as-a-Judge...

How Do GPUs and TPUs Differ in Training Large Transformer Models? Top GPUs and TPUs with Benchmark
OpenAI

How Do GPUs and TPUs Differ in Training Large Transformer Models? Top GPUs and TPUs with Benchmark

Both GPUs and TPUs play crucial roles in accelerating the training of...

Google AI Introduced Guardrailed-AMIE (g-AMIE): A Multi-Agent Approach to Accountability in Conversational Medical AI
OpenAI

Google AI Introduced Guardrailed-AMIE (g-AMIE): A Multi-Agent Approach to Accountability in Conversational Medical AI

Recent advances in large language model (LLM)-powered diagnostic AI agents have yielded...