Home OpenAI Google Unveils Gemini 2.5 Flash in Preview through the Gemini API via Google AI Studio and Vertex AI.

OpenAI

Google Unveils Gemini 2.5 Flash in Preview through the Gemini API via Google AI Studio and Vertex AI.

adminUpdated 4 months Ago1 Mins read45 Views

Google Unveils Gemini 2.5 Flash in Preview through the Gemini API via Google AI Studio and Vertex AI.

Google has introduced Gemini 2.5 Flash, an early-preview AI model accessible via the Gemini API through Google AI Studio and Vertex AI. This model builds upon the foundation of Gemini 2.0 Flash, offering enhanced reasoning capabilities while maintaining a focus on speed and cost-efficiency.

Hybrid Reasoning with Adjustable Thinking Budgets

A key feature of Gemini 2.5 Flash is its hybrid reasoning capability, allowing developers to enable or disable the model’s “thinking” process. This process involves the model reasoning through its thoughts before generating a response, which can be beneficial for complex tasks requiring multiple steps of reasoning, such as solving math problems or analyzing research questions.

To provide flexibility, developers can set a “thinking budget” that controls the maximum number of tokens the model can generate during its thinking phase. A higher budget permits more extensive reasoning, potentially improving the quality of responses for complex prompts. Importantly, the model does not use the full budget if the prompt does not necessitate it, ensuring efficiency for simpler tasks.

Performance and Cost Considerations

Gemini 2.5 Flash maintains the fast speeds of its predecessor, Gemini 2.0 Flash, even when the thinking process is disabled. This design allows developers to optimize for latency and cost when high-level reasoning is unnecessary. By adjusting the thinking budget, developers can find the appropriate balance between response quality, cost, and latency to suit their specific application needs.

Integration and Accessibility

The model is currently available in preview through Google AI Studio and Vertex AI. Developers can experiment with Gemini 2.5 Flash by accessing it via the Gemini API, enabling them to build and test applications that leverage the model’s hybrid reasoning capabilities.

Check out the Technical details. Also, don’t forget to follow us on Twitter and join our Telegram Channel and LinkedIn Group. Don’t Forget to join our 90k+ ML SubReddit.

🔥 [Register Now] miniCON Virtual Conference on AGENTIC AI: FREE REGISTRATION + Certificate of Attendance + 4 Hour Short Event (May 21, 9 am- 1 pm PST) + Hands on Workshop

Sana Hassan, a consulting intern at Marktechpost and dual-degree student at IIT Madras, is passionate about applying technology and AI to address real-world challenges. With a keen interest in solving practical problems, he brings a fresh perspective to the intersection of AI and real-life solutions.

Source link

Previous post A Hands-On Tutorial: Build a Modular LLM Evaluation Pipeline with Google Generative AI and LangChain

IBM Releases Granite 3.3 8B: A New Speech-to-Text (STT) Model that Excels in Automatic Speech Recognition (ASR) and Automatic Speech Translation (AST)

Next post IBM Releases Granite 3.3 8B: A New Speech-to-Text (STT) Model that Excels in Automatic Speech Recognition (ASR) and Automatic Speech Translation (AST)

SEA-LION v4: Multimodal Language Modeling for Southeast Asia

AI Singapore (AISG) has released SEA-LION v4, an open-source multimodal language model...

admin3 Mins read

OpenAI

How to Implement the LLM Arena-as-a-Judge Approach to Evaluate Large Language Model Outputs

In this tutorial, we will explore how to implement the LLM Arena-as-a-Judge...

admin4 Mins read

OpenAI

How Do GPUs and TPUs Differ in Training Large Transformer Models? Top GPUs and TPUs with Benchmark

Both GPUs and TPUs play crucial roles in accelerating the training of...

admin4 Mins read

OpenAI

Google AI Introduced Guardrailed-AMIE (g-AMIE): A Multi-Agent Approach to Accountability in Conversational Medical AI

Recent advances in large language model (LLM)-powered diagnostic AI agents have yielded...

admin3 Mins read

This Week

Bots Are Taking Over the Internet—And They’re Not Asking for Permission

Zhipu AI Unveils ComputerRL: An AI Framework Scaling End-to-End Reinforcement Learning for Computer Use Agents

Unfiltered Roleplay AI Chatbots with Pictures – My Top Picks

Weekly Newsletter

Google Unveils Gemini 2.5 Flash in Preview through the Gemini API via Google AI Studio and Vertex AI.

Hybrid Reasoning with Adjustable Thinking Budgets

Performance and Cost Considerations

Integration and Accessibility

Leave a comment

Leave a Reply Cancel reply

Latest Posts

Zhipu AI Unveils ComputerRL: An AI Framework Scaling End-to-End Reinforcement Learning for Computer Use Agents

Unfiltered Roleplay AI Chatbots with Pictures – My Top Picks

Top 10 AI Blogs and News Websites for AI Developers and Engineers in 2025

AI-Powered Content Creation Gives Your Docs and Slides New Life

SEA-LION v4: Multimodal Language Modeling for Southeast Asia

How to Implement the LLM Arena-as-a-Judge Approach to Evaluate Large Language Model Outputs

How Do GPUs and TPUs Differ in Training Large Transformer Models? Top GPUs and TPUs with Benchmark

Google AI Introduced Guardrailed-AMIE (g-AMIE): A Multi-Agent Approach to Accountability in Conversational Medical AI

Get to Know Us

keep in touch