Home MarkTechPost Updates to Gemini 2.5 from Google DeepMind
MarkTechPost

Updates to Gemini 2.5 from Google DeepMind

Share
Updates to Gemini 2.5 from Google DeepMind
Share


New Gemini 2.5 capabilities

Native audio output and improvements to Live API

Today, the Live API is introducing a preview version of audio-visual input and native audio out dialogue, so you can directly build conversational experiences, with a more natural and expressive Gemini.

It also allows the user to steer its tone, accent and style of speaking. For example, you can tell the model to use a dramatic voice when telling a story. And it supports tool use, to be able to search on your behalf.

You can experiment with a set of early features, including:

  • Affective Dialogue, in which the model detects emotion in the user’s voice and responds appropriately.
  • Proactive Audio, in which the model will ignore background conversations and know when to respond.
  • Thinking in the Live API, in which the model leverages Gemini’s thinking capabilities to support more complex tasks.

We’re also releasing new previews for text-to-speech in 2.5 Pro and 2.5 Flash. These have first-of-its-kind support for multiple speakers, enabling text-to-speech with two voices via native audio out.

Like Native Audio dialogue, text-to-speech is expressive, and can capture really subtle nuances, such as whispers. It works in over 24 languages and seamlessly switches between them.



Source link

Share

Leave a comment

Leave a Reply

Your email address will not be published. Required fields are marked *

By submitting this form, you are consenting to receive marketing emails and alerts from: techaireports.com. You can revoke your consent to receive emails at any time by using the Unsubscribe link, found at the bottom of every email.

Latest Posts

Related Articles
Experiment with Gemini 2.0 Flash native image generation
MarkTechPost

Experiment with Gemini 2.0 Flash native image generation

In December we first introduced native image output in Gemini 2.0 Flash...

Gemini Robotics brings AI into the physical world
MarkTechPost

Gemini Robotics brings AI into the physical world

Models Published 12 March 2025 Authors Carolina Parada Introducing Gemini Robotics, our...

Our newest Gemini model with thinking
MarkTechPost

Our newest Gemini model with thinking

Last updated March 26 Today we’re introducing Gemini 2.5, our most intelligent...

Evaluating potential cybersecurity threats of advanced AI
MarkTechPost

Evaluating potential cybersecurity threats of advanced AI

Artificial intelligence (AI) has long been a cornerstone of cybersecurity. From malware...