© Copyright 2024 TechAiReports. All rights reserved.

Home MarkTechPost Updates to Gemini 2.5 from Google DeepMind

MarkTechPost

Updates to Gemini 2.5 from Google DeepMind

adminUpdated 2 months Ago1 Mins read22 Views

Share

Updates to Gemini 2.5 from Google DeepMind

Share

New Gemini 2.5 capabilities

Native audio output and improvements to Live API

Today, the Live API is introducing a preview version of audio-visual input and native audio out dialogue, so you can directly build conversational experiences, with a more natural and expressive Gemini.

It also allows the user to steer its tone, accent and style of speaking. For example, you can tell the model to use a dramatic voice when telling a story. And it supports tool use, to be able to search on your behalf.

You can experiment with a set of early features, including:

Affective Dialogue, in which the model detects emotion in the user’s voice and responds appropriately.
Proactive Audio, in which the model will ignore background conversations and know when to respond.
Thinking in the Live API, in which the model leverages Gemini’s thinking capabilities to support more complex tasks.

We’re also releasing new previews for text-to-speech in 2.5 Pro and 2.5 Flash. These have first-of-its-kind support for multiple speakers, enabling text-to-speech with two voices via native audio out.

Like Native Audio dialogue, text-to-speech is expressive, and can capture really subtle nuances, such as whispers. It works in over 24 languages and seamlessly switches between them.

Share

Identify content made with Google’s AI tools

Previous post Identify content made with Google’s AI tools

How Automation Is Quietly Reshaping Ecommerce Fulfillment

Next post How Automation Is Quietly Reshaping Ecommerce Fulfillment

Leave a comment

Leave a Reply Cancel reply

Latest Posts

AI Now Weaves Yarn Dreams into Digital Art

DeepMind

AI Now Weaves Yarn Dreams into Digital Art

admin3 Mins read

The Ultimate 2025 Guide to Coding LLM Benchmarks and Performance Metrics

OpenAI

The Ultimate 2025 Guide to Coding LLM Benchmarks and Performance Metrics

3 Mins read

Hypernatural Raises Eyebrows and Millions with Its Humanlike AI Video Creators—Is This the Next Hollywood Disruptor?

DeepMind

Hypernatural Raises Eyebrows and Millions with Its Humanlike AI Video Creators—Is This the Next Hollywood Disruptor?

3 Mins read

Top Local LLMs for Coding (2025)

OpenAI

Top Local LLMs for Coding (2025)

2 Mins read

Related Articles

Deep Think is now rolling out

Deep Think is now rolling out

How Deep Think works: extending Gemini’s parallel “thinking time” Just as people...

admin1 Mins read

AlphaEarth Foundations helps map our planet in unprecedented detail

AlphaEarth Foundations helps map our planet in unprecedented detail

Science Published 30 July 2025 Authors The AlphaEarth Foundations team New AI...

admin5 Mins read

Aeneas transforms how historians connect the past

Aeneas transforms how historians connect the past

Research Published 23 July 2025 Authors The Aeneas team Writing was everywhere...

admin5 Mins read

Gemini 2.5 Flash-Lite is now stable and generally available

Gemini 2.5 Flash-Lite is now stable and generally available

Today, we’re releasing the stable version of Gemini 2.5 Flash-Lite, our fastest...

admin2 Mins read