Home MarkTechPost Gemini 2.5’s native audio capabilities
MarkTechPost

Gemini 2.5’s native audio capabilities

Share
Gemini 2.5’s native audio capabilities
Share


Safety and responsibility

We’ve proactively assessed potential risks throughout every stage of the development process for these native audio features, using what we’ve learned to inform our mitigation strategies. We validate these measures through rigorous internal and external safety evaluations, including comprehensive red teaming for responsible deployment. Additionally, all audio outputs from our models are embedded with SynthID, our watermarking technology, to ensure transparency by making AI-generated audio identifiable.

Native audio capabilities for developers

We’re bringing native audio outputs to Gemini 2.5 models, giving developers new capabilities to build richer, more interactive applications via the Gemini API in Google AI Studio or Vertex AI.

To begin exploring, developers can try native audio dialog with Gemini 2.5 Flash preview in Google AI Studio’s stream tab. Controllable speech generation (TTS) is available in preview for both Gemini 2.5 Pro and Flash by selecting speech generation in the generate media tab within Google AI Studio.



Source link

Share

Leave a comment

Leave a Reply

Your email address will not be published. Required fields are marked *

Related Articles
Gemini 2.5 model family expands
MarkTechPost

Gemini 2.5 model family expands

We designed Gemini 2.5 to be a family of hybrid reasoning models...

Gemini 2.5: Updates to our family of thinking models
MarkTechPost

Gemini 2.5: Updates to our family of thinking models

Today we are excited to share updates across the board to our...

combining generative AI with live-action filmmaking
MarkTechPost

combining generative AI with live-action filmmaking

Today, Eliza McNitt’s short film, “ANCESTRA,” premieres at the Tribeca Festival. It’s...

Fuel your creativity with new generative media models and tools
MarkTechPost

Fuel your creativity with new generative media models and tools

Today, we’re announcing our newest generative media models, which mark significant breakthroughs....