MarkTechPost

Generating audio for video – Google DeepMind

By admin | Updated 10 months ago | 1 min read | 65 views

Acknowledgements

This work was made possible by the contributions of: Ankush Gupta, Nick Pezzotti, Pavel Khrushkov, Tobenna Peter Igwe, Kazuya Kawakami, Mateusz Malinowski, Jacob Kelly, Yan Wu, Xinyu Wang, Abhishek Sharma, Ali Razavi, Eric Lau, Serena Zhang, Brendan Shillingford, Yelin Kim, Eleni Shaw, Signe Nørly, Andeep Toor, Irina Blok, Gregory Shaw, Pen Li, Scott Wisdom, Aren Jansen, Zalán Borsos, Brian McWilliams, Salah Zaiem, Marco Tagliasacchi, Ron Weiss, Manoj Plakal, Hakan Erdogan, John Hershey, Jeff Donahue, Vivek Kumar, and Matt Sharifi.

We extend our gratitude to Benigno Uria, Björn Winckler, Charlie Nash, Conor Durkan, Cătălina Cangea, David Ding, Dawid Górny, Drew Jaegle, Ethan Manilow, Evgeny Gladchenko, Felix Riedel, Florian Stimberg, Henna Nandwani, Jakob Bauer, Junlin Zhang, Luis C. Cobo, Mahyar Bordbar, Miaosen Wang, Mikołaj Bińkowski, Sander Dieleman, Will Grathwohl, Yaroslav Ganin, Yusuf Aytar, and Yury Sulsky.

Special thanks to Aäron van den Oord, Andrew Zisserman, Tom Hume, RJ Mical, Douglas Eck, Nando de Freitas, Oriol Vinyals, Eli Collins, Koray Kavukcuoglu and Demis Hassabis for their insightful guidance and support throughout the research process.

We also acknowledge the many other individuals who contributed across Google DeepMind and our partners at Google.
