Google DeepMind’s new AI models help robots perform physical tasks, even without training

Google DeepMind is launching two new AI models designed to help robots “perform a wider range of real-world tasks than ever before.” The first, called Gemini Robotics, is a vision-language-action model capable of understanding new situations, even if it hasn’t been trained on them. Gemini Robotics is built on Gemini 2.0, the latest version of…

DeepMind’s new AI generates soundtracks and dialogue for videos

DeepMind, Google’s AI research lab, says it’s developing AI tech to generate soundtracks for videos. In a post on its official blog, DeepMind says that it sees the tech, V2A (short for “video-to-audio”), as an essential piece of the AI-generated media puzzle. While plenty of orgs including DeepMind have developed video-generating AI models, these models…
