Remember that lull from the past few months where we didn’t get the surprise announcement of another new “revolutionary” AI video model every single day? Well, that lull is over as the AI wars are heating up once again.

However, with this AI revolution now several months and years old, it is admittedly getting harder to keep up with what every model is offering these days. We’re also several years into generative AI video, and it still hasn’t taken over in any meaningful way.

Will this new Gemini Omni model from Google be any different? It sure wants to be, with its promise to “create anything from anything,” a promise that starts with video. But can it? Let’s take a look.


Google Introduces Gemini Omni

So, Google’s Gemini Omni Flash is here as a new AI model that serves as a follow-up to the company’s popular Nano Banana model. However, while Nano Banana focused on image generation and editing, the new Gemini Omni model focuses on video. Or, as the company puts it, video to start.

The real promise of Gemini Omni is that it is an AI model that can “create anything” from any input. If this sounds a bit vague to you, it does to us as well. In practice, it kind of just sounds like any other generative AI video model. Google reports that to use it, users will input images, audio, video, or text, and the model will generate high-quality videos grounded in its real-world knowledge.

From there, AI creators can then easily edit their videos through natural conversation, as you would with any other LLM-style video model.

Accurate Physics and Added Effects

From what we can tell, what makes this Gemini Omni model unique will be its ability to build scenes that look more realistic than those of its competitors.

It will be able to do this because, as Google cites, it has a deeper knowledge of history, science, and cultural context than its competitors,, which will allow it to have a more intuitive understanding of physics and other forces such as gravity, kinetic energy, and fluid dynamics.

Along with conversational editing, the other big selling point being pushed by Google for Gemini Omni is that it will allow users to define the visual language for their generation by applying styles, motion, and effects.

AI creators will also be able to create videos with their own voices by using digital avatars, which the company claims can help protect users from harm and govern the use of its AI tools.

Find Out More

As always, if AI isn’t your thing, you’re obviously under no obligation to explore any of these models or technologies any further. If you’re intrigued for any reason, though, you can find out more about how to try out Gemini Omni on Google’s page here.