Create and edit video from any input

Meet Gemini Omni Flash

Gemini Omni Flash is the first model in Google's Gemini Omni family, built to turn text, images, video, and audio references into high-quality videos with natural, conversational editing.

Explore Gemini Omni Flash

What Gemini Omni Flash Is Built For

⚡

Conversational Video Editing

Edit scenes with natural language, refine across multiple turns, and keep characters, motion, and context consistent as the idea evolves.

👁️

Multimodal References

Combine prompts, images, existing clips, drawings, style references, and audio cues into one cohesive video output.

📉

World-Aware Motion

Use Gemini's understanding of physics, objects, science, history, and culture to create videos that feel more grounded and coherent.

Gemini Omni Flash Overview

Category	Gemini Omni Flash	What It Means
Primary Use Case	AI video generation and editing	Create new clips or transform existing footage through prompts and references
Inputs	Text, images, video, and audio	Guide the model with scripts, sketches, footage, voice references, music, or mixed creative materials
Outputs	High-quality video with audio	Generate polished clips for storytelling, explainers, product shots, social content, and visual experiments

Frequently Asked Questions

What is Gemini Omni Flash?

Gemini Omni Flash is the first model in the Gemini Omni family. It combines Gemini's multimodal reasoning with generative media capabilities so users can create and edit video from text, image, video, and audio inputs.

What can it do with video?

It can create new scenes, apply reference images or motion, change camera angles, transform environments, add effects, and refine a video through step-by-step natural language instructions.

Where is Gemini Omni Flash available?

Google announced rollout through the Gemini app and Google Flow for Google AI subscribers, with availability also starting for YouTube Shorts and YouTube Create users. Developer and enterprise API access is planned after the initial launch. Content created or edited in supported Google products includes SynthID watermarking and C2PA Content Credentials.