Gemini Omni Flash is the first model in Google's Gemini Omni family, built to turn text, images, video, and audio references into high-quality videos with natural, conversational editing.
Explore Gemini Omni FlashEdit scenes with natural language, refine across multiple turns, and keep characters, motion, and context consistent as the idea evolves.
Combine prompts, images, existing clips, drawings, style references, and audio cues into one cohesive video output.
Use Gemini's understanding of physics, objects, science, history, and culture to create videos that feel more grounded and coherent.
| Category | Gemini Omni Flash | What It Means |
|---|---|---|
| Primary Use Case | AI video generation and editing | Create new clips or transform existing footage through prompts and references |
| Inputs | Text, images, video, and audio | Guide the model with scripts, sketches, footage, voice references, music, or mixed creative materials |
| Outputs | High-quality video with audio | Generate polished clips for storytelling, explainers, product shots, social content, and visual experiments |