Good Omni prompting comes down to one mindset shift: stop describing every pixel and start directing a collaborator that already understands the world. Brief it like you’d brief a talented cinematographer.
The 6 Dimensions (Google’s official framework)
Cover all six and Omni reasons about how they interact.
| Dimension | What it controls | Example terms |
|---|---|---|
| Subject / Action | Who/what, and what’s happening | characters, objects, motions |
| Setting / Location | Where the scene takes place | “misty forest,” “cyberpunk street” |
| Camera | Shot framing and movement | close-up, dolly zoom, orbit, locked off |
| Lighting | Source, quality, mood | golden hour, neon, soft diffused |
| Style | Visual aesthetic | photorealistic, claymation, anime |
| Audio | Sound design intent | ambient sound, synced effects, voiceover |
The formula: [SUBJECT] + [ACTION] + [SETTING] + [CAMERA] + [LIGHTING] + [STYLE] + [AUDIO]
Weak: “Generate a video of a coffee cup in a cafe.”
Strong: “A 10-second cinematic video in one continuous shot. A coffee cup on a worn wooden table, steam rising in dramatic backlighting. Camera starts close on the rim, slowly pulls back to reveal a rainy cafe window. Warm interior light against cool blue exterior. Photorealistic, shallow depth of field. Natural ambient cafe sounds, no music.”
The Opening-Line Trick
Your first sentence should lock duration, aspect ratio, style, and camera before any creative detail.
| Opening line | Best for |
|---|---|
A 10-second 16:9 cinematic video in one continuous shot. |
Product hero, cinematic |
A 10-second 9:16 photorealistic video in handheld vlog style. |
UGC, social, faceless narration |
A 10-second 1:1 animated explainer in flat-media style. |
Education, carousels |
A 10-second 16:9 video in locked-off static shot. |
Text-heavy, VFX, comparisons |
Duration rule of thumb: 4–5s hooks, 6–7s social ads, 8s showcases, 10s hard ceiling.
Camera Vocabulary (these are commands, not flavor)
