# Complete Guide to Writing Prompts for Sora
**TL;DR:** Effective Sora prompts follow a structure: subject description + camera work + lighting + motion + style reference. Understanding cinematography terminology and how Sora interprets temporal information separates amateur results from professional-quality AI videos.
## Understanding How Sora Thinks
Sora operates fundamentally differently from image generators like Midjourney or DALL-E. Where image models interpret static visual compositions, Sora must understand motion physics, temporal consistency, camera behavior, and how scenes evolve across time. This requires prompt writers to think like directors rather than photographers.
### Key Differences in Sora's Processing
**Temporal Awareness:** Sora tracks how objects move through space over time, requiring descriptions that account for beginning, middle, and end states of motion.
**Physics Simulation:** The model understands gravity, inertia, collision, and natural movement patterns. Prompts that violate physics produce unstable results.
**Camera as Character:** Unlike image generators where framing is implicit, Sora requires explicit camera direction. The camera's movement is as important as subject movement.
**Lighting Continuity:** Sora maintains consistent lighting across frames. Describing light sources and their behavior prevents flickering or inconsistent illumination.
**Spatial Relationships:** Objects maintain position relative to each other. Describing spatial arrangements prevents subjects from phasing through each other or changing position unnaturally.
## The Five Pillars of Sora Prompt Structure
### Pillar 1: Subject Description
The foundation of any Sora prompt is precise subject description. Generic terms produce generic results. Specific details give Sora the information needed to render convincing video.
**Effective Subject Descriptions Include:**
**Physical Characteristics:** Age, gender, build, distinctive features, clothing details, hair style and color, facial features, posture, and body language.
**Spatial Position:** Where subjects exist in frame (foreground, background, center, left, right), their relationship to each other and to environment elements.
**Quantity and Scale:** How many subjects appear, their size relative to frame and surroundings, creating sense of scale through comparison.
**Material Properties:** Texture, reflectivity, opacity, and surface qualities that affect how light interacts with subjects.
**State and Condition:** New or aged, clean or weathered, static or dynamic, pristine or damaged condition.
**Example Weak Subject Description:**
"A woman walking in a city"
**Example Strong Subject Description:**
"Woman in her late 20s wearing black leather jacket and jeans, shoulder-length dark hair moving with her stride, walking through downtown area with glass office buildings visible behind her, autumn leaves scattered on sidewalk"
The strong version gives Sora specific visual anchors: age range, clothing details, hair length and behavior, environmental context, seasonal indicators. Each detail constrains the generation space toward your intended vision.
### Pillar 2: Camera Language
Professional cinematography uses specific terminology to describe camera position, movement, and framing. Sora responds best to these established terms because they carry precise meaning about how video should look.
**Essential Camera Terminology:**
**Shot Types:**
- **Extreme Wide Shot (EWS):** Subject small in frame, emphasizing environment
- **Wide Shot (WS):** Full body visible with surrounding context
- **Medium Shot (MS):** Waist up, balanced between subject and environment
- **Close-Up (CU):** Head and shoulders, emphasis on facial expression
- **Extreme Close-Up (ECU):** Specific detail fills frame, eyes or hands
**Camera Movements:**
- **Static/Locked:** Camera remains motionless, fixed perspective
- **Pan:** Horizontal rotation left or right on fixed axis
- **Tilt:** Vertical rotation up or down on fixed axis
- **Dolly/Track:** Camera moves smoothly forward or backward
- **Truck:** Camera moves left or right parallel to subject
- **Pedestal:** Camera moves up or down vertically
- **Crane:** Sweeping movement combining multiple axes
- **Handheld:** Unstabilized, organic human movement
- **Steadicam:** Smooth handheld, fluid motion without shake
**Camera Angles:**
- **Eye Level:** Camera at subject's eye height, neutral perspective
- **High Angle:** Camera above subject looking down, minimizing
- **Low Angle:** Camera below subject looking up, empowering
- **Bird's Eye View:** Directly overhead, planning perspective
- **Dutch Angle:** Tilted horizon, creates unease or dynamism
**Example Camera Language in Practice:**
**Weak Prompt:** "Video of person talking"
**Strong Prompt:** "Medium close-up shot of person talking directly to camera, eye level angle, camera slowly pushing in throughout shot, shallow depth of field"
The strong version specifies exact framing (medium close-up), angle (eye level), movement (push in), and depth (shallow). These technical terms tell Sora precisely how to construct the shot.
### Pillar 3: Lighting Description
Lighting determines mood, guides attention, reveals form, and separates professional from amateur video. Sora needs explicit lighting direction to create cinematic results.
**Key Lighting Concepts:**
**Quality of Light:**
- **Hard Light:** Direct, creates sharp shadows, high contrast
- **Soft Light:** Diffused, gentle shadows, low contrast
- **Ambient Light:** Overall environmental illumination
**Direction of Light:**
- **Front Light:** Source behind camera, flat illumination
- **Side Light:** Light from 90 degrees, reveals texture and form
- **Backlight:** Source behind subject, rim lighting effect
- **Top Light:** Overhead source, dramatic downward shadows
- **Bottom Light:** Upward source, unnatural and eerie
**Time of Day:**
- **Golden Hour:** Warm, soft light near sunrise/sunset
- **Blue Hour:** Cool twilight before sunrise/after sunset
- **Midday:** Harsh overhead sun, high contrast
- **Night:** Artificial sources, high contrast, colored light
**Color Temperature:**
- **Warm:** Orange/yellow tones, inviting and comfortable
- **Cool:** Blue tones, clinical or melancholic
- **Mixed:** Multiple color temperatures in same frame
**Atmospheric Effects:**
- **Mist/Fog:** Softens light, adds depth and atmosphere
- **Rain:** Reflects lights, creates bokeh and texture
- **Dust Particles:** Visible light beams, adds dimensionality
- **Smoke:** Diffuses light, adds drama and mystery
**Example Lighting Descriptions:**
**Weak:** "Good lighting"
**Strong:** "Soft natural window light from the left creating gentle shadows on the right side of face, warm afternoon color temperature, dust particles visible in light beams"
**Another Strong Example:** "Harsh single-source overhead fluorescent lighting, cool color temperature, deep shadows under eyes and chin, clinical institutional atmosphere"
### Pillar 4: Motion and Physics
Describing how subjects and camera move through space and time is where Sora diverges most from image generators. Motion must be physically plausible and clearly described.
**Types of Motion to Describe:**
**Subject Motion:**
- **Speed:** Slow, walking pace, running, rapid
- **Direction:** Forward, backward, left, right, diagonal, circular
- **Quality:** Smooth, jerky, fluid, mechanical, organic
- **Acceleration:** Starting, stopping, speeding up, slowing down
**Environmental Motion:**
- Wind effects on hair, clothing, vegetation
- Water movement, waves, currents, splashes
- Vehicle traffic, crowds, background activity
- Natural phenomena like falling leaves, flying birds
**Combined Subject and Camera Motion:**
- **Tracking shot:** Camera follows subject maintaining distance
- **Leading shot:** Camera precedes subject showing destination
- **Orbiting shot:** Camera circles around static subject
- **Parallax:** Subject and background move at different rates
**Temporal Effects:**
- **Real-time:** Normal speed matching reality
- **Slow-motion:** Slowed for emphasis or drama
- **Time-lapse:** Compressed time showing change
- **Speed ramping:** Changing speed within single shot
**Physics Considerations:**
Sora simulates realistic physics. Descriptions that violate physics produce unstable or uncanny results.
- **Respect Gravity:** Objects fall downward unless supported, projectiles arc
- **Maintain Momentum:** Moving objects continue moving, stopping requires force
- **Collision Response:** Objects interact physically, cannot pass through each other
- **Weight Behavior:** Heavy objects move differently than light ones
- **Material Properties:** Rigid vs flexible, solid vs liquid behavior
**Example Motion Descriptions:**
**Weak:** "Person moving fast"
**Strong:** "Runner accelerating from standing start, legs pumping, leaning forward into sprint, camera tracking alongside maintaining medium shot, slight handheld shake suggesting urgency"
**Another Strong Example:** "Leaf falling from tree, tumbling and spinning, wind causing unpredictable drift left then right, slow-motion capture showing rotation, landing gently on ground"
### Pillar 5: Style and Reference
Style references give Sora aesthetic shorthand, compressing complex visual qualities into recognizable terms. These references work because Sora was trained on vast video datasets including named cinematographers, film genres, and visual movements.
**Effective Reference Types:**
**Cinematographer Names:**
- **Roger Deakins:** Naturalistic, precise composition, controlled color palette
- **Emmanuel Lubezki:** Long takes, natural light, immersive camera work
- **Hoyte van Hoytema:** Large format, IMAX aesthetic, epic scale
- **Rachel Morrison:** Intimate realism, social consciousness, authentic texture
- **Robert Richardson:** Bold contrast, saturated color, operatic drama
**Film Genres:**
- **Film Noir:** High contrast, dramatic shadows, urban nights
- **Western:** Wide landscapes, golden dusty light, frontier aesthetic
- **Horror:** Dark, unsettling angles, suggestion over revelation
- **Romantic Comedy:** Warm tones, soft focus, optimistic lighting
- **Documentary:** Naturalistic, minimal intervention, authentic moments
**Decade-Specific Styles:**
- **1970s:** Warm film grain, earthy colors, naturalistic approach
- **1980s:** High contrast, vibrant color, stylized excess
- **1990s:** Clean, bright, music video influence
- **2000s:** Digital early adoption, desaturated palettes
- **2010s-Present:** Digital perfection, color grading dominance
**Technical Formats:**
- **35mm film:** Specific grain structure and color response
- **16mm film:** Grainier, documentary aesthetic
- **Super 8:** Home movie quality, nostalgic feel
- **Digital 4K/8K:** Clean, sharp, contemporary look
- **IMAX:** Large format, exceptional detail and scale
**Example Style References:**
**Weak:** "Make it look cinematic"
**Strong:** "Shot in the style of Roger Deakins, natural lighting with careful shadow control, precise composition, muted color palette with selective saturation"
**Another Strong Example:** "1970s documentary aesthetic, shot on 16mm film, natural lighting, handheld camera, slight grain, earthy color tones, observational filmmaking approach"
## Combining the Five Pillars: Complete Prompt Formula
The most effective Sora prompts integrate all five pillars into coherent descriptions. The formula:
**[SUBJECT] + [CAMERA] + [LIGHTING] + [MOTION] + [STYLE]**
### Example Building Process:
**Starting Point:** "A chef cooking"
**Add Subject Details:** "Chef in white uniform and toque, mid-40s with graying beard, working at stainless steel prep station, precise movements while chopping vegetables"
**Add Camera:** "Medium shot from slightly elevated angle, camera slowly pushing in to close-up of hands and cutting board, shallow depth of field with background kitchen softly blurred"
**Add Lighting:** "Bright kitchen lighting from overhead fixtures, some natural window light from the left creating soft highlights on knife blade and vegetables, clean professional kitchen atmosphere"
**Add Motion:** "Chef's hands moving rhythmically, knife chopping in consistent tempo, vegetables being pushed aside after cutting, smooth professional technique, real-time speed"
**Add Style:** "Shot in the style of Chef's Table documentary series, clean professional cinematography, emphasis on craft and precision, warm color grading suggesting culinary artistry"
**Complete Prompt:**
"Chef in white uniform and toque, mid-40s with graying beard, working at stainless steel prep station, precise movements while chopping vegetables. Medium shot from slightly elevated angle, camera slowly pushing in to close-up of hands and cutting board, shallow depth of field with background kitchen softly blurred. Bright kitchen lighting from overhead fixtures, some natural window light from left creating soft highlights on knife blade and vegetables, clean professional kitchen atmosphere. Chef's hands moving rhythmically, knife chopping in consistent tempo, vegetables being pushed aside after cutting, smooth professional technique, real-time speed. Shot in the style of Chef's Table documentary series, clean professional cinematography, emphasis on craft and precision, warm color grading suggesting culinary artistry."
This complete prompt gives Sora every element needed to generate professional-quality video matching your creative vision.
## Advanced Prompt Engineering Techniques
### Technique 1: Negative Space Control
Describing what's NOT in frame is as important as what is. Empty space creates composition and guides attention.
**Example:** "Wide shot with subject positioned in left third of frame, right two-thirds showing empty desert landscape, emphasizing isolation and scale"
### Technique 2: Depth Layering
Creating visual depth through foreground, midground, and background elements adds dimensionality.
**Example:** "Foreground: Rain-covered window with water droplets. Midground: Person visible through window, slightly out of focus. Background: Blurred city lights creating bokeh effect"
### Technique 3: Temporal Progression
Describing how the scene changes from beginning to end of clip gives Sora narrative structure.
**Example:** "Shot begins with empty street corner, subject enters frame from right side after 2 seconds, walks across frame examining phone, exits left side, camera holds on empty corner again showing their departure"
### Technique 4: Atmospheric Accumulation
Building atmosphere through multiple subtle details creates richer environments than single obvious effects.
**Example:** "Late afternoon coffee shop, dust particles visible in window light, steam rising from cups, muffled conversation ambiance suggested through body language, warm wood tones, lived-in comfortable space"
### Technique 5: Constrast and Juxtaposition
Placing opposing elements in same frame creates visual interest and narrative tension.
**Example:** "Modern glass skyscraper reflecting in puddle on cracked asphalt street, homeless person sleeping in foreground, wealth and poverty juxtaposed, documentary social commentary aesthetic"
### Technique 6: Sensory Suggestion
Even though Sora generates silent video, describing implied sounds and sensations enriches the output.
**Example:** "Busy kitchen with visible steam, chefs moving urgently, pots and pans in motion, implied heat and noise, chaotic energy of dinner rush service"
### Technique 7: Scale Anchoring
Using familiar objects to establish scale of unfamiliar subjects or environments.
**Example:** "Massive redwood tree, person standing at base looking up for scale, tree trunk wider than car, emphasizing ancient natural monumentality"
### Technique 8: Emotional Camera Work
Camera behavior can convey emotion and perspective beyond neutral observation.
**Example:** "Handheld camera following person through crowded party, slightly unstable movement suggesting intoxication or anxiety, subjective perspective creating empathy"
## Common Mistakes and How to Fix Them
### Mistake 1: Vague Action Verbs
**Wrong:** "Person doing something interesting"
**Right:** "Person juggling three flaming torches, hands moving in precise circular pattern, catching and throwing in rhythm"
### Mistake 2: Assuming Context
**Wrong:** "The usual office scene"
**Right:** "Modern open-plan office, rows of desks with computers, fluorescent overhead lighting, workers at cubicles, corporate environment"
### Mistake 3: Overloading Single Prompt
**Wrong:** 300-word prompt describing every possible detail
**Right:** Focus on 5-7 key visual elements that define the scene
### Mistake 4: Forgetting Temporal Element
**Wrong:** "Beautiful landscape"
**Right:** "Time-lapse of mountain landscape, clouds moving across sky, shadows shifting as sun moves, changing light throughout day"
### Mistake 5: Mixing Incompatible Styles
**Wrong:** "Shot like Wes Anderson but also documentary realism"
**Right:** Choose one coherent style: either "Wes Anderson symmetrical composition, pastel colors, precise framing" OR "handheld documentary realism, natural lighting, observational approach"
### Mistake 6: Ignoring Physics
**Wrong:** "Car driving underwater smoothly like on road"
**Right:** "Submersible vehicle moving through water, slower movement, bubbles trailing, underwater resistance visible"
### Mistake 7: No Lighting Direction
**Wrong:** "Well-lit scene"
**Right:** "Three-point lighting setup, key light from upper left, fill light from right softening shadows, backlight separating subject from background"
### Mistake 8: Static Everything
**Wrong:** "Person standing still"
**Right:** "Person standing in wind, hair and clothing moving, slight weight shifts, breathing visible, living stillness rather than frozen"
## Optimizing Prompts for Different Video Lengths
Sora generates various lengths. Adjust prompt complexity based on duration.
### Short Clips (3-5 seconds)
Focus on single strong visual moment. Simplified structure works best.
**Example:** "Balloon popping in slow-motion, rubber fragments flying outward, water droplets suspended in air, black background, high-speed camera capture"
### Medium Clips (5-10 seconds)
Allow for beginning, middle, end structure with clear progression.
**Example:** "Skateboarder approaching stairs, launching into air, board rotating beneath feet, successful landing, riding away, handheld following camera, urban setting"
### Longer Clips (10+ seconds)
Multiple beats, camera movements, or subject actions. More complex narrative possible.
**Example:** "Chef enters kitchen, ties apron, approaches prep station, examines ingredients, begins chopping vegetables with precise technique, camera slowly pushing in throughout, morning light through windows, professional kitchen atmosphere, Chef's Table style"
## Testing and Iteration Strategy
Prompt engineering requires systematic testing to understand what works.
### Step 1: Establish Baseline
Start with simple prompt capturing core idea.
**Example:** "Person walking through city at night"
### Step 2: Add One Element
Test adding camera movement.
**Example:** "Person walking through city at night, tracking shot following from behind"
### Step 3: Add Lighting
Layer in lighting description.
**Example:** "Person walking through city at night, tracking shot following from behind, neon signs reflecting on wet pavement"
### Step 4: Add Style
Include cinematic reference.
**Example:** "Person walking through city at night, tracking shot following from behind, neon signs reflecting on wet pavement, Blade Runner cinematography style"
### Step 5: Refine Details
Add specific subject and motion details.
**Example:** "Woman in black coat walking through Tokyo at night, tracking shot following from behind, neon signs reflecting on rain-soaked pavement, steam rising from street vents, Blade Runner cinematography style, cyberpunk atmosphere"
Each iteration teaches you which elements most impact your specific creative goals.
## Prompt Templates for Common Scenarios
### Product Showcase Template
"[Product name and key features], rotating on turntable, [lighting setup], clean [background color] background, [camera movement], macro detail visible, commercial product photography style, [brand aesthetic reference]"
### Interview/Talking Head Template
"[Person description] speaking to camera, [shot type] from [angle], [lighting description], [background environment], genuine expression, documentary portrait style, [mood/atmosphere]"
### Action Sports Template
"[Athlete description] [performing specific action], [camera following technique], [environment description], [lighting conditions], [speed: real-time/slow-motion], [sports cinematography reference], capturing athletic excellence"
### Nature Documentary Template
"[Animal/natural phenomenon], [behavior description], [camera position and movement], [natural lighting conditions], [environment context], [documentary style reference], educational and cinematic"
### Urban Exploration Template
"[Location type and specific details], [camera movement through space], [time of day], [atmospheric conditions], [architectural or cultural elements], [cinematography style], authentic sense of place"
## Prompt Variations for Creative Exploration
Sometimes the best results come from intentional variation of successful prompts.
### Original Prompt:
"Chef chopping vegetables, overhead shot, bright kitchen lighting, professional technique"
### Variation 1 - Different Angle:
"Chef chopping vegetables, close-up from side angle, bright kitchen lighting, professional technique"
### Variation 2 - Different Lighting:
"Chef chopping vegetables, overhead shot, dramatic single-source lighting, professional technique"
### Variation 3 - Different Motion:
"Chef chopping vegetables in slow-motion, overhead shot, bright kitchen lighting, professional technique"
### Variation 4 - Different Style:
"Chef chopping vegetables, overhead shot, bright kitchen lighting, professional technique, shot in gritty documentary realism style"
Each variation explores different creative directions while maintaining core concept.
## Troubleshooting Common Issues
### Issue: Inconsistent Results
**Solution:** Add more specific constraints. Generic prompts produce random variation. Detailed descriptions create consistency.
### Issue: Unnatural Motion
**Solution:** Check physics. Ensure described motion respects gravity, momentum, and material properties.
### Issue: Poor Composition
**Solution:** Be explicit about camera framing, subject position in frame, and negative space.
### Issue: Wrong Mood/Atmosphere
**Solution:** Add lighting details and style references. These strongly influence emotional tone.
### Issue: Unclear Subject
**Solution:** Increase subject description specificity. Include distinctive features and clear spatial relationships.
### Issue: Boring or Flat Results
**Solution:** Add camera movement, atmospheric effects, and cinematography references to elevate production value.
## Understanding Sora's Limitations
Knowing what Sora struggles with helps you work around limitations.
**Current Challenges:**
**Character Consistency:** Same character across multiple generations remains difficult. Use extremely detailed descriptions.
**Complex Physics:** Multiple interacting objects with collision may produce errors. Simplify interactions.
**Text Rendering:** On-screen text often illegible. Avoid prompts requiring readable text unless absolutely necessary.
**Long-Duration Consistency:** Extended clips may drift from initial prompt. Keep durations moderate.
**Precise Timing:** Exact second-by-second choreography difficult. Describe general progression rather than precise timing.
**Fine Detail:** Extreme close-ups of complex details (intricate jewelry, detailed textures) may lose clarity. Medium detail level works best.
## Ethical Considerations
Powerful tools require responsible use.
**Consider:**
**Deepfakes and Misrepresentation:** Don't create videos impersonating real people without permission.
**Copyright and IP:** Avoid generating content that infringes on copyrighted characters, logos, or trademarked properties.
**Misinformation:** Don't create realistic fake news or misleading documentary-style content.
**Exploitation:** Avoid generating content that exploits vulnerable people or promotes harmful stereotypes.
**Attribution:** Be transparent when sharing AI-generated content. Viewers deserve to know.
## Frequently Asked Questions
**How long should Sora prompts be?**
Optimal length is 50-150 words. This provides enough detail without overwhelming the model. Include subject, camera, lighting, motion, and style.
**Do I need cinematography knowledge to use Sora effectively?**
Basic terminology helps significantly but isn't required. Start with simple terms like "close-up," "wide shot," "from above" and expand vocabulary over time.
**Can I generate multi-scene videos with single prompt?**
Sora handles single coherent scenes best. For multi-scene videos, generate scenes separately and edit together in post-production software.
**How do I maintain character consistency across multiple clips?**
Use extremely detailed physical descriptions including distinctive features, clothing, and characteristics. Perfect consistency remains challenging.
**What if my results don't match expectations?**
Iterate systematically. Change one element at a time to isolate what affects output. Camera language and lighting typically have strongest impact.
**Should I mention specific camera equipment?**
Cinematographer names and film references work better than equipment specs. "Roger Deakins lighting" is more effective than "shot on ARRI Alexa."
**How do I make videos look more professional?**
Use proper cinematography terminology, detailed lighting descriptions, and style references. Professional language signals desired production quality.
**Can I create abstract or experimental content?**
Yes, but use concrete visual terms even for abstract concepts. "Swirling gradient colors transitioning from blue to orange, fluid motion, psychedelic patterns" works better than "abstract beauty."
**How important are style references?**
Very important. They compress complex aesthetic instructions into shorthand Sora understands from training data. Always include when possible.
**What's the biggest mistake beginners make?**
Forgetting to describe camera behavior. Sora needs explicit camera direction. Always specify static/moving, angle, and framing.
## Conclusion
Mastering Sora prompt engineering requires understanding that you're directing video, not describing images. The most effective prompts combine precise subject descriptions, professional camera language, detailed lighting specifications, clear motion direction, and appropriate style references into coherent creative instructions.
Start with the five-pillar framework: subject, camera, lighting, motion, style. Build prompts systematically, testing iterations to understand cause and effect. Study cinematography terminology to communicate precisely with Sora. Reference established visual languages through cinematographer names and film styles to achieve specific aesthetic goals.
Success comes from specificity balanced with clarity. Describe exactly what you want to see, how the camera should move, how light should behave, how subjects move through space, and what quality level you're targeting. Sora handles technical execution when given proper creative direction.
The gap between amateur and professional results isn't equipment or budget—it's understanding how to communicate your vision in language Sora comprehends. These principles and techniques provide that foundation. Now apply them to your creative projects.
**Ready to master Sora? Explore our complete prompt guides:**
- **[Best Sora Prompts](/best-sora-prompts/)** - 100+ tested prompts across all categories
- **[Sora Cinematic Prompts](/sora-cinematic-prompts/)** - Hollywood-style video techniques
- **[Fixing Sora's Biggest Issues](/fix-sora-problems/)** - Troubleshoot common problems
- **[Sora Prompt Engineering](/sora-prompt-engineering/)** - Advanced optimization strategies
- **[Sora Video Workflows](/sora-video-workflows/)** - Complete production processes