
Guide to World Model Video Generation

A practical walkthrough that takes a rainy car-chase script from prompt to verified 12-second clip using Seedance 2.0, Veo 3.1 and reference-to-video corrections.

June 1, 2026

TL;DR

Start with a 5-second Seedance 2.0 pass at 1080p, correct lighting and contact points with Reference to Video at 0.75 strength, then stitch segments in Video to Video. Verify gravity, reflections and object persistence before exporting the final 12-second 1080p file.

The working example for this guide: you need a 12-second chase scene in which a car slides around a wet corner while its headlights cast moving reflections on the pavement.

Prepare Your Assets

Start at the text prompt stage to lock in camera angle, rain intensity and car model. Use a 1080p reference still exported from your storyboard at the 0-second mark.

Link the first asset directly into the Text to Video tool. Set duration to 5 seconds and aspect ratio 16:9 so the initial pass stays under 20 credits.

Choose Reference Images

Add two stills: one at frame 0 and one at frame 72. These anchors force the model to respect the same curb height and puddle locations.
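To see where these anchors fall on the timeline, the frame indices can be converted to timestamps. The sketch below assumes 24 frames per second, which this guide does not state; at that rate frame 72 sits at the 3-second mark. The function name is illustrative only.

```python
ASSUMED_FPS = 24  # assumption: the guide does not state a frame rate


def frame_to_seconds(frame: int, fps: int = ASSUMED_FPS) -> float:
    """Convert a zero-based frame index to a timestamp in seconds."""
    if frame < 0 or fps <= 0:
        raise ValueError("frame must be >= 0 and fps must be > 0")
    return frame / fps


# The two anchor stills named in the guide
anchor_frames = [0, 72]
for frame in anchor_frames:
    print(f"frame {frame:>3} -> {frame_to_seconds(frame):.2f} s")
```

If your storyboard runs at a different rate, pass it as `fps` and the second anchor moves accordingly.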

Select the Generation Model

Pick from four frontier options that each handle world-model constraints differently.

Model          Max Duration   Resolution   Approx. Credits   Strength
Seedance 2.0   8 s            1080p        15                Strong motion continuity
Veo 3.1        10 s           1080p        18                Best lighting consistency
Kling 3.0      6 s            720p         12                Fast iteration
Sora 2         12 s           1080p        22                Complex object interaction

Run the first pass with Seedance 2.0. The output lands at 5 seconds and 1080p.
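The choice can also be checked mechanically. The sketch below encodes the table's figures and filters for a 5-second, 1080p pass that stays under 20 credits. The `Model` class and `candidates` helper are hypothetical names for illustration, not part of any tool's API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Model:
    name: str
    max_duration_s: int
    height_px: int
    approx_credits: int


# Figures copied from the comparison table in this guide
MODELS = [
    Model("Seedance 2.0", 8, 1080, 15),
    Model("Veo 3.1", 10, 1080, 18),
    Model("Kling 3.0", 6, 720, 12),
    Model("Sora 2", 12, 1080, 22),
]


def candidates(duration_s: int, height_px: int, credit_cap: int) -> list[Model]:
    """Models that cover the duration and resolution under the credit cap, cheapest first."""
    fits = [
        m for m in MODELS
        if m.max_duration_s >= duration_s
        and m.height_px >= height_px
        and m.approx_credits < credit_cap
    ]
    return sorted(fits, key=lambda m: m.approx_credits)


picks = candidates(duration_s=5, height_px=1080, credit_cap=20)
print([m.name for m in picks])
```

With these inputs Seedance 2.0 comes out cheapest and Veo 3.1 is the only other fit; Kling 3.0 falls short on resolution and Sora 2 exceeds the credit cap.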

Run the First Pass

Paste the prompt: "silver sedan drifts left on rain-slick street at night, headlights sweep across puddles, camera tracks from passenger side".
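If you plan to reuse the locked-in elements across shots, it can help to keep them as separate fields and assemble the prompt from them. This is a minimal sketch; the `ShotPrompt` class and its field names are assumptions made for illustration, and the joined string reproduces the prompt above.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ShotPrompt:
    action: str    # subject, motion and setting
    lighting: str  # light sources and how they behave
    camera: str    # camera position and movement

    def render(self) -> str:
        """Join the parts into one comma-separated prompt string."""
        return ", ".join([self.action, self.lighting, self.camera])


chase = ShotPrompt(
    action="silver sedan drifts left on rain-slick street at night",
    lighting="headlights sweep across puddles",
    camera="camera tracks from passenger side",
)
print(chase.render())
```

Changing one field, for example the camera line, leaves the car and lighting wording untouched between passes.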

Submit the job and wait about 45 seconds for the render. Download the clip and scrub through it frame by frame in your editor.

Check that the car’s shadow stays attached to the tires and that puddle reflections move with the headlights.

Refine with Reference Frames

If the rear wheel clips the curb at second 3, feed the clip into Reference to Video. Upload the original reference still at 0 seconds and the failed frame at 3 seconds.

Set strength to 0.75 and generate a 4-second correction segment. Check the new clip against the reference still to confirm it now matches the curb line.

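Merge the corrected segment with the first 3 seconds of the original pass in Video to Video, which gives 7 seconds of stitched footage. Repeat the generate-and-correct cycle for the remaining footage until the timeline reaches 12 seconds. Total credits used: 47.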
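A quick tally of the timeline shows how much footage is still needed after this merge. The sketch below uses only the durations given in this guide; the segment labels and the helper name are illustrative.

```python
TARGET_S = 12.0  # final clip length from the guide

# (label, duration in seconds), in timeline order
segments = [
    ("first pass, kept portion", 3.0),
    ("correction segment", 4.0),
]


def stitched_length(segs: list[tuple[str, float]]) -> float:
    """Total duration of the segments placed back to back."""
    return sum(duration for _, duration in segs)


covered = stitched_length(segments)
remaining = TARGET_S - covered
print(f"covered {covered:.1f} s, remaining {remaining:.1f} s")
```

The merge covers 7 seconds, so another 5 seconds of footage is needed to reach the 12-second target.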

Verify Physics Consistency

Play the stitched 12-second video at 0.25x speed. Confirm:

  • Gravity holds the car to the road surface
  • Water spray arcs match wheel speed
  • Headlight beams do not pass through the car body
  • Building reflections stay fixed to the architecture

If any element breaks, loop back to the reference step and raise the prompt weight on "wet asphalt" by 0.2.
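The checklist above can be tracked as a simple pass/fail record so that nothing is skipped during review. This is a sketch only: the results are filled in by hand while watching the clip, and the function names are hypothetical.

```python
CHECKS = (
    "gravity holds the car to the road surface",
    "water spray arcs match wheel speed",
    "headlight beams do not pass through the car body",
    "building reflections stay fixed to the architecture",
)


def review_time_s(clip_s: float, speed: float) -> float:
    """Wall-clock time needed to watch a clip at a given playback speed."""
    return clip_s / speed


def failed_checks(results: dict[str, bool]) -> list[str]:
    """Checks that failed or were never recorded, in checklist order."""
    return [check for check in CHECKS if not results.get(check, False)]


# Example review: everything passes except the water spray
results = {check: True for check in CHECKS}
results["water spray arcs match wheel speed"] = False

print(f"review takes {review_time_s(12, 0.25):.0f} s at 0.25x")
for check in failed_checks(results):
    print("loop back for:", check)
```

Treating an unrecorded check as a failure is a deliberate choice here, so a skipped item sends you back to the reference step rather than passing silently.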

Export and Iterate

When the 12-second clip passes verification, export as 1080p H.264. The file is 48 MB.
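The file size and duration imply an average bitrate, which is a useful sanity check on export settings. The sketch below assumes 1 MB means 1,000,000 bytes and folds audio and container overhead into the total; the helper name is illustrative.

```python
def average_bitrate_mbps(size_mb: float, duration_s: float) -> float:
    """Average bitrate in megabits per second, assuming 1 MB = 1,000,000 bytes."""
    if duration_s <= 0:
        raise ValueError("duration_s must be > 0")
    total_bits = size_mb * 1_000_000 * 8
    return total_bits / duration_s / 1_000_000


bitrate = average_bitrate_mbps(size_mb=48, duration_s=12)
print(f"{bitrate:.0f} Mbit/s average")
```

For the 48 MB, 12-second file this works out to 32 Mbit/s on average.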

Repeat the same workflow for the next shot using Image to Video with a new reference still taken from the end of this clip. The shared world state carries forward automatically.

Motion Poster lets you turn the final still into a looping 4-second teaser for social posts without extra credits.

Tags: ai-video, world-models, generation-guide, physics-simulation