Guide to World Model Video Generation
A practical walkthrough that takes a rainy car-chase script from prompt to verified 12-second clip using Seedance 2.0, Veo 3.1 and reference-to-video corrections.

TL;DR
Start with a 5-second Seedance 2.0 pass at 1080p, correct lighting and contact points with Reference to Video at 0.75 strength, then stitch segments in Video to Video. Verify gravity, reflections and object persistence before exporting the final 12-second 1080p file.
You need a 12-second chase scene where a car slides around a wet corner while headlights cast moving reflections on the pavement.
Prepare Your Assets
Start at the text prompt stage to lock in camera angle, rain intensity and car model. Use a 1080p reference still exported from your storyboard at the 0-second mark.
Link the first asset directly into the Text to Video tool. Set duration to 5 seconds and aspect ratio 16:9 so the initial pass stays under 20 credits.
Choose Reference Images
Add two stills: one at frame 0 and one at frame 72. These anchors force the model to respect the same curb height and puddle locations.
Select the Generation Model
Pick from four frontier options that each handle world-model constraints differently.
| Model | Max Duration | Resolution | Approx Credits | Strength |
|---|---|---|---|---|
| Seedance 2.0 | 8 s | 1080p | 15 | Strong motion continuity |
| Veo 3.1 | 10 s | 1080p | 18 | Best lighting consistency |
| Kling 3.0 | 6 s | 720p | 12 | Fast iteration |
| Sora 2 | 12 s | 1080p | 22 | Complex object interaction |
Run the first pass with Seedance 2.0. The output lands at 5 seconds and 1080p.
Run the First Pass
Paste the prompt: "silver sedan drifts left on rain-slick street at night, headlights sweep across puddles, camera tracks from passenger side".
Submit and wait 45 seconds. Download the clip and scrub frame by frame in your editor.
Check that the car’s shadow stays attached to the tires and that puddle reflections move with the headlights.
Refine with Reference Frames
If the rear wheel clips the curb at second 3, feed the clip into Reference to Video. Upload the original reference still at 0 seconds and the failed frame at 3 seconds.
Set strength to 0.75. Generate a 4-second correction segment. The new clip now matches the curb line exactly.
Merge the corrected segment with the first 3 seconds in Video to Video. Total credits used: 47.
Verify Physics Consistency
Play the stitched 12-second video at 0.25x speed. Confirm:
- Gravity holds the car to the road surface
- Water spray arcs match wheel speed
- Headlight beams do not pass through the car body
- Building reflections stay fixed to the architecture
If any element breaks, loop back to the reference step and adjust the prompt weight on "wet asphalt" by +0.2.
Export and Iterate
When the 12-second clip passes verification, export as 1080p H.264. The file is 48 MB.
Repeat the same workflow for the next shot using Image to Video with a new reference still taken from the end of this clip. The shared world state carries forward automatically.
Motion Poster lets you turn the final still into a looping 4-second teaser for social posts without extra credits.



