Super Slow Motion in AI Video
Super slow motion uses AI frame synthesis to reach 720 fps and higher. This guide covers inputs, models, and workflows for Flixly users.

TL;DR
Super slow motion generates extreme frame rates by synthesizing new frames rather than stretching existing ones. Seedance 2.0 and Kling 3.0 produce 720-960 fps output from 4-20 second clips with slowdown multipliers up to 32x.
Super slow motion creates video playback at rates far above standard 24 or 30 fps by synthesizing new frames between existing ones. It differs from simple speed reduction because the added frames maintain motion continuity instead of showing stutter.
How frame interpolation works
AI video models analyze motion vectors across input frames. They predict pixel movement over time and insert intermediate frames at intervals as fine as 1 ms. Seedance 2.0 processes clips at 16 frames per second input and outputs up to 960 fps results.
Kling 3.0 applies optical flow estimation on 1080p footage. It requires a 4-second source clip minimum and returns a file with 4x temporal density.
Concrete inputs and outputs
Users upload an MP4 or MOV file between 5 and 30 seconds. Parameters include slowdown multiplier (4x to 32x) and target resolution such as 1080p or 4K. Output duration stretches accordingly while preserving original audio pitch when selected.
Veo 3.1 accepts reference video plus a text prompt describing the action. It generates 720 fps files sized around 120 MB for a 10-second source.
Real workflow examples
Sports highlight reels start with a 24 fps smartphone clip. The image to video tool converts a key frame then applies super slow settings to extend the play to 8 seconds at 480 fps.
Product launch videos use text to video to create a 3-second base shot of a falling object. A 16x slowdown produces 48 seconds of smooth descent.
Model comparison
| Model | Max fps | Input length | Typical credits | \ Resolution | \ Example use |
|---|---|---|---|---|---|
| Seedance 2.0 | 960 | 4-15 s | 18 | 1080p | Sports replays |
| Kling 3.0 | 720 | 5-20 s | 22 | 4K | Action sequences |
| Veo 3.1 | 720 | 3-12 s | 15 | 1080p | Product demos |
Where it shows up
Filmmakers apply super slow to dialogue-free sequences in shorts created via shorts generator. Motion designers combine it with video to video for stylized time effects on existing footage.
Start with a short test clip in the image to video tool to see frame count increase from 72 to 576 on a 3-second source.
FAQ
How many new frames does a 4x slowdown add to a 10-second 30 fps clip? A 4x slowdown on that clip creates 1170 total frames instead of the original 300, adding 870 synthesized frames.
Can super slow motion preserve original audio without pitch shift? Yes. Select the preserve pitch option in the lip sync or video effects settings before generation; the system time-stretches audio separately.
What file formats support super slow output above 240 fps? MP4 with H.264 or H.265 encoding works for 480 fps and higher. ProRes 422 is available on request for 960 fps files.
Does reference video quality affect final super slow results? Higher bitrate sources above 20 Mbps reduce artifacts in generated frames when using reference to video.
How long does a 16x slowdown generation take on a 5-second clip? Average processing completes in 45 seconds for 1080p output using Wan 2.7 or Seedance 2.0 on the current dashboard queue.
Selecting source clips for interpolation
Start by reviewing the original footage frame by frame in a standard editor before upload. Look for consistent lighting across the sequence and minimal motion blur on fast-moving subjects. Clips shot at 60 fps or higher give the model more reliable motion vectors than 24 fps phone footage. Avoid handheld shots with rolling shutter; the resulting skew creates ghosting once frames are synthesized at 8x or greater slowdowns.
Keep the action centered in the frame when possible. Peripheral movement near the edges often produces stretching artifacts because the model has fewer neighboring pixels to reference. If the subject exits the frame during the clip, trim the source to the portion where it remains fully visible. This reduces the number of frames the system must invent at the boundaries.
Color grade the source to a neutral log profile before submission if your camera allows. Flat footage with preserved highlight and shadow detail gives the interpolation algorithm more latitude when predicting intermediate pixel values. Export the graded clip at the highest bitrate your storage allows, then upload directly.
Handling artifacts in generated frames
Even with clean input, certain artifacts appear at extreme slowdown multipliers. Temporal flickering shows up most often on fine textures such as fabric or foliage. Reduce it by lowering the target fps from 960 to 480 or by adding a light temporal smooth pass in post after download.
Edge tearing occurs when an object moves against a high-contrast background. One practical fix is to run a second pass at half the slowdown factor and blend the two outputs using an opacity ramp on the areas showing tears. This hybrid approach keeps file size manageable while restoring clean edges.
Motion blur mismatch appears when the original clip was shot with a fast shutter. The synthesized frames lack corresponding blur, making movement look staccato. Apply a directional blur filter set to 20-30 percent of the slowdown multiplier in a node-based compositor to match the look of the source frames.
Combining super slow with other effects
Apply video to video after the initial slowdown generation when you need to add camera shake or lens distortion that matches the rest of the sequence. Feed the interpolated clip back into the tool with a prompt specifying the desired lens characteristics and a low strength value so the timing stays intact.
For product shots, layer the super slow output with a separate text to video pass that generates floating particles or reflections. Composite the two layers in your editor using the alpha channel from the particle pass. This keeps the main subject motion smooth while adding depth without re-interpolating the entire scene.
When building a highlight reel, export the super slow segment at 1080p first, then import it into the shorts generator for final pacing adjustments and text overlays. The generator respects the existing frame rate, so the added timing markers align correctly with the 480 fps or 720 fps timeline.
Post-generation checklist
- Verify frame count matches the expected output (original frames × slowdown factor).
- Check audio sync if pitch preservation was enabled; adjust stretch markers if drift exceeds one frame.
- Run a quick optical flow analysis pass to flag any remaining duplicated vectors.
- Export a 10-second proxy at 50 percent resolution for client review before committing to the full 4K master.
- Archive the original source clip alongside the interpolated version with matching metadata tags for future revisions.
Batch processing multiple clips
When handling more than one source file, queue uploads through the batch upload panel rather than individual submissions. Set a uniform slowdown multiplier across the group to keep timeline alignment simple once files land in the editor. The system processes clips sequentially, so stagger start times by 30 seconds if your account queue shows high load.
Name files with embedded metadata such as original fps and intended slowdown factor before upload. This prevents mix-ups when downloading the expanded results. A 12-second 60 fps clip at 8x produces 5760 frames; label it accordingly so the naming convention matches the output directory structure.
Monitor credit usage per batch. A 15-clip set at 16x typically consumes 240–280 credits depending on resolution. Pause the queue after the first three files finish to inspect one sample for artifact patterns before committing the remainder.
Export settings for different platforms
Choose container and codec based on the final delivery channel. Social platforms cap effective playback at 60 fps, so export the interpolated file at native high frame rate then conform down in your editor to avoid re-encoding artifacts. Retain the full frame count only for archival masters.
Use this reference when selecting output options:
| Platform | Recommended container | Max effective fps | Notes |
|---|---|---|---|
| Instagram Reel | MP4 H.264 | 60 | Conform after export |
| YouTube | MP4 H.265 | 60 | Upload original high-fps master |
| TikTok | MP4 H.264 | 60 | Avoid ProRes due to upload limits |
| Vimeo | MP4 H.265 | 120 | Supports higher frame rates |
Apply a two-pass encode for files above 480 fps to reduce banding on gradients. Keep the audio track separate during export if pitch preservation was enabled, then re-sync in post.
Audio synchronization techniques
When pitch preservation is disabled, stretch the original audio track manually using a 0.1-frame accuracy marker in a DAW. Align the first transient of the source audio with frame one of the interpolated video, then apply linear time stretch to match total duration.
For clips containing dialogue, generate a separate low-fps reference pass at 2x slowdown. Use this reference to place syllable markers before applying the full 16x or 32x interpolation. The low-fps version reveals mouth shapes that disappear at extreme slowdowns.
Export the final mix as a 48 kHz WAV alongside the video file. Import both into video to video for any last timing tweaks without re-running the full interpolation pipeline.
Version control for generated assets
Store the original source, the first-pass interpolated file, and any post-processed versions in a single project folder. Append suffixes such as _src, _interp_16x, and _final_v2 to maintain traceability. Update a shared CSV log with frame counts, credit totals, and export timestamps for each iteration.
When revisions are requested, reload the source clip into the reference to video tool rather than the already-interpolated file. This avoids compounding artifacts from multiple generations. Keep the CSV log in the same directory so metadata remains attached to the asset set.
Frequently Asked Questions
How many new frames does a 4x slowdown add to a 10-second 30 fps clip?▾
A 4x slowdown on that clip creates 1170 total frames instead of the original 300, adding 870 synthesized frames.
Can super slow motion preserve original audio without pitch shift?▾
Yes. Select the preserve pitch option in the lip sync or video effects settings before generation; the system time-stretches audio separately.
What file formats support super slow output above 240 fps?▾
MP4 with H.264 or H.265 encoding works for 480 fps and higher. ProRes 422 is available on request for 960 fps files.
Does reference video quality affect final super slow results?▾
Higher bitrate sources above 20 Mbps reduce artifacts in generated frames when using reference to video.
How long does a 16x slowdown generation take on a 5-second clip?▾
Average processing completes in 45 seconds for 1080p output using Wan 2.7 or Seedance 2.0 on the current dashboard queue.


