Smart Shot vs Veo 3.1 Lite 2026
Side-by-side look at Smart Shot and Veo 3.1 Lite for 2026 video tasks. Covers specs, credit pricing, and practical workflow choices on Flixly.
TL;DR
Smart Shot creates controlled 4-8 second 1080p clips from a reference image and motion text, starting at 4 credits for a 4-second clip. Veo 3.1 Lite creates 5-10 second 720p clips from text or image, starting at 6 credits for a 5-second clip. Choose Smart Shot for precise camera moves over existing plates and Veo 3.1 Lite for text-driven scene invention.
What Smart Shot and Veo 3.1 Lite actually are
Smart Shot generates short video clips from a single reference frame plus motion prompts. Veo 3.1 Lite generates video from text or image prompts using Google's updated diffusion pipeline. The first tool focuses on controlled camera moves. The second prioritizes broad scene creation from text.
How each model processes requests
Smart Shot accepts a starting image, a motion description, and an optional duration in seconds. It outputs 4-second to 8-second clips at 1080p. Veo 3.1 Lite takes text or an image and returns 5-second to 10-second clips at 720p by default.
Both run on Flixly's cluster. You select the model at generation time inside the dashboard.
Input specifications
- Smart Shot: PNG or JPG reference frame, 512x512 to 1920x1080. Motion text up to 280 characters.
- Veo 3.1 Lite: plain text prompt or image. Supports 16:9, 9:16, 1:1 aspect ratios.
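The limits above can be checked locally before a job is queued. The following is a minimal sketch, not Flixly's own validation: the function and constant names are hypothetical, and it assumes "512x512 to 1920x1080" means width between 512 and 1920 and height between 512 and 1080.

```python
# Limits copied from the input specifications above. All names are hypothetical.
SMART_SHOT_FORMATS = {"png", "jpg", "jpeg"}  # "jpeg" treated as an alias of "jpg" (assumption)
SMART_SHOT_MIN_SIZE = (512, 512)             # (width, height)
SMART_SHOT_MAX_SIZE = (1920, 1080)           # (width, height)
SMART_SHOT_MAX_MOTION_CHARS = 280
VEO_LITE_ASPECT_RATIOS = {"16:9", "9:16", "1:1"}


def check_smart_shot_input(fmt: str, width: int, height: int, motion_text: str) -> list[str]:
    """Return a list of problems; an empty list means the input fits the stated limits."""
    problems = []
    if fmt.lower() not in SMART_SHOT_FORMATS:
        problems.append(f"unsupported format: {fmt}")
    min_w, min_h = SMART_SHOT_MIN_SIZE
    max_w, max_h = SMART_SHOT_MAX_SIZE
    # Assumption: each dimension is bounded independently.
    if not (min_w <= width <= max_w and min_h <= height <= max_h):
        problems.append(f"frame size {width}x{height} is outside 512x512 to 1920x1080")
    if len(motion_text) > SMART_SHOT_MAX_MOTION_CHARS:
        problems.append(f"motion text is {len(motion_text)} characters, limit is 280")
    return problems


def check_veo_lite_aspect(ratio: str) -> bool:
    """True if the ratio is one of the three listed for Veo 3.1 Lite."""
    return ratio in VEO_LITE_ASPECT_RATIOS


if __name__ == "__main__":
    print(check_smart_shot_input("png", 1920, 1080, "pan 30 degrees left"))
    print(check_veo_lite_aspect("4:3"))
```

Returning a list of problems rather than raising on the first one lets a batch script report every issue with a frame in a single pass.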
Output specifications
Smart Shot always returns MP4 at 24 fps. Veo 3.1 Lite returns MP4 at 24 fps, with an optional 30 fps setting.
Where each fits in production workflows
Use Smart Shot when you need exact camera paths over a static plate. A product shot that pans 30 degrees left while zooming 15 percent is a typical case. Use Veo 3.1 Lite when the scene must be invented from a paragraph description, such as an establishing shot of a city at dusk.
Credit costs on Flixly
Smart Shot charges 4 credits for a 4-second 1080p clip. Veo 3.1 Lite charges 6 credits for a 5-second 720p clip. Costs for longer clips scale linearly with duration.
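To budget a batch of clips, the base prices can be turned into a per-second rate. The sketch below assumes that linear scaling means cost is proportional to duration (base credits divided by base seconds) and that no rounding is applied; actual billing on Flixly may round or tier differently. The dictionary keys and function name are hypothetical.

```python
# Base prices and duration ranges taken from this article. Keys are hypothetical.
BASE_PRICING = {
    "smart_shot": {"credits": 4, "seconds": 4, "min_s": 4, "max_s": 8},
    "veo_3_1_lite": {"credits": 6, "seconds": 5, "min_s": 5, "max_s": 10},
}


def estimate_credits(model: str, seconds: float) -> float:
    """Estimate credits for one clip, assuming cost is proportional to duration."""
    pricing = BASE_PRICING[model]
    if not pricing["min_s"] <= seconds <= pricing["max_s"]:
        raise ValueError(
            f"{model} supports {pricing['min_s']}-{pricing['max_s']} second clips, got {seconds}"
        )
    # Per-second rate = base credits / base duration (assumption, no rounding).
    return pricing["credits"] * seconds / pricing["seconds"]


if __name__ == "__main__":
    print(estimate_credits("smart_shot", 8))     # 8.0
    print(estimate_credits("veo_3_1_lite", 10))  # 12.0
```

Under this assumption Smart Shot works out to 1 credit per second and Veo 3.1 Lite to 1.2 credits per second, so a maximum-length clip costs 8 and 12 credits respectively.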
Direct comparison table
| Tool/Model | Max duration | Resolution | Credit cost (base) | Best input type | Known limit |
|---|---|---|---|---|---|
| Smart Shot | 8 seconds | 1080p | 4 credits | Reference image | Fixed camera path length |
| Veo 3.1 Lite | 10 seconds | 720p | 6 credits | Text or image | Lower default resolution |
| Text to Video | 12 seconds | 1080p | 5 credits | Text | No reference image support |
| Image to Video | 8 seconds | 1080p | 4 credits | Image | Shorter max clip |
The table shows Smart Shot wins on resolution (1080p versus 720p) and base cost (4 versus 6 credits) for reference-driven shots. Veo 3.1 Lite wins on input flexibility, since it accepts either text or an image.
When to use which
Start with Smart Shot if you already have a locked-off plate and need repeatable motion. Switch to Veo 3.1 Lite when the scene must be generated from scratch without a reference frame. Test both on the same prompt set to see which matches your timeline.
FAQ
What input format works best for Smart Shot?
Smart Shot expects a single high-resolution still as the first frame plus a short motion instruction. PNG files with clean edges give the most stable tracking.
How many seconds does Veo 3.1 Lite reliably produce?
Veo 3.1 Lite defaults to 5-second clips. You can request up to 10 seconds, but coherence drops after 7 seconds in most tests.
Can I combine outputs from both models in one edit?
Yes. Export both as MP4, then import into any NLE. Both models output 24 fps by default, so no speed adjustments are needed as long as Veo 3.1 Lite is not switched to its optional 30 fps setting.
Does Smart Shot support character consistency across shots?
No. It treats each generation as an independent clip. Use Reference to Video if you need the same character across multiple takes.
What aspect ratios are supported by Veo 3.1 Lite?
It supports 16:9, 9:16, and 1:1. Custom ratios outside these three are rejected at queue time.
Where to start
Open the Smart Shot page, upload a test frame, and run a 4-second pan. Compare the result to a text-only generation on Veo 3.1 Lite using the same scene description.