Gemini Omni Flash May 2026 Update
Google released its May 2026 refresh for Gemini Omni Flash last week. The update centers on faster voice synthesis and tighter integration between text, image, and audio outputs. What's New in Gemini...
TL;DR
Gemini Omni Flash received a major May 2026 update focused on real-time voice and multimodal speed. The model now supports 30 preset voices, 80+ languages, and instant multi-speaker dialogue. New pricing starts at 2 credits per minute for voice and 3 credits per image when paired with Flixly's Text to Speech and Text to Video tools.
Google released its May 2026 refresh for Gemini Omni Flash last week. The update centers on faster voice synthesis and tighter integration between text, image, and audio outputs.
What's New in Gemini Omni Flash May 2026
The latest build adds native 80-language support and reduces average voice latency to 180 ms. Users can now run multi-speaker conversations with up to four distinct voices in a single generation call.
Voice and Audio Upgrades
- 30 preset voices covering 10 new accents
- Instant voice cloning from a 10-second sample
- Real-time dialogue mode for podcasts and shorts
Image and Video Enhancements
- 4K output at 8-second clips via Text to Video
- Improved text rendering inside generated images
- Automatic background removal after generation
How to Use Gemini Omni Flash in Flixly
Open the Text to Speech tool and select Gemini 3.1 Flash TTS. Paste your script, choose a preset voice, and hit generate. For video, switch to Text to Video and pick the Omni Flash backend.
Step-by-Step Workflow
- Write a 60-second script.
- Clone a voice or pick a preset.
- Generate audio track.
- Upload audio to Auto Captions for subtitles.
- Add the final video to Shorts Generator for vertical export.
Gemini Omni Flash vs Previous Version
| Feature | May 2026 | April 2026 |
|---|---|---|
| Latency | 180 ms | 320 ms |
| Languages | 80+ | 45 |
| Max speakers | 4 | 2 |
| Credit cost (voice) | 2/min | 4/min |
The new version cuts credit use in half while doubling language coverage.
Real-World Examples and Workflows
A marketing team used Gemini Omni Flash to create 12 short-form videos in one afternoon. They started with Music Generation for background tracks, then synced everything inside Lip Sync Video. Each 15-second clip cost 5 credits total.
Creators on Flixly can now move from script to finished short in under 8 minutes. Export the final file at 1080p or 4K directly from the platform.
Pricing and Credit Costs
Gemini Omni Flash voice generation starts at 2 credits per minute. Image tasks run 3 credits each. Video clips under 8 seconds are charged 4 credits. Monthly plans begin at $19 for 500 credits.
Getting Started with Flixly
Sign up at Sign Up and test the new Gemini Omni Flash models immediately. All features appear in the dashboard under AI Audio and AI Video sections.
Frequently Asked Questions
What changed in Gemini Omni Flash May 2026?▾
The May 2026 update added 80-language support, reduced voice latency to 180 ms, and introduced multi-speaker dialogue with up to four voices.
How much does Gemini Omni Flash cost in Flixly?▾
Voice generation starts at 2 credits per minute and image generation costs 3 credits per image. Video clips under 8 seconds cost 4 credits.
Can I use Gemini Omni Flash for YouTube Shorts?▾
Yes. Combine the Text to Speech tool with Shorts Generator and Auto Captions to create complete vertical videos under 8 minutes.
Does Gemini Omni Flash support voice cloning?▾
The update includes instant voice cloning from a 10-second audio sample that works directly inside the Text to Speech workflow.
What languages are supported?▾
Gemini Omni Flash now supports more than 80 languages and 10 new regional accents with 30 preset voices available.
Is Gemini Omni Flash better than the April version?▾
Yes. The May build halves credit cost, doubles language support, and cuts average latency from 320 ms to 180 ms.