Grok Imagine AI video model
Text-to-video and image-to-video generation with synchronized audio.
- Text / Image input support in one dashboard
- 1-8 credits depending on length
- Supports native audio
Model overview
Vidyu support for Grok Imagine
This page lists the model's supported inputs, output ranges, and available controls in Vidyu.
Typical use
Text-to-video and image-to-video generation with synchronized audio.
Output range
1 to 15 sec output with 480p to 720p resolution options.
Notes
Generation. Supports native audio. No required reference input.
Key controls
The main controls exposed for Grok Imagine
These are the model-specific controls currently available in Vidyu.
Length
Between 1 and 15 seconds.
Resolution
720p is sharper, 480p is faster.
Aspect Ratio
Auto lets the model decide based on your prompt.
Alternatives
Related models to compare
These are nearby options if you want to compare a similar setup before choosing a model.
Runway
Gen4 Aleph
Runway’s video-first generation and transformation model built around prompt-plus-video inputs.
Hailuo
Hailuo 02
Physics-focused text-to-video and image-to-video with simple 6s/10s presets.
Hailuo
Hailuo 2.3
Higher-fidelity Hailuo for realistic motion, cinematic VFX, and stronger prompt adherence.
Simple Pricing
Pay for what you make. 1 credit = 1 second of video. Start at $20/month or go bigger. Cancel anytime.
One Plan
Start at $20/month for 100 credits (= 100 seconds of video). Need more? Go up to 6,000 credits. Change anytime.
- Make videos from text or images
- Download instantly, and they're yours to keep
- Change your plan anytime
- No contracts or commitments
- Cancel whenever you want
FAQ
Common questions about Grok Imagine
Ready to Make Your First Video?
Sign in with Google and you’ll be making videos in the next 5 minutes. Seriously, it’s that easy.