Text models
| Model | Slug | Context | Max output | Stream | Tools | JSON | Vision | Files |
|---|---|---|---|---|---|---|---|---|
| Grok 4-1 Fast | grok-4-1-fast | 2M | 32K | ✓ | ✓ | ✓ | — | images (jpeg/png/gif/webp) |
| Grok 4-1 Fast Reasoning | grok-4-1-fast-reasoning | 2M | 32K | ✓ | ✓ | ✓ | — | images |
| Grok 4.20 | grok-4.20 | 2M | 32K | ✓ | ✓ | ✓ | — | images |
| Grok 4.20 Multi-Agent | grok-4.20-multi-agent | 2M | 32K | ✓ | ✓ | ✓ | — | images |
| Grok 4.20 Reasoning | grok-4.20-reasoning | 2M | 32K | ✓ | ✓ | ✓ | — | images |
Image models
| Model | Slug | Sizes | Max N | Aspect ratios | Edits |
|---|---|---|---|---|---|
| Grok Imagine Image | grok-imagine-image | 1K, 2K | 4 | 1:1, 16:9, 9:16, 3:2, 2:3, 4:3, 3:4, 2:1, 1:2, 19.5:9, 9:19.5, 20:9, 9:20, auto | ✓ (up to 5 images) |
| Grok Imagine Image Pro | grok-imagine-image-pro | 1K, 2K | 4 | 1:1, 16:9, 9:16, 3:2, 2:3, 4:3, 3:4, 2:1, 1:2, 19.5:9, 9:19.5, 20:9, 9:20, auto | ✓ (up to 5 images) |
Audio models
Text-to-speech
| Model | Slug | Output formats |
|---|---|---|
| Grok TTS | grok-tts | mp3, wav, pcm, mulaw, alaw |
Video models
| Model | Slug | Durations | Resolutions | Aspect ratios |
|---|---|---|---|---|
| Grok Imagine Video | grok-imagine-video | 1–15s | 480p, 720p | 16:9, 9:16, 1:1, 4:3, 3:4, 3:2, 2:3 |

