Grok Imagine API: A Complete Python Guide With Examples

Learn how to generate videos using the Grok Imagine API. This Python guide covers everything from image animations to video editing with the new xAI video model.

Feb 2, 2026 · 8 min read

xAI just released the Grok Imagine API, which generates AI videos using their Grok Imagine model. xAI claims the model is on par with other current top models, even though it was optimized for low latency and cost.

In this article, we will see whether Grok Imagine lives up to these promises. I will show you how to set up the API locally and how to use Grok Imagine with text, image, or video input to generate videos in Python.

If you want to learn more about the latest releases in this space, check out our guide to the top video generation models.

What is Grok Imagine?

Grok Imagine is a video generation model from xAI. It accepts text, image, and video input and generates a video with native sound. Being able to receive video input is especially interesting, since it allows us to perform prompt-driven edits.

Their model outperforms the latest Veo models from Google and Sora from OpenAI, both in price and in time to generate a video.

The above comparison chart focuses only on price and speed. As we can see, Grok Imagine takes the first place in both categories.

When it comes to the videos themselves, the announcement only compares Grok Imagine with Kling o1 and Runway Aleph. The experiment consisted of sending the same prompt to all models and having human evaluators select their preferred result.

Apart from the missing comparison with Sora and Veo, I found it odd that Runway Aleph isn't even the latest model released by Runway ML. Still, Grok Imagine takes the first spot in the Elo-based Artificial Analysis text-to-video ranking, even outranking Runway Gen 4.5.

How to Access Grok Imagine

Grok Imagine can be accessed either via its web interface or its API. In this tutorial, we focus on calling the API from Python.

When using a model via the API, we need an account but not a subscription. Instead, we create an API key in our account that charges us on a per-video basis. Below, you can see the pricing structure of the Grok Imagine API:

Grok Imagine API Setup

Before creating our first Grok Imagine video, we need to generate an API key and install the necessary dependencies.

Generating an API key

The first step in setting up the Grok Imagine API is to generate an API key. To create one, go to the xAI console and log in with your xAI account. You will need a team, so create one for your account if none exists yet.

Next, navigate to the API Keys tab and click the Create API Key button on the right.

We copy the key into a file named .env that we create in the same folder where we'll write our Python code. The file should have the following format:

XAI_API_KEY="your_api_key"

Installing Python xAI packages

To interact with the Grok Imagine API, we install two Python packages:

xai-sdk: The official xAI package that allows us to make API requests.
python-dotenv: A helper package that makes it easy to load the API key from the .env file.

We install these packages using the command:

pip install xai-sdk python-dotenv
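Before sending any request, it can help to confirm that the key is actually available. The snippet below is a minimal sketch that assumes the key is read from the XAI_API_KEY environment variable, as the .env file above suggests. The helper name require_api_key is hypothetical and not part of xai-sdk or python-dotenv.

```python
import os
from typing import Mapping, Optional


def require_api_key(
    env: Optional[Mapping[str, str]] = None,
    name: str = "XAI_API_KEY",
) -> str:
    """Return the API key from the environment or fail with a clear message.

    Hypothetical helper: it only inspects environment variables and does not
    contact the xAI API, so it cannot tell whether the key is valid.
    """
    env = os.environ if env is None else env
    key = env.get(name, "").strip()
    # Reject both a missing key and the placeholder from the .env template
    if not key or key == "your_api_key":
        raise RuntimeError(f"{name} is not set. Add it to your .env file.")
    return key
```

Calling require_api_key() right after load_dotenv() should surface a missing or placeholder key early, rather than as a failed API request later on.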

Generating our first video with Grok Imagine

With the steps above complete, we can now generate a video. To do so, we import the two packages we just installed, load the API key, initialize the xAI client, and finally, send a video generation request.

Here's a sample Python script for doing so:

from xai_sdk import Client
from dotenv import load_dotenv

# Load the API key from the .env file
load_dotenv()

# Initialize the xAI client
client = Client()

# Send a video generation request
prompt = """
A pixel art cat playing with a ball.
"""
response = client.video.generate(
    prompt=prompt,
    model="grok-imagine-video",
)

# Display the URL of the video
print(f"Video URL: {response.url}")

Here’s the video that was generated:

Note: All the code for this article can be found in this GitHub repository.
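Since low latency is one of the model's main selling points, it may be worth timing a request yourself. The helper below is a small sketch for measuring how long any blocking call takes. The name timed_call is hypothetical and the helper is generic Python, not something provided by xai-sdk.

```python
import time
from typing import Any, Callable, Tuple


def timed_call(fn: Callable[..., Any], *args: Any, **kwargs: Any) -> Tuple[Any, float]:
    """Call fn with the given arguments and return (result, elapsed_seconds)."""
    start = time.perf_counter()
    result = fn(*args, **kwargs)
    elapsed = time.perf_counter() - start
    return result, elapsed
```

With the client from the script above, this could be used as response, seconds = timed_call(client.video.generate, prompt=prompt, model="grok-imagine-video"). The measured time includes network overhead, so it is only a rough indication of generation speed.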

Downloading the video

The code above will wait for the video to finish generating, then print the video's URL. We can also download the video using this download_video() function:

import requests
from pathlib import Path
from urllib.parse import urlparse


def download_video(url: str, output_dir: str = "."):
    # Extract the filename from the URL path
    filename = Path(urlparse(url).path).name
    if not filename:
        raise ValueError("Could not determine filename from URL")
    output_path = Path(output_dir) / filename

    # Stream the response to disk in chunks to keep memory usage low
    with requests.get(url, stream=True) as r:
        r.raise_for_status()
        with open(output_path, "wb") as f:
            for chunk in r.iter_content(chunk_size=8192):
                if chunk:
                    f.write(chunk)
    print(f"Video saved to {output_path.resolve()}")

After defining this function, calling download_video(response.url) will download the video to your working directory.
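The filename extraction inside download_video() can also be pulled out and checked on its own. The sketch below is one possible variant that falls back to a default name instead of raising an error. Both the helper name filename_from_url and the default "video.mp4" are assumptions made for illustration.

```python
from pathlib import PurePosixPath
from urllib.parse import unquote, urlparse


def filename_from_url(url: str, default: str = "video.mp4") -> str:
    """Return the last path segment of a URL, or a default if there is none.

    The query string and fragment are ignored because only the parsed
    path component is inspected.
    """
    path = unquote(urlparse(url).path)
    name = PurePosixPath(path).name
    return name or default
```

For a URL ending in /xai-video-2109c762-efcb-415b-ab3c-661b1df113cd.mp4, this returns the .mp4 filename, while a URL without a path segment yields the default.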

A full example with video download can be found in the repository.

Generating Videos With the Grok Imagine API

We learned how to generate videos from text using the xAI API. Next, we’ll explore the full capabilities of the model. For more information, you can always consult the xAI documentation.

Exploring video generation options

Grok Imagine provides three main options when generating videos:

duration: The video duration in seconds, provided as a number from 1 to 15.
aspect_ratio: The aspect ratio of the video. The model supports the following aspect ratios: "1:1", "16:9", "9:16", "4:3", "3:4", "3:2", and "2:3".
resolution: The resolution for the video, either "720p" or "480p".
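Checking these options locally before sending a request can save a failed API call. The following is a minimal sketch based only on the ranges and values listed above. The function name validate_video_options is hypothetical, and the API itself may accept or reject values differently.

```python
from typing import Any, Dict, Optional

# Values taken from the options listed in this article
SUPPORTED_ASPECT_RATIOS = {"1:1", "16:9", "9:16", "4:3", "3:4", "3:2", "2:3"}
SUPPORTED_RESOLUTIONS = {"720p", "480p"}


def validate_video_options(
    duration: Optional[float] = None,
    aspect_ratio: Optional[str] = None,
    resolution: Optional[str] = None,
) -> Dict[str, Any]:
    """Return a dict of the options that were set, raising ValueError on bad values."""
    options: Dict[str, Any] = {}
    if duration is not None:
        if not 1 <= duration <= 15:
            raise ValueError("duration must be between 1 and 15 seconds")
        options["duration"] = duration
    if aspect_ratio is not None:
        if aspect_ratio not in SUPPORTED_ASPECT_RATIOS:
            raise ValueError(f"unsupported aspect ratio: {aspect_ratio}")
        options["aspect_ratio"] = aspect_ratio
    if resolution is not None:
        if resolution not in SUPPORTED_RESOLUTIONS:
            raise ValueError(f"unsupported resolution: {resolution}")
        options["resolution"] = resolution
    return options
```

The returned dictionary can be unpacked into the request, for example client.video.generate(prompt=prompt, model="grok-imagine-video", **validate_video_options(duration=15)).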

Here's an example of how we can set the above options in the API request:

prompt = """
A person stands holding their phone, gazing at a stunning landscape 
photo on the screen. The image begins to subtly move and glow. 
Suddenly, the phone pulls them in, and they are sucked through the screen, 
transitioning seamlessly into the vast, breathtaking landscape itself.
"""
response = client.video.generate(
    prompt=prompt,
    model="grok-imagine-video",
    duration=15,
    aspect_ratio="9:16",
    resolution="480p",
)

Here's the result:

I find this result to be underwhelming, to say the least.

Generating a video from an image

One of the most powerful features of AI video generation models is their ability to generate videos from an image. This workflow makes it much easier to create consistent videos, provided we have a base image, as the model only needs to generate motion.

Imagine wanting to create a video with a specific character or person. Providing an image, in theory, at least guarantees character accuracy.

To generate a video from an image, we use the image_url parameter with a URL to the image we want to use. According to the xAI documentation, when an image is provided, it is used as the first frame of the video. We need to make sure, however, that it matches the requested aspect ratio.
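One way to deal with the aspect ratio requirement is to pick the supported ratio that is closest to the image's own dimensions. The helper below is a sketch of that idea using the list of ratios from the options section. The name closest_aspect_ratio is hypothetical, and whether the API tolerates small mismatches is not something this article verifies.

```python
import math

# Aspect ratios listed as supported in this article
SUPPORTED_ASPECT_RATIOS = ("1:1", "16:9", "9:16", "4:3", "3:4", "3:2", "2:3")


def closest_aspect_ratio(width: int, height: int) -> str:
    """Return the supported aspect ratio label closest to width / height."""
    if width <= 0 or height <= 0:
        raise ValueError("width and height must be positive")
    # Compare on a log scale so landscape and portrait are treated symmetrically
    target = math.log(width / height)

    def distance(label: str) -> float:
        w, h = label.split(":")
        return abs(math.log(int(w) / int(h)) - target)

    return min(SUPPORTED_ASPECT_RATIOS, key=distance)
```

For example, a 1920x1080 image maps to "16:9" and a 1080x1920 image to "9:16". If the image does not match any ratio exactly, cropping it to the returned ratio before uploading is one option.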

I tried generating an FPV drone shot of people riding horses at the beach from the photo below, which I took a while back. Note that because the image needs to be provided as a URL, we first need to upload it somewhere. In my case, I used the GitHub repository associated with this article.
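For files hosted in a public GitHub repository, the direct link can be assembled from the repository details. The sketch below follows the URL pattern of the image link used in this article. The helper name raw_github_url is hypothetical, and the repository needs to be public for the URL to be reachable without authentication.

```python
from urllib.parse import quote


def raw_github_url(owner: str, repo: str, branch: str, path: str) -> str:
    """Build a raw.githubusercontent.com URL for a file in a public repository."""
    # Percent-encode special characters in the path but keep the slashes
    safe_path = quote(path.lstrip("/"))
    return (
        f"https://raw.githubusercontent.com/{owner}/{repo}"
        f"/refs/heads/{branch}/{safe_path}"
    )
```

Calling raw_github_url("fran-aubry", "grok-imagine-tutorial", "main", "resources/horses.jpeg") reproduces the image URL passed to the request in this section.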

prompt = """
A FPV drone shot of the people riding the horses on the beach.
"""
response = client.video.generate(
    prompt=prompt,
    model="grok-imagine-video",
    image_url="https://raw.githubusercontent.com/fran-aubry/grok-imagine-tutorial/refs/heads/main/resources/horses.jpeg",
)

This is the video Grok Imagine generated:

Editing Videos With the Grok Imagine API

Then, I used that video’s URL to ask the model to add fire to the balls. Below is the request for editing the video. The full code can be found in the GitHub repository.

prompt = """
Add fire to the balls.
"""
response = client.video.generate(
    prompt=prompt,
    model="grok-imagine-video",
    video_url="https://vidgen.x.ai/xai-vidgen-bucket/xai-video-2109c762-efcb-415b-ab3c-661b1df113cd.mp4",
)
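Because each response exposes the URL of the generated video, several edits can be chained by feeding one result into the next request. The function below is a sketch of that loop. It assumes a client object with the same client.video.generate(...) call and response.url attribute used in this article, and the name chain_video_edits is hypothetical.

```python
from typing import Any, Iterable, List


def chain_video_edits(
    client: Any,
    video_url: str,
    edit_prompts: Iterable[str],
    model: str = "grok-imagine-video",
) -> List[str]:
    """Apply edit prompts one after another and return the URL of each result."""
    urls: List[str] = []
    current_url = video_url
    for prompt in edit_prompts:
        # Each request edits the video produced by the previous step
        response = client.video.generate(
            prompt=prompt,
            model=model,
            video_url=current_url,
        )
        current_url = response.url
        urls.append(current_url)
    return urls
```

Keeping every intermediate URL makes it possible to compare the steps afterwards. Note that each edit is billed as a separate video, and errors from earlier steps carry over into later ones.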

I also did a third edit where I asked to replace the person with a cat. Here are the results:

This is another example where the results are quite bad. In the first video, a new path was created that is inconsistent with the scene. The snow edit looked good at first, but then I realized it left the background unchanged, making it inconsistent with the now snowy foreground.

Conclusion

It’s encouraging to see more video models arrive with API access, since that genuinely expands what we can build and automate, from quick prototypes to full pipelines.

Grok Imagine’s API is refreshingly simple to set up and call, but it does come with friction points. Most notably, the requirement to pass images and videos by URL makes basic workflows (like iterating on local assets) more cumbersome than necessary.

In practice, my results were underwhelming compared to the lofty claims: text-to-video often missed the mark, and image-to-video introduced noticeable artifacts and inconsistencies. The one area where it did shine was editing, where prompt-driven changes felt more reliable and controllable.

One very strong point of Grok Imagine is its speed. I've used a lot of AI video generation models, and in my experience, Grok Imagine is by far the fastest.

I’m optimistic about where this space is headed, but for now Grok Imagine feels more like a promising editor than a best-in-class generator, and I hope future updates broaden input options and boost core generation quality.

To those of you who want to learn more about the techniques used in AI video generation, I recommend enrolling in our AI Fundamentals skill track.

Author

François Aubry

Full-stack engineer & founder at CheapGPT. Teaching has always been my passion. From my early days as a student, I eagerly sought out opportunities to tutor and assist other students. This passion led me to pursue a PhD, where I also served as a teaching assistant to support my academic endeavors. During those years, I found immense fulfillment in the traditional classroom setting, fostering connections and facilitating learning. However, with the advent of online learning platforms, I recognized the transformative potential of digital education. In fact, I was actively involved in the development of one such platform at our university. I am deeply committed to integrating traditional teaching principles with innovative digital methodologies. My passion is to create courses that are not only engaging and informative but also accessible to learners in this digital age.
