AI Video News: Grok Imagine Video 1.5 Launches, 25-Second Generation, Better Physics, Audio in One Pass

June 17, 2026 3 min read xAI Blog, Grok Imagine Video 1.5 Confirmed Strong

Tech Jacks Solutions AI News Coverage

xAI released Grok Imagine Video 1.5 on June 16, making it generally available on the Grok Imagine API and rolling out Video 1.5 Fast to grok.com/imagine, iOS, and Android. The confirmed headline number: a 6-second 720p video now generates in approximately 25 seconds, down from 40-plus seconds with the previous model.

ai-video-news xai grok-imagine video-generation ai-tools-news ai-developer-tools-news ai-models-news

Video generation time, ~25 sec (6s 720p)

Key Takeaways

Grok Imagine Video 1.5 generally available on Grok Imagine API as of June 16; Video 1.5 Fast live on grok.com/imagine, iOS, Android
Confirmed speed improvement: 6-second 720p video in ~25 seconds, down from 40+ seconds, approximately 40% faster (vendor-stated, verified from fetched source)
Audio and speech generate in the same pass as video, architectural improvement, not just a feature addition "State-of-the-art" claim is xAI's own framing, no third-party benchmark comparing Video 1.5 against Sora, Kling, or Runway has been published

Model Release

Grok Imagine Video 1.5

OrganizationxAI

TypeAI Tool Update — Video Generation

ParametersNot disclosed

BenchmarkNo third-party benchmark published. Vendor-stated: 6-second 720p video in ~25 seconds (down from 40+ seconds).

AvailabilityGrok Imagine API (general availability) + grok.com/imagine + iOS + Android

Video Generation Speed, 6-second 720p clip (vendor-stated)

Grok Imagine Video 1.5 Fast

~25 seconds

Grok Imagine Video 1.0 (previous)

40+ seconds

Sora / Kling / Runway equivalents

No comparable public benchmark available

The speed improvement is real and vendor-confirmed. Per xAI’s release post, Grok Imagine Video 1.5 Fast produces 6-second, 720p videos in approximately 25 seconds, down from 40-plus seconds with the prior model. That’s roughly a 40% reduction in generation time. For developers running API-based video workflows, latency is often the constraint on usability, this moves the number meaningfully.

Video 1.5 is generally available on the Grok Imagine API as of June 16. Video 1.5 Fast, the speed-optimized variant, is live on grok.com/imagine and on xAI’s iOS and Android apps. The two tiers serve different use cases: the standard API release for production pipeline integration, the Fast variant for real-time and consumer-facing applications.

Beyond speed, xAI claims improvements across audio, motion, and physics. Audio and speech generate in the same pass as video, that’s an architectural note, not just a feature claim, because it means the model doesn’t require a separate audio generation step. Motion consistency and physical plausibility are described as improved over Video 1.0. xAI describes this as its best image-to-video model to date. No third-party benchmark comparing Grok Imagine Video 1.5 against Sora, Kling, or Runway has been published. That “state-of-the-art” framing is xAI’s positioning, not an established fact.

Disputed Claim

Grok Imagine Video 1.5 is state-of-the-art for video generation

Self-asserted by xAI; no independent benchmark evaluation against competing models (Sora, Kling, Runway) has been published

Use the confirmed speed metric (25 sec / 6-sec 720p) for API adoption decisions. Wait for third-party comparative benchmarks before evaluating quality positioning.

The part nobody mentions in video model announcements: vendor speed benchmarks are measured under controlled conditions. Generation time at production batch sizes, when you’re running dozens of concurrent requests through the API, typically differs from the headline latency figure. The 25-second number is a useful baseline, but developers should run their own throughput tests at their expected concurrency levels before committing to pipeline architecture decisions.

The Grok Build v0.2.52 release notes also document a Grok for PowerPoint add-in, which allows users to generate and edit slide presentations within Microsoft PowerPoint, according to the Grok Build changelog. That source wasn’t directly fetched during verification, treat the PowerPoint feature as confirmed from vendor documentation but not independently corroborated. The capability described: text outline to slides, expand existing decks, tighten narratives within the PowerPoint interface. It’s a niche addition relative to the video release, but it signals xAI’s push into enterprise productivity surfaces beyond the core model API.

For context on the broader Grok Build picture, including the Agent Dashboard released June 15: the prior brief covers that release in full. This brief focuses on the video and PowerPoint additions.

What to Watch

Independent benchmark evaluation of Grok Imagine Video 1.5 vs. competing models2-4 weeks

Developer community throughput reports at production API concurrency1-2 weeks

Grok for PowerPoint add-in independent coverage or user reports1-2 weeks

What to watch

Two specific signals: First, whether independent benchmark organizations publish a comparative video generation evaluation including Grok Imagine Video 1.5 in the next two to four weeks, that’s when the SOTA claim becomes testable. Second, developer community throughput reports at production concurrency. The headline latency number answers whether the model is fast enough for real-time applications in single-request conditions. Batch performance answers whether it’s fast enough for production pipelines.

TJS synthesis

The confirmed improvement, 40% generation time reduction, audio-in-one-pass architecture, is meaningful for developers already on the Grok Imagine API. For developers evaluating which video generation API to adopt, wait for independent benchmarks before treating xAI’s SOTA claim as a selection criterion. The speed number is the most useful figure in this announcement. Use it, test it at your concurrency, and reserve judgment on overall quality positioning until third-party evaluations publish.