DeepSeek Makes V4 Pro's 75% Price Cut Permanent, What the New Price Floor Means for Enterprise AI Costs

May 25, 2026 2 min read DeepSeek API Documentation Partial

Tech Jacks Solutions AI News Coverage

DeepSeek announced on May 23 that its promotional 75% discount on the V4 Pro API won't expire on May 31, it's becoming the permanent base price. For enterprise teams evaluating agentic workflow costs, that changes the math.

deepseek api-pricing llm-cost agentic-ai ai-models-news ai-tools-news enterprise-ai inference-cost

V4 Pro input price, $0.435/M tokens

Key Takeaways

DeepSeek confirmed its 75% promotional discount on V4 Pro API will not expire May 31, it becomes the permanent base price, per DeepSeek's announcement
New permanent input rate: $0.435/M tokens (down from $1.74/M); output $0.87/M; cached input $0.0036/M, verified against API documentation
According to Artificial Analysis, V4 Pro holds a leading position on the cost-to-intelligence frontier; specific comparative ratios require attribution to Artificial Analysis data
Western enterprise adoption may face data sovereignty and geopolitical review independent of the pricing advantage, cost advantage doesn't automatically translate to procurement approval

Model Release

DeepSeek V4 Pro

OrganizationDeepSeek

TypeLLM — Flagship

ParametersNot disclosed

BenchmarkNot disclosed for this pricing announcement

AvailabilityAPI, permanent pricing effective post-May 31 2026

Input Token Price (per million tokens)

DeepSeek V4 Pro (new permanent)

$0.435

DeepSeek V4 Pro (prior rate)

$1.74

DeepSeek V4 Pro cached input

$0.0036

The promotional rate was supposed to end. According to DeepSeek, it isn’t.

On May 23, DeepSeek announced that the 75% discount applied to V4 Pro API access, scheduled to expire May 31, will instead become the permanent base price. Per DeepSeek’s API documentation, V4 Pro input tokens are now priced at $0.435 per million, output at $0.87 per million, and cached input at $0.0036 per million. The prior input rate was $1.74 per million tokens. That’s not a rounding adjustment. It’s a structural reset.

The catch is that the competitive significance depends entirely on what you’re comparing against. According to Artificial Analysis, an independent AI benchmarking firm that tracks cost-to-intelligence ratios across frontier models, DeepSeek V4 Pro holds a leading position on the cost-to-intelligence frontier. Artificial Analysis’s comparative benchmark data places total run costs for V4 Pro well below those of competing frontier models, figures cited from that analysis put DeepSeek’s suite run costs against several thousand dollars for GPT-5.5 and Claude Opus 4.7 alternatives, though those specific numbers haven’t been independently confirmed by TJS from the primary dashboard for this cycle. The directional picture is consistent across secondary coverage.

The pricing structure is particularly advantageous for enterprise agentic workflows that process large token volumes, where per-token costs compound significantly at scale. A workflow that runs 100 million input tokens monthly, not unusual for a moderately active multi-agent system, costs $43.50 at the new DeepSeek rate versus $174 at the prior rate. That differential widens fast against comparable frontier model pricing.

The part nobody mentions:

enterprise adoption in Western markets doesn’t run on pricing alone. Data sovereignty requirements, procurement review timelines, and geopolitical considerations around Chinese AI infrastructure can override a compelling cost argument before it reaches a budget conversation. The price is real. So is that friction.

This announcement advances a story TJS covered in May on inference cost collapse across the frontier model market. DeepSeek’s May 13 cost-performance positioning piece established the competitive framing; the May 23 permanence decision is the follow-through. Where the earlier coverage asked whether DeepSeek’s cost advantage was durable, the permanence announcement answers that question, at least on the pricing side.

Analysis

Three major frontier model pricing events in 30 days, Gemini 3.5 Flash, GPT-5.5 API, DeepSeek V4 Pro permanent, suggest the price floor across the tier-1 API market is actively contested. Enterprise teams benchmarking 2026 agentic infrastructure costs should treat these as inputs to vendor negotiations, not just adoption decisions.

What to watch

whether Anthropic, OpenAI, or Google respond with matching reductions or structural pricing changes before mid-June. Three significant pricing events in 30 days (Gemini 3.5 Flash, GPT-5.5 API, now DeepSeek permanent) suggest the frontier model pricing floor is actively contested, not settled. The next move worth tracking is whether enterprise procurement teams treat this as a switching trigger or a negotiating lever with incumbent providers.

TJS synthesis:

Don’t treat this as a discount story. DeepSeek restructured its price floor before a competitor could respond, locking in a cost-efficiency position at the moment enterprise teams are actively sizing agentic infrastructure budgets. Whether your organization can use it is a separate question from whether it changes your negotiating position with providers you can use. It does. Get the number in front of whoever owns your model spend.

View Source

More Technology intelligence

View all Technology

Deep Dive Available The NDA Evaluation Problem: What GPT-5.6 Sol's METR Assessment Reveals About AI...

Gallery

Contacts