Gallery

Contacts

405 W. Greenlawn Ave Lansing, Michigan 48910

contact@techjacksolutions.com

+1-616-320-4064

Skip to content
Technology Daily Brief Vendor Claim

MiniMax M3 Is Live: Open-Weight Agentic AI With 1M Context at a Fraction of Frontier Model Costs

3 min read OpenRouter Partial Moderate
MiniMax M3 launched June 1 with a 1-million-token context window, independent BenchLM rankings placing it 29th of 119 models overall and 13th for agentic tasks, and API pricing at $0.30 per million input tokens, a fraction of comparable closed-model costs. Open weights are committed within 10 days of launch, which would make the full model downloadable before mid-June.
BenchLM overall rank, #29 of 119

Key Takeaways

  • BenchLM.ai independently ranks MiniMax M3 #29 of 119 models overall (76/100) and #12 of 28 on the verified leaderboard, not a vendor number
  • Launch API pricing is $0.30/$1.20 per million tokens (input/output), roughly 5-10% of comparable closed-model costs per VentureBeat
  • SWE-bench Pro performance claims (59% vs. GPT-5.5 at 58.6%) are MiniMax's own evaluation, independent replication is pending
  • Open weights committed to Hugging Face within 10 days of June 1 launch, a commitment, not a completed release as of publication

Model Release

MiniMax M3
OrganizationMiniMax
TypeOpen Source LLM
ParametersNot disclosed
BenchmarkBenchLM: 76/100 (#29/119 provisional); Agentic 82.4/100; Coding 87.4/100, [SELF-REPORTED] SWE-bench Pro: ~59% per MiniMax
AvailabilityAPI (MiniMax, OpenRouter), open weights committed within 10 days of 2026-06-01

Verification

Partial BenchLM.ai (independent, T2) + OpenRouter (T3) + VentureBeat (T3) SWE-bench Pro head-to-head figures are self-reported by MiniMax. Epoch AI evaluation pending. Open-weight release not yet completed.

MiniMax M3 is available now. The model launched June 1 via the MiniMax API and OpenRouter, with a 1,048,576-token context window built on MiniMax Sparse Attention (MSA) architecture and support for text, image, and video inputs. Launch pricing is $0.30 per million input tokens and $1.20 per million output tokens, a 50% discount from the standard rate of $0.60/$2.40. That standard rate still puts it well below what GPT-5.5 and Gemini 3.1 Pro charge for comparable context sizes.

BenchLM.ai’s independent evaluation places M3 at #29 of 119 models on the provisional leaderboard with an overall score of 76/100, and #12 of 28 on the verified leaderboard. BenchLM’s category breakdowns show an agentic score of 82.4/100 and a coding score of 87.4/100, both from the same BenchLM source as the confirmed overall ranking. BenchLM is an independent third-party evaluator, not a vendor metric.

On coding benchmarks, MiniMax claims M3 scores approximately 59% on SWE-bench Pro, which the company says edges out GPT-5.5 at 58.6% and Gemini 3.1 Pro at 54.2%. Independent replication of those figures is pending. The SWE-bench Pro claims originate from MiniMax’s own announcement, amplified through social posts, not from a replication study. Don’t treat the head-to-head numbers as confirmed until Epoch AI or a comparable independent evaluator weighs in.

MiniMax describes M3 as the first open-weight model to combine native image, video, and computer-use capabilities with a 1M-token context window. That claim hasn’t been verified against prior releases.

API Pricing, Input / Output (per million tokens)

MiniMax M3 (launch)
$0.30 / $1.20
MiniMax M3 (standard)
$0.60 / $2.40
Competitor pricing
[URL-NEEDED: vendor API pricing pages, GPT-5.5, Gemini 3.1 Pro]

Why this matters for your stack

The pricing gap is the real story here. At $0.30/$1.20 per million tokens on launch pricing, M3 costs roughly 5-10% of comparable closed-model API rates, according to VentureBeat’s analysis. For teams running high-volume agentic workflows, where context window usage and token throughput compound quickly, that cost differential makes M3 worth evaluating even before the open weights drop. The 1M context window is already available via API, so you don’t have to wait.

The agentic ranking (BenchLM #13, score 82.4/100) holds up independently. That’s not a vendor number. For teams evaluating models for tool-use and multi-step reasoning tasks, BenchLM’s agentic category is one of the more rigorous available assessment frameworks.

The catch is open-weight timing. Company leadership committed to releasing weights on Hugging Face within 10 days of June 1. That’s a commitment, not a completed action. If you’re making architecture decisions contingent on local or self-hosted deployment, wait for the actual release before finalizing.

What to Watch

Open weights release on Hugging FaceBy ~2026-06-11
SWE-bench Pro independent replicationTBD
Epoch AI evaluationTBD
MiniMax technical report publicationTBD

Disputed Claim

MiniMax M3 scores ~59% on SWE-bench Pro, exceeding GPT-5.5 (58.6%) and Gemini 3.1 Pro (54.2%)
All SWE-bench Pro figures originate from MiniMax's own announcement. No independent replication available as of 2026-06-02.
Use BenchLM.ai's independently confirmed rankings for evaluation decisions. Treat SWE-bench Pro head-to-head as pending confirmation.

What to watch

Three things remain unresolved: independent replication of the SWE-bench Pro figures, the actual open-weight release on Hugging Face (expected by June 11), and the technical report that would detail M3’s architecture, training data, and context handling at scale. Epoch AI evaluation is pending. The BenchLM ranking is solid, but coding benchmark performance at production scale, latency, throughput, cost per completed task, isn’t captured in leaderboard scores.

TJS synthesis

M3’s BenchLM rankings are real. The pricing is real. The SWE-bench Pro head-to-head isn’t independently confirmed, so price and context window are the evaluation anchors right now. Wait for independent benchmarks before migrating production agentic workloads. If you’re cost-sensitive and can tolerate self-reported benchmark uncertainty, the API is worth testing against your own evaluation suite this week, the weights aren’t a prerequisite for that.

View Source
More Technology intelligence
View all Technology

Related Coverage

Stay ahead on Technology

Get verified AI intelligence delivered daily. No hype, no speculation, just what matters.

Explore the AI News Hub