AI Model Rankings

Top 10 Open-Weight LLMs in 2026 (Ranked by Arena Elo)

These are the ten highest-rated open-weight large language models by LMArena Chatbot Arena Elo, restricted to models whose weights you can actually download. Eight of the ten ship under genuinely open MIT or Apache 2.0 licenses. We flag which models carry restrictive custom licenses, and we name the source behind every score so you can verify it.

  • 1454: Top Arena Elo (GLM-5). Onyx AI tracker, Mar 2026.
  • 8 of 10: MIT or Apache 2.0 licensed. Fully open licenses.
  • 10: Ranked open-weight models. Publicly downloadable weights.
  • Feb-Mar 2026: Data snapshot. Scores shift over time.

How We Ranked These Models

Methodology

Ranked by LMArena Chatbot Arena Elo among models with publicly downloadable weights, as of Feb-Mar 2026. Figures are drawn from LMArena, Onyx AI, LLM-Stats, and Iternal. Open weight is not the same as open source, so license restrictiveness is flagged separately for each model.

Scores shift continuously as more head-to-head votes accumulate and new models enter the arena. Treat this order as directional, not absolute. Different trackers report slightly different figures for the same model, so we present a representative number and name the tracker it came from.

Arena Elo measures how often human voters prefer one model's response over another in blind head-to-head comparisons. It reflects human preference on open-ended prompts rather than performance on a single static benchmark, which is why it is a useful primary sort for a general-purpose ranking. We pair it with one headline benchmark per model (SWE-bench Verified, MMLU-Pro, LiveCodeBench, GPQA Diamond, or HumanEval) so you can see where each model's measured strengths lie.
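To make an Elo gap concrete, here is a minimal sketch that converts two ratings into an expected preference rate. It assumes the tracker figures behave like conventional Elo ratings (logistic curve, base 10, scale 400). The arena's own fitting procedure may differ, so treat the result as an approximation. The function name `expected_preference` is our own, not part of any tracker's API.

```python
def expected_preference(elo_a: float, elo_b: float) -> float:
    """Probability that voters prefer model A over model B under the standard Elo curve."""
    return 1.0 / (1.0 + 10 ** ((elo_b - elo_a) / 400.0))


# Ratings taken from the ranking table on this page.
glm_5 = 1454
mimo_v2_flash = 1393

p = expected_preference(glm_5, mimo_v2_flash)
print(f"GLM-5 preferred over MiMo-V2-Flash in about {p:.1%} of votes")
```

Under that assumption, the 61-point gap between first and tenth place works out to a preference rate a little under 59%. That supports reading the order as directional rather than absolute.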

We restricted the list to models whose weights are genuinely downloadable. We then separated the leaders by license type. Models under MIT or Apache 2.0 are fully open. Models under custom vendor licenses, such as Meta's Llama Community License and Google's Gemma License, are open weight but carry usage restrictions, so we flag them in their own section rather than mixing them into the headline ten.


The Full Ranking

All ten ranked open-weight models at a glance, ordered by Arena Elo. Every score traces to the independent trackers named in the methodology above.

# | Model | Organization | License | Arena Elo | Headline Benchmark
1 | GLM-5 | Zhipu AI | MIT | 1454 | SWE-bench Verified 77.8
2 | Qwen 3.5 (397B) | Alibaba | Apache 2.0 | 1450 | MMLU-Pro 87.8
3 | GLM-4.7 | Zhipu AI | MIT | 1441 | SWE-bench Verified 73.8
4 | Kimi K2.5 | Moonshot AI | MIT | 1438 | SWE-bench Verified 76.8
5 | DeepSeek V3.2 | DeepSeek | MIT | 1423 | MMLU-Pro 85.0
6 | Qwen3-235B-A22B | Alibaba | Apache 2.0 | 1423 | GPQA Diamond 71.1
7 | Mistral Large 3 | Mistral | Apache 2.0 | 1416 | HumanEval 92.0
8 | MiniMax M2.5 | MiniMax | Apache 2.0 | 1404 | SWE-bench Verified 80.2
9 | DeepSeek R1 | DeepSeek | MIT | 1398 | MMLU-Pro 84.0
10 | MiMo-V2-Flash | Xiaomi | MIT | 1393 | MMLU-Pro 84.9

Elo figures are a representative Feb-Mar 2026 snapshot from LMArena, Onyx AI, LLM-Stats, and Iternal. Benchmark scores are the headline metric reported for each model on those trackers. Parameter counts: GLM-5 744B, Qwen 3.5 397B, GLM-4.7 355B, Kimi K2.5 1T, DeepSeek V3.2 685B, Qwen3-235B-A22B 235B, Mistral Large 3 675B, MiniMax M2.5 230B, DeepSeek R1 671B, MiMo-V2-Flash 309B.
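If you want to slice the table yourself, the sketch below holds the ten rows as plain Python tuples and shows two views: the rows re-sorted by organization, and the top-scoring model within each headline benchmark. The values are copied from the table above. The names `RANKING` and `leaders_by_benchmark` are illustrative only.

```python
# (rank, model, organization, license, arena_elo, benchmark, score), copied from the table above.
RANKING = [
    (1, "GLM-5", "Zhipu AI", "MIT", 1454, "SWE-bench Verified", 77.8),
    (2, "Qwen 3.5 (397B)", "Alibaba", "Apache 2.0", 1450, "MMLU-Pro", 87.8),
    (3, "GLM-4.7", "Zhipu AI", "MIT", 1441, "SWE-bench Verified", 73.8),
    (4, "Kimi K2.5", "Moonshot AI", "MIT", 1438, "SWE-bench Verified", 76.8),
    (5, "DeepSeek V3.2", "DeepSeek", "MIT", 1423, "MMLU-Pro", 85.0),
    (6, "Qwen3-235B-A22B", "Alibaba", "Apache 2.0", 1423, "GPQA Diamond", 71.1),
    (7, "Mistral Large 3", "Mistral", "Apache 2.0", 1416, "HumanEval", 92.0),
    (8, "MiniMax M2.5", "MiniMax", "Apache 2.0", 1404, "SWE-bench Verified", 80.2),
    (9, "DeepSeek R1", "DeepSeek", "MIT", 1398, "MMLU-Pro", 84.0),
    (10, "MiMo-V2-Flash", "Xiaomi", "MIT", 1393, "MMLU-Pro", 84.9),
]


def leaders_by_benchmark(rows):
    """Return {benchmark: (model, score)} holding the top-scoring model per benchmark."""
    best = {}
    for _rank, model, _org, _license, _elo, benchmark, score in rows:
        if benchmark not in best or score > best[benchmark][1]:
            best[benchmark] = (model, score)
    return best


# Sort by organization name, then by Elo (highest first) within each organization.
by_org = sorted(RANKING, key=lambda row: (row[2], -row[4]))

for benchmark, (model, score) in leaders_by_benchmark(RANKING).items():
    print(f"{benchmark}: {model} ({score})")
```

Scores are only comparable within the same benchmark. A HumanEval 92.0 and a GPQA Diamond 71.1 measure different things, which is why the helper groups by benchmark instead of ranking all ten scores together.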


Strong Models With Restrictive Licenses

These models post competitive Arena Elo scores and their weights are downloadable, but most of them ship under custom licenses rather than MIT or Apache 2.0. We keep them out of the headline ten so readers do not assume they carry the same freedoms. They are still worth knowing about, particularly Llama for its large fine-tuning ecosystem. Step-3.5-Flash is the exception: it is Apache 2.0 licensed and appears here only to show where the next tier below the headline ten begins.

Llama 4 Maverick
Meta | Arena Elo 1328 | MMLU-Pro 80.5
Ships under the Llama Community License. Restrictions include a 700 million monthly active user cap, a clause against using outputs to train competing models, and a carve-out limiting multimodal use in the EU. Open weight, but not open source.

Llama 4 Scout
Meta | Arena Elo 1323 | MMLU-Pro 74.3
The long-context, efficiency-focused variant in the Llama 4 family. Same Llama Community License restrictions apply. A common choice where the 700M MAU cap is not a concern and a large community of fine-tunes is valuable.

Gemma 3 27B
Google | Arena Elo 1366
Distributed under the Gemma License, a custom acceptable-use agreement with terms that differ from standard open-source licenses. Capable and efficient for its size, but verify the license terms fit your use case before deploying commercially.

Step-3.5-Flash
Stepfun | Arena Elo 1389 | Apache 2.0
An Apache 2.0 model that sits just below the headline ten on Elo, notable for very low hosted API pricing of around $0.10 input and $0.30 output per million tokens. Listed here for context on where the next tier of open models begins.
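
As a rough illustration of what per-million-token pricing means in practice, the sketch below estimates a bill from the approximate Step-3.5-Flash rates quoted above. The rates are the "around" figures from this page, not a vendor quote, and the token volumes are hypothetical.

```python
# Approximate hosted rates quoted above, in USD per million tokens (assumed, not a vendor quote).
INPUT_USD_PER_MTOK = 0.10
OUTPUT_USD_PER_MTOK = 0.30


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate a hosted API bill from token counts at the rates above."""
    total = input_tokens * INPUT_USD_PER_MTOK + output_tokens * OUTPUT_USD_PER_MTOK
    return total / 1_000_000


# Hypothetical workload: 2M input tokens and 500K output tokens.
cost = estimate_cost_usd(2_000_000, 500_000)
print(f"Estimated cost: ${cost:.2f}")
```

At those rates the example workload costs about $0.35: $0.20 for input plus $0.15 for output. Check the provider's current price sheet before budgeting, since hosted pricing changes often.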

Open Weight Is Not the Same as Open Source

These two phrases get used interchangeably, but they mean different things, and the difference affects what you are legally allowed to do.

  • Open weight means the trained model parameters are published and you can download them. You can run the model on your own hardware, fine-tune it, and inspect its behavior. Every model on this page is open weight.
  • Open source is a stricter standard tied to a license that grants broad rights to use, modify, and redistribute without unusual restrictions. MIT and Apache 2.0 meet this bar. Custom vendor licenses, such as the Llama Community License and the Gemma License, generally do not.

Why it matters: a model can be open weight while still limiting commercial scale, restricting how outputs may be used, or carving out certain regions or use cases. Eight of the ten ranked models above use MIT or Apache 2.0, which is why they make the headline list. The restrictive-license models are flagged separately precisely so you can tell the difference at a glance. Before you build a product on any of these, read the actual license text rather than assuming open weight implies no strings attached.
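
The distinction above can be turned into a first-pass triage check. This minimal sketch treats only the two licenses this page calls fully open (MIT and Apache 2.0) as open source and labels everything else as custom terms. The function name and labels are our own. It is a sorting aid, not a substitute for reading the license text.

```python
# The two licenses this page treats as fully open.
PERMISSIVE_LICENSES = {"MIT", "Apache 2.0"}


def license_tier(license_name: str) -> str:
    """Label a license as 'open source' or 'open weight, custom terms'."""
    if license_name.strip() in PERMISSIVE_LICENSES:
        return "open source"
    return "open weight, custom terms"


# License names as listed on this page.
models = {
    "GLM-5": "MIT",
    "Qwen 3.5 (397B)": "Apache 2.0",
    "Llama 4 Maverick": "Llama Community License",
    "Gemma 3 27B": "Gemma License",
}

for name, license_name in models.items():
    print(f"{name}: {license_tier(license_name)}")
```

Anything that lands in the custom-terms bucket needs a closer look at usage caps, output restrictions, and regional carve-outs before you build on it.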


Newer Models Without Public Elo Yet

Several newer open-weight releases had shipped at the time these sources were compiled but did not yet have a public Arena Elo. We do not assign scores to models that lack independent leaderboard data, so these are noted as pending rather than ranked:

  • DeepSeek V4 (Pro and Flash): Released, but no public Arena Elo recorded in these sources yet.
  • GLM-5.1: A newer revision of the top-ranked GLM family, not yet tracked on the arena.
  • Kimi K2.6: A successor to Kimi K2.5, also awaiting public Elo data.

When these models accumulate enough head-to-head votes to register a stable Arena Elo, the standings above will change. That is the nature of a live leaderboard. We would rather mark a model as pending than publish a number we cannot trace to a source.
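
To see why vote volume matters, the sketch below inverts the standard Elo curve to turn a head-to-head win rate into a rating gap, then attaches a rough 95% range using a normal approximation. The 55% win rate and both vote counts are hypothetical. The arena computes its own statistics and may use a different method, so this only illustrates the general effect.

```python
import math


def elo_gap_from_win_rate(p: float) -> float:
    """Invert the standard Elo curve: the rating gap implied by a win rate p."""
    return 400.0 * math.log10(p / (1.0 - p))


def elo_gap_interval(win_rate: float, votes: int, z: float = 1.96):
    """Approximate 95% range for the Elo gap via a normal approximation on the win rate."""
    se = math.sqrt(win_rate * (1.0 - win_rate) / votes)
    # Clamp to keep the logarithm defined at very small vote counts.
    low = max(win_rate - z * se, 1e-6)
    high = min(win_rate + z * se, 1.0 - 1e-6)
    return elo_gap_from_win_rate(low), elo_gap_from_win_rate(high)


for votes in (100, 10_000):  # hypothetical vote counts
    low, high = elo_gap_interval(0.55, votes)
    print(f"{votes:>6} votes: gap between {low:+.0f} and {high:+.0f} Elo")
```

With only 100 votes the range straddles zero, so the data cannot even say which model is ahead. With 10,000 votes the same win rate pins the gap down to a much narrower band.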


Frequently Asked Questions

What is the highest-ranked open-weight LLM in 2026?

As of the Feb-Mar 2026 snapshot, GLM-5 from Zhipu AI holds the top Arena Elo among open-weight models at roughly 1454 on independent trackers, narrowly ahead of Alibaba's Qwen 3.5. GLM-5 is released under the MIT license, which is fully open. Rankings shift as new models are evaluated, so treat the order as directional.

Is open weight the same as open source?

No. Open weight means the trained model weights are publicly downloadable. Open source is a stricter standard tied to a permissive license. Most models on this list use MIT or Apache 2.0, which are genuinely open. Others, such as Meta Llama and Google Gemma, ship under custom licenses with restrictions, so they are open weight but not open source in the strict sense.

Why are Llama 4 and Gemma flagged separately?

Both ship under restrictive custom licenses rather than MIT or Apache 2.0. The Llama Community License imposes a 700 million monthly active user cap, a clause against training competing models, and an EU multimodal carve-out. The Gemma License is a custom acceptable-use agreement. We separate them so readers do not assume they carry the same freedoms as the MIT and Apache-licensed leaders.

How current is this Arena Elo ranking?

The Elo figures reflect a Feb-Mar 2026 snapshot drawn from LMArena, Onyx AI, LLM-Stats, and Iternal. Arena Elo moves continuously as more head-to-head votes accumulate and as new models enter. Different trackers report slightly different numbers for the same model. Verify current standings on LMArena before making a decision.

Why are DeepSeek V4 and GLM-5.1 not on the list?

Newer releases such as DeepSeek V4, GLM-5.1, and Kimi K2.6 had shipped but did not yet have a public Arena Elo at the time these sources were compiled. We do not assign scores to models that lack independent leaderboard data, so they are noted as pending rather than ranked.


Ranked from independent leaderboards (LMArena, LLM-Stats, Onyx AI), data as of Feb-Mar 2026. Scores shift; verify current standings.

GLM is a trademark of Zhipu AI. Qwen is a trademark of Alibaba. Kimi is a trademark of Moonshot AI. DeepSeek is a trademark of DeepSeek. Mistral is a trademark of Mistral AI. MiniMax is a trademark of MiniMax. MiMo is a trademark of Xiaomi. Step is a trademark of Stepfun. Gemma is a trademark of Google LLC. Llama is a trademark of Meta Platforms. All model and organization names are trademarks of their respective owners. Tech Jacks Solutions is editorially independent and is not affiliated with or endorsed by any organization named here.

Before You Use AI

Your Privacy

Open-weight models give you a meaningful privacy advantage: when you run the weights on your own hardware, your prompts and data never leave your environment by default. That is one of the strongest arguments for the models on this list. However, if you access these same models through a hosted API from the vendor or a third-party provider, your data is processed on remote servers under that provider's retention and training policies. Several of the organizations here are based in jurisdictions with different data protection regimes. Before sending sensitive information to any hosted endpoint, review the provider's terms. For regulated workloads, prefer local or self-hosted deployment.

Mental Health & AI Dependency

Open-weight models can be run without rate limits or paywalls, which makes round-the-clock access easy and can quietly encourage over-reliance. Stay aware of when you are using a model as a genuine tool versus a substitute for your own judgment, and remember that a high leaderboard score does not make a model's answers correct. If you or someone you know is experiencing a mental health crisis:

  • 988 Suicide & Crisis Lifeline -- Call or text 988 (US)
  • SAMHSA Helpline -- 1-800-662-4357
  • Crisis Text Line -- Text HOME to 741741

AI systems can produce plausible-sounding but incorrect guidance. For mental health, medical, legal, or financial decisions, always consult a qualified professional.

Your Rights & Our Transparency

Where GDPR or CCPA applies, you have the right to access, correct, and delete personal data held by an AI provider you use through a hosted service. Each vendor has its own process for exercising these rights. The EU AI Act also imposes transparency obligations on general-purpose AI models, and those obligations apply to many of the models listed here.

Tech Jacks Solutions maintains editorial independence. This ranking was not sponsored, reviewed, or approved by any organization named here. We receive no affiliate commissions tied to these models. The order reflects independent leaderboard data and our own editorial judgment, and license details were checked against vendor documentation.