
What Is Mistral AI? Europe's Answer to OpenAI

In April 2023, three researchers from Europe's top AI labs built a company in under two weeks. Twelve months later, Mistral AI was valued at $13.8 billion and its models were outrunning competitors twice their size. That is not a funding story. It is an architecture story.

Mistral AI is a Paris-based AI research company that builds open-weight large language models designed to deliver frontier-class performance at a fraction of the compute cost. Founded on April 28, 2023, by Arthur Mensch (CEO), Guillaume Lample (Chief Scientist), and Timothée Lacroix (CTO) -- the same trio who co-authored Meta's original LLaMA models -- Mistral has grown to 350 employees while maintaining a research-first, sovereignty-first identity no American lab can replicate.

  • €12B -- valuation (2025/2026)
  • 1,428 -- Large 3 LMSYS Elo at launch
  • $0.50 -- per 1M input tokens, Large 3
  • 350 -- employees (2025)

What Is Mistral AI?

Mistral AI is a European AI company headquartered in Paris, France, that builds and deploys large language models under a mix of open-weight and enterprise licenses. Its core mission: create portable, customizable AI that does not sacrifice performance for scale -- and that keeps data under European jurisdiction.

The company positions itself as the standard-bearer for European AI sovereignty. Every architectural choice, from its sparse mixture-of-experts designs to its Apache 2.0 licensing strategy, flows from that premise. Where OpenAI and Anthropic operate under US law and the US Cloud Act, Mistral operates under French law with GDPR-native infrastructure and an upcoming Paris-area datacenter housing 13,800 NVIDIA GB300 GPUs across 44 megawatts of capacity.

The name itself signals intent. "Mistral" refers to the powerful cold wind sweeping from southern France into the Mediterranean -- forceful, directional, and impossible to ignore. The founders chose it deliberately.

The European Sovereignty Angle

Data sovereignty is not a compliance checkbox for Mistral -- it is the product. European enterprises operating under GDPR face a structural problem when using US-hosted AI: the US Cloud Act gives American authorities potential access to data stored by US companies, regardless of where servers sit physically. Mistral's French legal domicile and European infrastructure address that gap directly.

The company is not classified as a high-risk provider under the EU AI Act and is a signatory to the voluntary AI Code of Practice. HSBC signed a multi-year strategic partnership with Mistral in December 2025 covering 20,000 developers. BNP Paribas is among enterprise clients drawn to the residency positioning.

Mistral received $830 million in debt financing for its Paris-area datacenter (operational mid-2026) and closed a €1.2 billion deal for EcoDataCenter Sweden (opening 2027). French President Emmanuel Macron has publicly cited the Mistral-NVIDIA collaboration as evidence of European AI capability.

The Microsoft dimension introduces nuance. In February 2024, Microsoft invested €15 million and made Mistral models available on Azure AI. The UK's Competition and Markets Authority cleared the deal as a non-merger. EU sovereignty advocates raised concerns about the partnership, and those concerns remain unresolved.

Important for sovereignty buyers: Mistral available via Azure AI is subject to standard US Cloud Act provisions — Microsoft must comply with US government data requests. For true EU data sovereignty, use Mistral's own infrastructure (La Plateforme, EU-hosted) or self-host on EU infrastructure. The Azure deployment is convenient but contradicts the sovereignty pitch.

The Mistral Model Lineup: From 7B to Small 4

Mistral's model progression tracks a specific engineering thesis: smaller models with smarter architectures beat larger dense models on most real-world tasks.

Mistral 7B (September 2023)

A 7.3 billion parameter dense transformer released under Apache 2.0 on September 27, 2023. Despite having roughly half the parameters of Meta's Llama 2 13B, it outperformed that model on standard benchmarks. The message was direct: parameter count is not performance.

Mixtral 8x7B (December 2023)

Where Mistral's architecture philosophy became concrete. Mixtral uses sparse mixture-of-experts (MoE): 8 expert networks total, 2 activated per token. Total parameters: 46.7 billion. Active per token: 12.9 billion. Context window: 32,000 tokens. License: Apache 2.0. The result: inference cost of a 12B model with reasoning quality of a much larger dense system.
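To make the 8-experts/2-active mechanics concrete, here is a minimal, illustrative sketch of top-2 gating in plain Python. This is not Mistral's actual router code, just the routing idea: a learned gate scores every expert for each token, and only the two highest-scoring experts run, with their outputs blended by renormalized softmax weights.

```python
import math
import random

def top2_route(gate_logits, k=2):
    """Pick the top-k experts for one token and renormalize their
    gate scores with a softmax over just the selected logits."""
    top = sorted(range(len(gate_logits)),
                 key=lambda i: gate_logits[i], reverse=True)[:k]
    shift = max(gate_logits[i] for i in top)           # numerical stability
    exps = [math.exp(gate_logits[i] - shift) for i in top]
    total = sum(exps)
    return [(i, e / total) for i, e in zip(top, exps)]

random.seed(0)
logits = [random.gauss(0, 1) for _ in range(8)]  # router scores for 8 experts
routing = top2_route(logits)                     # only 2 of 8 experts fire
print(routing)
```

Per token, only the two selected experts' feed-forward weights are read and multiplied, which is why a 46.7B-parameter model can have roughly the inference cost of a ~13B dense one.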

Mistral Large 2 (July 2024)

A 123 billion parameter dense transformer with a 128,000 token context window and MMLU of 84.0%. Mistral Large 2 ships under the Mistral Research License -- NOT Apache 2.0. Commercial deployment requires an enterprise agreement. Any source claiming Large 2 is open-source is incorrect on both the license and the OSI definition.

Mistral Large 3 (December 2025)

After Large 2's restrictive license drew community criticism, Mistral returned to Apache 2.0 with Large 3 -- a deliberate strategic reversal. Large 3 is a sparse MoE: 675 billion total parameters, 41 billion active per token, 256,000 token context window, and native image understanding. Trained from scratch on 3,000 NVIDIA H200 GPUs. LMSYS Elo: approximately 1,428 -- the #2 open-source non-reasoning model (using LMSYS Chatbot Arena's category label) at launch. API pricing: $0.50/$1.50 per million tokens (input/output).

Mistral Small 4 (March 2026)

The most recent flagship. Small 4 uses 128 experts with 4 active per token. Total parameters: 119 billion. Active per token: 6-8 billion. Context window: 256,000 tokens. License: Apache 2.0. It consolidates reasoning (Magistral), multimodal vision (Pixtral), and coding (Devstral) into one model -- 40% faster, 3x more requests per second vs its predecessor. LMSYS Elo: approximately 1,410. API pricing: $0.20/$0.60 per million tokens.

  • Apr 2023 -- Mistral AI founded: Mensch, Lample, and Lacroix launch from Paris; ex-DeepMind and Meta AI LLaMA authors
  • Sep 2023 -- Mistral 7B released: 7.3B dense transformer, Apache 2.0; outperforms Llama 2 13B on key benchmarks
  • Dec 2023 -- Mixtral 8x7B launches: sparse MoE, 46.7B total / 12.9B active; introduces Mistral's signature architecture
  • Jul 2024 -- Mistral Large 2 released: 123B parameters, 128K context; ships under Mistral Research License (not Apache 2.0)
  • Dec 2025 -- Mistral Large 3, return to open: 675B total / 41B active, Apache 2.0, native vision, 1,428 Elo; strategic reversal
  • Mar 2026 -- Mistral Small 4 launched: 119B total / 6-8B active, 128 experts, Apache 2.0, unified reasoning + vision + coding

Open-Weight vs. Open-Source: The Distinction That Matters

Mistral describes its models as open-weight. That is the accurate term. It is not the same as open-source.

The OSI Open Source AI Definition 1.0, published in October 2024, requires that a model's training data be accessible for inspection and use -- not just the model weights. Mistral releases weights freely. It does not disclose training data. Under the OSI definition, Mistral's models do not qualify as fully open-source.

Critics have used the term "openwashing" to describe companies that benefit from the open-source brand without meeting full transparency requirements. That characterization is fair by OSI standards. The practical effect for most users: Apache 2.0 models can be downloaded, deployed, and fine-tuned commercially without API costs or data leaving your infrastructure. The training data audit trail is not part of that package.

  • Apache 2.0 (open-weight): Mistral 7B, Mixtral 8x7B, Large 3, Small 4
  • Restricted licenses: Large 2 (Research License -- commercial requires enterprise agreement), Codestral (Non-Production License -- commercial by request)

API Pricing and Self-Hosting Economics

Mistral's La Plateforme API pricing as of April 2026:

| Model | Input (per 1M tokens) | Output (per 1M tokens) |
|---|---|---|
| Mistral Large 3 | $0.50 | $1.50 |
| Mistral Large 2 | $3.00 | $9.00 |
| Mistral Small 4 | $0.20 | $0.60 |
| Codestral | $1.00 | $3.00 |
| Ministral 3B | $0.04 | $0.04 |
| Embeddings | $0.01 | $0.01 |

Self-hosting open-weight models can cost 8-12x less than proprietary APIs, but only when two conditions hold: monthly volume exceeds 10 million tokens, and model quality within 10% of proprietary models is acceptable. Below that volume, the managed API is usually cheaper once infrastructure and operations overhead are factored in.
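The break-even arithmetic can be sketched directly. Every dollar figure below is a placeholder assumption (a blended proprietary-API price and a hypothetical GPU-node rental rate), not a Mistral or cloud-vendor quote; substitute your own numbers before drawing conclusions.

```python
# Break-even sketch: at what monthly volume does a fixed-cost
# self-hosted node undercut per-token API pricing? Every dollar
# figure here is a placeholder assumption -- substitute your quotes.
API_PRICE_PER_M = 5.00   # blended $ per 1M tokens, proprietary API (assumption)
NODE_PER_HOUR = 25.00    # hypothetical 8-GPU node rental, $/hour (assumption)
HOURS = 730              # hours in an average month

fixed = NODE_PER_HOUR * HOURS                  # self-hosting cost, volume-independent
break_even_m_tokens = fixed / API_PRICE_PER_M  # millions of tokens per month
print(f"node: ${fixed:.0f}/mo, break-even at {break_even_m_tokens:.0f}M tokens/mo")
```

The self-hosted side is a fixed cost, so the economics hinge entirely on volume: below the break-even token count the API wins, above it the node wins.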

Mistral Large 3 can run on a single node of 8x NVIDIA H100 or A100 GPUs with no inter-node parallelism required -- a meaningful operational simplification for enterprise infrastructure teams.
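A back-of-envelope memory check suggests the single-node claim relies on weight quantization. The 675B parameter count comes from the figures above; the byte-per-parameter values are standard precisions, and KV cache and activation memory are ignored, so treat the result as a lower bound on what fits.

```python
# Back-of-envelope: can Large 3's 675B weights fit on one 8x H100 node?
# Parameter count from the article; 80 GB per H100 is the standard SKU.
# KV cache and activations are ignored, so this is a lower bound.
PARAMS_B = 675            # total parameters, billions
GPUS, GPU_GB = 8, 80      # single node: 8x H100 80GB

results = {}
for precision, bytes_per_param in [("fp16", 2), ("fp8", 1), ("int4", 0.5)]:
    per_gpu_gb = PARAMS_B * bytes_per_param / GPUS  # 1B params * 1 B/param ~= 1 GB
    results[precision] = per_gpu_gb <= GPU_GB
    print(precision, round(per_gpu_gb, 1), "GB/GPU,",
          "fits" if results[precision] else "does not fit")
```

By this arithmetic, full fp16 weights alone would need about 169 GB per GPU, so a single 8x H100 node implies roughly 4-bit quantized weights (or higher-capacity GPUs such as the H200).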

Enterprise Products: Le Chat and Mistral Forge

Le Chat

Mistral's consumer and enterprise AI workspace. The team plan runs $24.99 per user per month (monthly) or $19.99 per user per month (annual), with shared knowledge bases, an admin console, and data sharing opt-out. The enterprise tier adds SAML SSO, zero data retention options, and connectors to Google Drive, OneDrive, SharePoint, GitHub, Google Calendar, and Gmail. Deployment options: self-hosted, private cloud, or Mistral-hosted.

Mistral Forge

Announced at NVIDIA GTC in March 2026, Mistral Forge goes beyond fine-tuning. It is an enterprise platform for full pre-training, post-training, and reinforcement learning on proprietary data. For organizations with large enough proprietary datasets to justify custom model development -- financial institutions, healthcare systems, legal networks -- Forge provides a path to truly private AI without starting from scratch.

  • La Plateforme API: REST API access to all Mistral models with per-token pricing
  • Le Chat: consumer and team chatbot workspace; free and team tiers available
  • Le Chat Enterprise: SAML SSO, zero data retention, Drive/OneDrive/GitHub connectors
  • Self-Hosted: Apache 2.0 models on your own GPU infrastructure; Large 3 on 8x H100
  • Azure AI: Mistral models via Microsoft Azure; Feb 2024 partnership, CMA cleared

Note: Azure AI deployments are subject to US Cloud Act jurisdiction. Evaluate for EU data sovereignty requirements.

Benchmark Performance

Note: Benchmark comparisons reflect scores at the time of Mistral Large 3's release (December 2025), when GPT-4o and Claude 3.5 Sonnet were the leading comparators. The competitive landscape shifts frequently -- check LMSYS Chatbot Arena and Papers With Code for current standings.

Mistral Large 3 benchmark results (Apache 2.0, December 2025, sourced from LMSYS Chatbot Arena and published model cards):

| Benchmark | Mistral Large 3 | GPT-4o | Claude 3.5 Sonnet |
|---|---|---|---|
| MMLU | ~85.5% | 88.7% | 88.7% |
| HumanEval (coding) | ~92.0% | 90.2% | -- |
| GSM8K (math reasoning) | ~93.6% | -- | -- |
| MATH | ~88.0% | -- | -- |
| LMSYS Chatbot Arena Elo | ~1,428 | -- | -- |
[Chart: MMLU vs HumanEval, Large 3 vs competitors -- MMLU: GPT-4o 88.7%, Claude 3.5 Sonnet 88.7%, Mistral Large 3 85.5%; HumanEval: Mistral Large 3 92.0%, GPT-4o 90.2%. Source: published model cards, 2025.]

On coding benchmarks, Large 3 leads GPT-4o. On MMLU, it trails GPT-4o and Claude 3.5 Sonnet by approximately 3 percentage points. LMSYS leaderboard context: GPT-5.5 sits at 1,506 Elo, Claude Opus 4.6 Thinking at 1,504, Gemini 3.1 Pro at 1,493. Mistral is elite tier. It is not #1.

The MATH benchmark jump from Large 2 (47.5%) to Large 3 (88.0%) reflects architectural changes that fundamentally improved mathematical reasoning -- not incremental tuning.

Who Should Use Mistral AI?

  • European Enterprise: GDPR-native, French legal domicile, no US Cloud Act exposure -- a structural advantage for EU-regulated organizations
  • Open-Source Developer: Apache 2.0 on Large 3 and Small 4 -- download, self-host, fine-tune, and build commercially with no API dependency
  • Cost-Sensitive Startup: $0.20/$0.60 per 1M tokens (Small 4) -- significantly below GPT-4o equivalents for token-intensive workloads
  • Healthcare / Legal / Finance: data residency requirements met via self-hosted deployment plus European infrastructure -- patient and client data stays in jurisdiction

Caveats to weigh:

  • Large 2 license restriction: Mistral Large 2 ships under the Mistral Research License -- NOT Apache 2.0. Commercial deployment requires a separate enterprise agreement. Confirm your model version before building production pipelines.
  • Open-weight, not fully open-source: Mistral releases model weights freely but does not disclose training data. Under OSI's Open Source AI Definition 1.0 (Oct 2024), Mistral does not meet the full open-source definition. Critics use the term "openwashing" -- a fair characterization per OSI standards.
  • MMLU slightly below top proprietary: Mistral Large 3 scores approximately 85.5% on MMLU vs. 88.7% for GPT-4o and Claude 3.5 Sonnet. The 3-point gap is real. For tasks where MMLU-class reasoning matters most, proprietary models currently lead.

Frequently Asked Questions

Is Mistral AI open-source?
Mistral releases model weights under Apache 2.0 for several models, but does not disclose training data. Under the OSI Open Source AI Definition 1.0 (October 2024), Mistral is "open-weight" but not fully open-source. Practically: you can download, deploy, and fine-tune models commercially -- but the training data audit trail is not part of the package.

Is Mistral Large 2 free for commercial use?
No. Mistral Large 2 ships under the Mistral Research License, which permits research and evaluation but requires an enterprise agreement for commercial use. Mistral Large 3 (December 2025) returned to Apache 2.0 -- a deliberate strategic reversal after Large 2's restrictions drew community criticism.

How much do Mistral's models cost?
Mistral Large 3 runs at $0.50/$1.50 per million input/output tokens -- significantly below comparable GPT-4o pricing. Small 4 at $0.20/$0.60 per million tokens is among the most competitive frontier-class pricing available. For deployments exceeding 10 million tokens per month, and where model quality within 10% of proprietary models is acceptable, self-hosting open-weight models can cost 8-12x less than proprietary APIs.

Where is Mistral AI headquartered?
Paris, France. Mistral is building a dedicated datacenter in the Paris area (operational mid-2026, 13,800 NVIDIA GB300 GPUs, 44 MW) and closed a €1.2 billion deal for EcoDataCenter Sweden opening 2027. This European infrastructure footprint is central to its sovereign AI positioning.

Can Mistral models be self-hosted?
Yes. Apache 2.0 models can be deployed on private infrastructure. Mistral Large 3 runs on a single node of 8x NVIDIA H100 or A100 GPUs with no inter-node parallelism required. Le Chat Enterprise supports self-hosted deployment with SAML SSO and zero data retention configurations.


Before You Use AI

Your Privacy

Mistral's La Plateforme API and Le Chat Enterprise offer zero data retention options. Free and team tiers may use conversation data to improve models -- check your plan's data policy. Self-hosted deployments keep all data on your infrastructure. Review Mistral's privacy policy before sending sensitive data to any cloud-hosted AI.

Mental Health & AI Dependency

AI tools are not substitutes for professional mental health support. If you or someone you know is in crisis: 988 Suicide & Crisis Lifeline (call or text 988), SAMHSA National Helpline 1-800-662-4357, or Crisis Text Line (text HOME to 741741). Review the NIST AI Risk Management Framework for organizational guidance.

Your Rights & Our Transparency

Under GDPR (EU) and CCPA (California), you have rights to access, correct, or delete personal data processed by AI systems. This article was written by a human editorial team with AI research assistance. No affiliate relationship with Mistral AI exists. Benchmark data is cited by source and date.
