Five of the world’s leading frontier AI labs now submit models for government review before public release. Google DeepMind, Microsoft, and xAI announced agreements with CAISI, the Commerce Department’s Center for AI Standards and Innovation, on May 5, according to multiple independent reports. They join OpenAI and Anthropic, both of which had prior CAISI evaluation agreements in place. Reporting from The Hill confirms that CAISI has completed more than 40 model evaluations to date.
CAISI is not a standalone agency but an office within the National Institute of Standards and Technology. Its evaluations are government pre-deployment reviews: assessments conducted for national security and safety purposes before a model reaches the public. This isn’t independent third-party evaluation in the academic or benchmark sense; it’s the federal government’s mechanism for understanding what frontier models can do before those capabilities become publicly available. The distinction matters: these agreements give the US government early access and structured assessment, not certification or approval authority.
According to coverage of the announcement, the agreements cover joint safety assessments and research into cybersecurity risk mitigation. The specific scope of each lab’s agreement hasn’t been disclosed in detail. What has been confirmed across multiple sources is the core structure: pre-release access for government evaluators, with the stated goal of national security risk assessment.
Why this matters: the expansion from two labs to five changes the character of these agreements. When only Anthropic and OpenAI participated, the program looked like an arrangement between the government and the two labs with the deepest federal relationships. Five labs, spanning the dominant models in enterprise AI, consumer AI, and the emerging open-weights tier, paint a different picture, one that suggests the program is becoming a condition of operating at the frontier in the US market, whether through formal mandate or through the reputational and contractual dynamics of federal procurement.
The White House executive order restoring Anthropic’s federal access this spring and the CAISI agent standards initiative launched earlier this year are pieces of the same architecture: a federal government actively building evaluation infrastructure for AI systems it considers strategically significant. The CAISI expansion adds commercial pre-deployment review to that infrastructure.
What to watch: whether these agreements formalize into a published framework or remain informal arrangements. The White House has reportedly been drafting mandatory vetting legislation, a separate effort that, if enacted, would convert today’s voluntary agreements into statutory requirements. If that legislation advances, the five labs that entered voluntary agreements will have a procedural head start; labs that didn’t will face a compliance gap they can’t close overnight.
The question compliance teams at frontier labs should be asking isn’t whether CAISI evaluation will become mandatory. It’s whether their model documentation, safety assessment processes, and cybersecurity review protocols are ready for the structured scrutiny that government evaluation requires. The infrastructure for that scrutiny is now in place for five of the most consequential AI development programs in the world.