Voluntary AI safety commitments are becoming something else.
According to reporting from The Hindu and Tom’s Hardware, Google DeepMind, Microsoft, and xAI have agreed to allow US government pre-release testing of their AI models. The framework, which follows the CAISI pre-deployment review program announced May 5, is reportedly evolving from a voluntary commitment into a structured policy architecture, one that includes formal Model Access Agreements governing what testing entails and what obligations participation creates.
The administration appears to be formalizing what started as industry cooperation.
Anthropic’s reported friction:
Anthropic’s position is different. According to a single report from indiatimes.com that could not be independently confirmed, Anthropic has reportedly declined to unlock certain model capabilities for federal use. That report characterizes the result as a Pentagon supply chain risk designation, a formal classification that, if accurate, carries procurement and contracting implications beyond the immediate dispute. This claim requires independent confirmation before compliance teams treat it as established fact. The Anthropic-Pentagon dispute has been covered in multiple prior briefs; what appears new in this cycle is the characterization of that dispute’s current status.
Unanswered questions:
- What legal authority (executive order, statute, or agency rule) is the mandatory vetting framework invoking?
- What does a Model Access Agreement actually require of participating labs, and what triggers a hold on public release?
- If a lab is designated a supply chain risk, what are the procurement and contracting implications for enterprise deployers?
- Does “voluntary” CAISI participation create legal obligations that follow the lab even if the program becomes mandatory?
The Mythos context:
Secondary reporting has characterized Anthropic’s Mythos model, reportedly withheld from public release due to cybersecurity concerns, as a catalyst for the administration’s increased focus on pre-release vetting. That characterization has not been confirmed by official sources. What prior coverage has established: Mythos has been covered here, most recently in late April, in connection with its restricted-access architecture and reported NSA involvement. The “thousands of software vulnerabilities” capability attributed to Mythos in some reporting is a Wire inference, not a confirmed technical specification. Treat it as reported, not verified.
What the policy architecture reportedly involves:
The emerging structure appears to center on Model Access Agreements: formal frameworks governing what government testers can do with pre-release model access, what findings trigger holds on public release, and what obligations labs incur by participating. The legal authority for any mandatory version of this framework (executive order, agency rule, or statute) has not been confirmed in any brief to date. That gap matters for compliance teams modeling their exposure: voluntary participation in CAISI is categorically different from a mandatory legal obligation.
The compute pressure context:
Epoch AI’s tracking as of May 8 shows more than 30 models now exceed the 10^25 FLOP threshold used in EU regulations to define systemic risk. US policy uses different criteria, but the same compute acceleration dynamic applies: the population of models that would require vetting under any threshold-based system is growing rapidly. What begins as a framework for a handful of frontier models may need to scale faster than policymakers anticipated.
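For context on how a threshold-based system identifies covered models: training compute for dense transformers is commonly approximated as roughly 6 FLOP per parameter per training token. The sketch below applies that rule of thumb against the 10^25 FLOP line; the parameter and token counts are hypothetical, illustrative figures, not published specifications for any named model.

```python
# Illustrative sketch: checking whether an estimated training run crosses a
# compute threshold like the EU's 10^25 FLOP systemic-risk line.
# Uses the common ~6 FLOP per parameter per token approximation for dense
# transformers; the example figures below are hypothetical, not real specs.

EU_SYSTEMIC_RISK_THRESHOLD_FLOP = 1e25


def estimated_training_flop(parameters: float, training_tokens: float) -> float:
    """Rough training-compute estimate: ~6 FLOP per parameter per training token."""
    return 6.0 * parameters * training_tokens


def exceeds_threshold(parameters: float, training_tokens: float,
                      threshold: float = EU_SYSTEMIC_RISK_THRESHOLD_FLOP) -> bool:
    """True if the estimated training compute meets or exceeds the threshold."""
    return estimated_training_flop(parameters, training_tokens) >= threshold


if __name__ == "__main__":
    # Hypothetical frontier-scale run: 500B parameters trained on 10T tokens.
    params, tokens = 5e11, 1e13
    flop = estimated_training_flop(params, tokens)
    print(f"Estimated training compute: {flop:.2e} FLOP")
    print("Exceeds 10^25 FLOP threshold:", exceeds_threshold(params, tokens))
```

The point of the arithmetic is the scaling dynamic named above: as frontier runs grow, more models clear any fixed compute threshold, and the vetting population grows with them.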
What to watch:
Two things matter most in the near term. First: whether the White House issues a formal executive order or other legal instrument that converts voluntary CAISI participation into a mandatory requirement. Second: whether Anthropic’s reported Pentagon dispute resolves or escalates; the Wyden legislative effort and the federal court proceedings noted in prior coverage remain active. The divergence in lab postures is the editorial story today, but the legal mechanism is what determines whether that divergence has compliance consequences.
TJS synthesis:
The frontier AI field is not moving in lockstep on government access. Three labs have reportedly agreed; one has reportedly been designated a supply chain risk after reportedly declining. That divergence is worth watching closely, not just for what it says about Anthropic specifically, but for what it reveals about where the political fault lines will run when voluntary cooperation becomes mandatory. The compliance question worth asking now: does your organization’s AI supply chain include models from labs that have taken different postures on government access? If so, what’s your contingency if that posture affects model availability?
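One way a deployment or compliance team might operationalize that question is a simple dependency inventory: list the models the organization relies on, tag each vendor’s reported posture on government pre-release access, and flag dependencies with no fallback. This is a minimal sketch under stated assumptions; the vendor names, postures, and fallback choices are placeholders, not claims about any specific lab.

```python
# Minimal sketch of the supply-chain question above: inventory model
# dependencies, tag each vendor's reported posture on government pre-release
# access, and surface dependencies that lack a contingency. All names and
# postures here are hypothetical placeholders.

from dataclasses import dataclass


@dataclass
class ModelDependency:
    model: str
    vendor: str
    gov_access_posture: str        # e.g. "participating", "declined", "unknown"
    fallback_model: str | None = None


inventory = [
    ModelDependency("vendor-a-frontier", "Vendor A", "participating",
                    fallback_model="vendor-b-frontier"),
    ModelDependency("vendor-c-frontier", "Vendor C", "declined"),  # no fallback configured
]


def contingency_gaps(deps: list[ModelDependency]) -> list[ModelDependency]:
    """Dependencies whose vendor posture could affect availability and that have no fallback."""
    return [d for d in deps
            if d.gov_access_posture != "participating" and d.fallback_model is None]


for gap in contingency_gaps(inventory):
    print(f"No contingency for {gap.model} ({gap.vendor}, posture: {gap.gov_access_posture})")
```

The structure matters more than the code: the exercise forces an answer to the second question above, what happens to availability if a vendor’s posture on government access changes.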