Agent Identity: NHIs, SPIFFE/SPIRE, and the Blended Identity Model
Why treating every AI agent as a Non-Human Identity with cryptographic credentials, lifecycle governance, and blended access control is the foundation of agentic security
Every enterprise security program is built on identity. Authentication, authorization, access control, audit trails — they all start with answering one question: who is requesting access? For decades, that question was synonymous with human users. Your identity stack managed employees, contractors, and occasionally partners. Then came Non-Human Identities (NHIs) — service accounts, API keys, machine certificates, CI/CD tokens — and the ratio shifted dramatically.
NHIs outnumber human identities by 17:1 in average enterprises according to the Veza 2026 State of Identity & Access Report, with cloud-native architectures pushing ratios beyond 100:1 according to Obsidian Security research, and as high as 500:1 in advanced cloud-native architectures [1]. The same research found that 50% of organizations experienced security breaches tied to compromised machine identities — including API keys and TLS certificates — in the past year. Industry surveys report that 72% of organizations suffered at least one certificate-related outage, and only 12% have achieved automated lifecycle management for their non-human identities [2].
Traditional human IAM — quarterly access reviews, static entitlements, manual provisioning — was never designed for this scale. It fails for standard machine identities. For AI agents, the failure is catastrophic. Agents are autonomous: they make decisions and take actions without waiting for human approval. They are dynamic: their capabilities change with each prompt, tool integration, or model update. They delegate: a single agent can spawn sub-agents or chain requests across dozens of downstream systems. And they operate at machine speed: by the time your quarterly access review catches an over-provisioned agent, the damage window is measured in months.
The identity problem for AI agents is not an incremental extension of existing NHI management. It is a qualitative shift that requires new primitives: cryptographic identity, lifecycle governance tied to agent behavior rather than calendar cycles, and a blended identity model that evaluates both the agent and its human principal in a single authorization decision.
Human identity management uses the Joiner-Mover-Leaver (JML) lifecycle: onboard when hired, adjust when roles change, offboard when they leave. Agent identity requires the same rigor — but adapted for entities that are created programmatically, evolve through capability changes rather than job transfers, and can be replicated or forked without HR involvement. The four phases below map the JML pattern to agent-specific controls. Each phase has mandatory gates. Skipping gates creates the orphaned identities, credential sprawl, and permission creep that become breach vectors [1][2].
Four-Phase Agent Identity Lifecycle
Every agent entering your environment must receive a unique, cryptographically verifiable identity at creation time. This is not a service account name or an API key — it is a SPIFFE-style identity document (SVID) that binds the agent to its creation context. The registration record includes: model version and provider, hosting environment (cloud region, Kubernetes namespace, container image hash), the human owner who authorized creation, the agent's declared purpose, and the initial least-privilege policy defining what tools and data the agent can access. Without this registration gate, you cannot distinguish between legitimate agents and unauthorized deployments.
Every agent must have a named human owner who is personally accountable for its behavior. Shared ownership (“the team owns it”) means nobody owns it. When the owner changes roles or leaves the organization, transfer controls must trigger mandatory re-attestation: does the new owner understand the agent's capabilities? Do the agent's permissions still align with the new owner's business context? Orphan detection runs on a continuous schedule — any agent whose owner record is stale, whose sponsoring business unit has changed, or whose last attestation is overdue gets flagged for immediate review. Industry research consistently finds that orphaned NHIs are among the most common breach vectors because they retain access long after their business justification has expired [1].
This is where agent identity governance diverges most sharply from human IAM. Human access reviews happen quarterly or annually. Agent recertification must be event-driven: triggered by changes in the agent's capabilities, environment, or behavior — not by calendar dates. When an agent gains a new tool integration, that is a mover event requiring recertification. When an agent is promoted from dev to production, that is a mover event requiring new policies. Behavioral analytics provide the continuous monitoring layer: if an agent's tool invocation patterns drift from its baseline, that drift is an automatic trigger for policy review. Calendar-driven reviews miss the 90-day window between reviews where an agent can accumulate excessive permissions unchecked [2].
Agent retirement is more complex than human offboarding because agents leave behind artifacts that humans do not: persistent memory stores, cached tool credentials, delegation chains to other agents, and inbound references from systems that expect the agent to be available. The retirement workflow must execute in order: first, revoke all credentials and tokens so the agent cannot authenticate. Second, remove all outbound access grants to tools, APIs, and data stores. Third, block all inbound invocation paths so other agents or orchestration layers cannot trigger the retired agent. Fourth, sanitize memory and any locally cached data, applying data retention policies. Fifth, preserve the complete audit trail for compliance and forensic purposes. Finally, monitor for residual risk: are other systems still attempting to invoke the retired agent? Those invocation attempts may indicate undocumented dependencies that need remediation.
Click any phase to expand operational details and control requirements.
SPIFFE (Secure Production Identity Framework for Everyone) is a set of open standards for assigning cryptographic identities to workloads in distributed systems. SPIRE (SPIFFE Runtime Environment) is the reference implementation. Together, they solve the fundamental identity problem for AI agents: how do you prove that this specific agent, running in this specific environment, is the agent it claims to be — without relying on static secrets? [3]
The SPIFFE architecture assigns every workload a SPIFFE ID — a URI in the format spiffe://trust-domain/path/to/workload — and issues a SPIFFE Verifiable Identity Document (SVID) to prove that identity. SVIDs are short-lived X.509 certificates or JWTs that expire and auto-rotate, eliminating the static credential problem that plagues traditional NHI management. When an agent needs to authenticate to a tool, API, or peer agent, it presents its SVID. The receiving service validates the certificate chain back to the trust domain root. No passwords, no API keys, no long-lived tokens.
How SPIRE Works for Agents
SPIFFE/SPIRE was designed for microservices workload identity and is well-established in Kubernetes and cloud-native environments. Its adoption for AI agent workloads is emerging — recommended by security researchers and being incorporated into commercial identity platforms, though documented production case studies of standalone SPIFFE/SPIRE deployments specifically for agents are still limited. The migration pattern mirrors what drove microservice adoption: organizations start with static credentials, suffer a breach or outage from credential exposure, and then migrate to cryptographic identity. The difference for agents is the urgency. A compromised microservice credential typically grants access to one service. A compromised agent credential grants access to every tool, data store, and downstream system the agent can reach — which, according to the confused deputy attack pattern, may include systems the agent was never intended to access [4].
Traditional IAM answers one question at a time: “Who is this user?” or “What is this workload?” Human IAM systems (Okta, Entra ID, Ping) authenticate people. Machine identity platforms (CyberArk, HashiCorp Vault, SPIFFE/SPIRE) authenticate workloads. These two systems rarely intersect because humans and machines operate in separate contexts.
AI agents break this separation. An agent is a workload — it runs as a process, consumes compute, and authenticates to services. But it also acts on behalf of a human — the user who invoked it, the team that deployed it, the business owner who authorized its capabilities. When an agent requests access to a resource, the authorization decision must evaluate both identities simultaneously: the agent's workload identity AND the human principal's identity, at the same request, at the same time [5].
Aembit calls this pattern the Blended Identity Model — evaluating agent workload identity and human principal identity together in a single authorization decision. It answers the question traditional IAM cannot: “Is this specific user, using this specific agent, authorized to access this specific resource right now?”
Three Identity Evaluation Models
- SSO, MFA, RBAC, ABAC
- Quarterly access reviews
- Static entitlements per role
- No workload context in decisions
- Blind to agent capabilities
- X.509 certificates, JWTs, SVIDs
- Auto-rotation on TTL expiry
- Service-to-service mTLS
- No human principal context
- All users get same agent permissions
- Dual-identity evaluation per request
- Context-dependent access scoping
- Engineers get Agent+Jira, not Agent+VulnScanner
- Access narrows when either context changes
- Effective permissions = intersection of both
The practical impact is significant. Consider an enterprise where both engineering and security teams use Claude Desktop with MCP tool integrations. Under traditional IAM, the Claude Desktop agent has a fixed set of tool permissions. Under blended identity, the same agent instance running for an engineer can access Jira and GitHub but not the vulnerability scanner, while the same agent running for a security analyst can access the vulnerability scanner but not the production deployment pipeline. The agent's effective permissions are the intersection of its own capability set and the invoking user's authorization scope.
When either identity changes context, access narrows automatically. If the engineer moves to a different team, their new role may not include Jira access — and the agent's Jira tool integration is immediately scoped out for that user. If the agent gains a new tool integration, the blended model requires that both the agent AND the user are authorized for that tool before access is granted. This is the principle of least privilege applied to composite identities [5][6].
Cryptographic identity and blended access control require an enforcement point. In enterprise agent deployments, that enforcement point is the Identity Gateway — an intermediary that sits between agents and their target systems, handling authentication, authorization, and credential injection without exposing secrets to the agent itself.
Identity Gateways like the Aembit MCP Identity Gateway and the Solo.io AgentGateway implement what is called secretless credential exchange. The agent presents its identity token (SVID, JWT, or OAuth token bound to the blended identity context). The gateway validates the token, evaluates the blended identity policy, and — if authorized — injects a just-in-time (JIT) credential for the target system. The agent never holds the credential for the downstream service. It never sees it. The credential is minted for a single request and expires immediately after [5][7].
Gateway Credential Exchange Flow
For MCP deployments, the identity gateway covers all MCP server connections — every tool the agent can access goes through the gateway. For A2A (Agent-to-Agent) communication, a centralized gateway enforces mTLS, RBAC, DLP, and schema validation for inter-agent traffic. This solves the fundamental problem: agents should not hold secrets for the systems they interact with. The gateway holds the secrets. The agent holds only its own identity [5][7].
Identity gateways enable a containment capability that static credential management cannot: runtime policy enforcement. Because every agent request flows through the gateway, access decisions are made at the moment of execution — when the agent calls an MCP tool or triggers an API — not at deployment time. A single policy change can revoke a specific agent's access globally, across all connected systems, in real time.
This is fundamentally different from traditional credential rotation. When you rotate an API key, you invalidate the old key and issue a new one — but the agent may have cached the old key, or other agents may share the same key. When you revoke an agent's identity through the gateway, the revocation is targeted and complete. No credentials to rotate. No shared secrets to track. The agent's SVID is revoked at the trust domain level, and gateways reject subsequent requests from that identity. Propagation speed depends on architecture: active CRL pushing achieves near-real-time revocation, while TTL-based SVID expiry bounds the window to the remaining TTL. In-flight multi-step operations may complete their current step before revocation takes effect, so containment procedures must include transaction rollback and partial-state cleanup for workflows interrupted mid-execution [5].
For incident response, this capability transforms the containment phase. Instead of hunting for every API key an agent might hold, you execute a single action: suspend the agent's identity. Then trace all actions through delegation chains to identify scope of impact. Then analyze decision patterns to understand what was accessed. Then implement containment before resuming operations. The identity gateway's audit log provides the complete forensic record of every request the agent made, which user authorized each request, and which downstream systems were accessed [5][7].
The kill switch is not an emergency-only tool. Runtime policy enforcement also enables graduated response. Instead of full termination, you can narrow an agent's permissions in real time: restrict it to read-only access, block specific tool categories, or require human approval for all requests above a risk threshold. This graduated approach maintains service continuity while containing the threat, buying time for investigation without the business impact of a full shutdown.
Agent identity is not a standalone security concern. It maps directly to the major AI governance frameworks that enterprises are already implementing. The governance stack — NIST AI RMF, ISO 42001, and the EU AI Act — all include provisions that implicitly or explicitly require the identity controls described in this article. The table below maps each framework requirement to the specific identity capability that satisfies it [8][9][10].
Identity Controls by Framework
| Framework | Requirement | Identity Capability | Implementation |
|---|---|---|---|
| NIST AI RMF GV-1.6 | AI system inventory with sufficient detail for risk management | Agent registry with identity metadata | SPIFFE ID per agent, registration phase captures model version, owner, environment, capabilities |
| ISO 42001 A.6.2.6 | Operation and monitoring of AI systems | Identity lifecycle as a managed operational concern | JML lifecycle governance with event-driven recertification, orphan detection, formal retirement |
| EU AI Act Art. 12 | Automatic logging enabling monitoring and post-market surveillance | Identity-anchored audit trails | Gateway audit logs capture agent identity, user identity, operation, and policy decision per request |
| OWASP ASI T9 | Identity Spoofing — agent impersonation and credential theft | Cryptographic identity prevents impersonation | SPIFFE/SPIRE SVIDs with workload attestation; no static credentials to steal or replay |
| CSA MAESTRO L5 | Deployment and Infrastructure layer security controls | Identity controls at infrastructure layer | mTLS everywhere, identity gateways for MCP/A2A, JIT credential injection, runtime policy enforcement |
The OWASP Agentic Security Initiative identifies identity spoofing (T9) as a distinct threat category, noting that agents without cryptographic identity are vulnerable to impersonation attacks where a malicious process claims to be a legitimate agent and inherits its permissions [4]. SPIFFE/SPIRE directly addresses this: workload attestation proves the agent's identity through environment verification, not through shared secrets that can be stolen. The BBOM complements identity by documenting what the agent is authorized to do, while the identity infrastructure proves that this is the agent authorized to do it.
Agent identity is the foundational layer that every other agentic security control depends on. You cannot enforce least-privilege without knowing what the agent is. You cannot build audit trails without knowing who triggered the agent. You cannot contain a compromised agent without the ability to revoke its identity globally. Every security capability discussed across the Secure pillar — from threat modeling to incident response to tool misuse prevention — assumes that identity is solved.
The organizations that get agent identity right will deploy autonomous systems with confidence: every agent is cryptographically verified, every action is attributable to both an agent and a human principal, every capability change triggers recertification, and every credential is just-in-time and single-use. The organizations that skip this layer will discover that an unidentified agent with static API keys and no lifecycle governance is the most dangerous Non-Human Identity in their entire enterprise.
For the governance integration, see the Agent Governance Stack for the full NIST-ISO-EU AI Act mapping. For the threat context that identity controls defend against, start with the Agentic AI Threat Landscape. For the documentation standard that complements identity, see the BBOM guide. For the protocol layer where identity gateways operate, see the MCP deep dive. Visit the Security News Center for the latest developments in NHI security.
- NHIs outnumber humans by 17:1 on average, exceeding 100:1 in cloud-native environments. According to Obsidian Security research, 50% of organizations experienced breaches tied to compromised machine identities. AI agents amplify the risk through autonomy, delegation, and machine-speed operation.
- Agent identity requires a full JML lifecycle. Registration (cryptographic ID), ownership (named human accountable), lifecycle governance (event-driven recertification), and retirement (formal decommissioning with credential revocation, memory sanitization, and audit preservation).
- SPIFFE/SPIRE eliminates static credential sprawl. Short-lived SVIDs with workload attestation replace API keys and service account passwords. No long-lived secrets means no secrets to steal, rotate, or leak.
- The Blended Identity pattern evaluates agent + human together. Pioneered by Aembit, effective permissions are the intersection of the agent's capabilities and the invoking user's authorization scope. When either identity changes context, access narrows automatically.
- Identity Gateways enforce secretless credential exchange. Agents never hold downstream secrets. JIT credentials are minted per-request, scoped to a single operation, and expire immediately. The gateway provides the runtime policy enforcement point.
- Runtime policy enforcement is the identity-based kill switch. A single policy change revokes an agent's access globally across all connected systems in real time — no credential rotation, no shared-secret tracking, no hunting for cached keys.
- [1] Veza, "2026 State of Identity & Access Report" (17:1 NHI-to-human ratio); Obsidian Security, "What Are Machine Identities? Security Risks & Management Guide" (100:1+ cloud-native ratios, 50% breach rate via compromised machine identities). Statistics aggregated from multiple industry sources via MintMCP analysis.
- [2] Industry machine identity surveys (aggregated from Obsidian Security, MintMCP, and Keyfactor research). Certificate-related outage statistics (72%), automated lifecycle management adoption (12%). These figures reflect machine identities broadly — including API keys, TLS certificates, and service accounts — and extend to AI agent credentials.
- [3] SPIFFE: Secure Production Identity Framework for Everyone, CNCF. SPIFFE ID specification, SVID format (X.509 and JWT), trust domain architecture, workload attestation model. SPIRE reference implementation documentation.
- [4] OWASP Agentic AI Threats and Mitigations v1.0a, OWASP Agentic Security Initiative, February 2025. T9 (Identity Spoofing), T2 (Tool Misuse), Confused Deputy attack pattern.
- [5] Aembit, "MCP Identity Gateway: Secretless Credential Exchange for AI Agents," 2025. Blended identity model, JIT credential injection, runtime policy enforcement architecture.
- [6] CyberArk, "Securing Non-Human Identities in AI Agent Architectures," 2024. Per-user-per-agent access scoping, identity intersection model, credential lifecycle management.
- [7] Solo.io, "AgentGateway: Centralized Identity and Policy Enforcement for A2A Traffic," 2025. mTLS enforcement, RBAC, DLP, schema validation for inter-agent communication.
- [8] EU AI Act (Regulation 2024/1689), European Parliament and Council, 2024. Article 12 (automatic logging and record-keeping for high-risk AI systems).
- [9] NIST AI Risk Management Framework (AI RMF 1.0), NIST AI 100-1, January 2023. GV-1.6 (AI system inventory and risk management).
- [10] ISO/IEC 42001:2023, Information technology — Artificial intelligence — Management system. A.6.2.6 (operation and monitoring), Clause 8.4 (operation of AI systems).
- [11] CSA MAESTRO: Multi-Agent Environment Security Threat and Risk Operations Framework, Cloud Security Alliance, 2025. L5 (Deployment & Infrastructure) identity controls and agent attestation requirements.
Continue the Secure pillar deep dive: The Agentic AI Threat Landscape maps the full threat surface including identity spoofing (T9). Tool Misuse and Excessive Agency covers the confused deputy pattern that blended identity prevents. Agent Incident Response covers kill switch activation and forensic analysis that identity infrastructure enables. For governance integration, see the BBOM documentation guide and the Agent Governance Stack. Or explore the full Agentic AI Hub. Organizations implementing agent identity infrastructure can engage our AI Governance and Risk Management consulting, and practitioners building identity security capabilities should explore the AI Security Specialist career path.