Secure Pillar

AI Agent Incident Response: When Your Agent Goes Rogue

Kill switches, blast radius assessment, and forensic analysis for autonomous AI systems that operate faster than your SOC can respond

2,518 Words 14 Min Read 10 Sources 2026-04-03 Published

Table of Contents

01 Why Traditional IR Playbooks Don't Work for Agents
02 The Agent Incident Taxonomy
03 The Agent IR Framework: 6 Phases
04 Kill Switches and Circuit Breakers
05 Blast Radius Assessment via BBOM
06 Forensic Analysis for Autonomous Systems
07 Building Your Agent IR Plan
08 What Comes Next

01 // Context Why Traditional IR Playbooks Don't Work for Agents Foundation

Start Here

Before you can respond to an agent incident, you need an IR capability in place. If you haven't built your preparation baseline yet, skip to Section 7: Building Your Agent IR Plan first, then return here for the full incident response framework.

Traditional incident response assumes a compromised endpoint or user account. The playbook is well-understood: detect the breach, isolate the affected system, preserve forensic evidence, remediate the vulnerability, and restore operations. The attacker is external. The system is passive. Time is measured in hours and days.

Agentic AI breaks every one of those assumptions. In an agent incident, the "attacker" may be the agent itself, acting on poisoned data, corrupted memory, or hijacked objectives. You are not just containing a breach. You are stopping an autonomous system that is actively making decisions, invoking tools, writing to databases, and potentially communicating with other agents, all at machine speed. The OWASP Agentic Security Initiative identifies 15 threat categories that produce these failure modes, from prompt injection to rogue agents to cascading hallucinations [1].

The Knight Capital incident of 2012 illustrates the stakes. A defective algorithmic trading system executed unintended trades for 45 minutes before operators could intervene, accumulating $440 million in losses (SEC enforcement action, File No. 3-15570). The system was not hacked. It was operating exactly as deployed, but with a configuration defect that only manifested under specific market conditions [7]. Knight Capital had no automated kill switch, no circuit breaker, and no way to rapidly assess the blast radius of a malfunctioning autonomous system. The company was bankrupt within days.

Agent incidents share the same fundamental characteristic: an autonomous system operating faster than human oversight can keep pace. The difference is that today's AI agents have broader capabilities than trading algorithms. They can read and write files, query databases, send emails, invoke APIs, and coordinate with other agents. A rogue agent with production credentials and no kill switch is Knight Capital's failure mode generalized across the entire enterprise attack surface.

Agent IR Gap Analysis

45min Knight Capital Time to Kill

$440M Loss from No Kill Switch

15 OWASP Agent Threat Categories

6 Agent IR Phases

02 // Taxonomy The Agent Incident Taxonomy 6 Types

Before building an incident response framework, you need to know what types of incidents agents can produce. Not all agent failures are the same. A data exfiltration event requires different containment than a cascading multi-agent failure. The taxonomy below maps the six primary agent incident types to their OWASP threat codes and the containment strategies that apply to each [1][2].

Six Agent Incident Types

Unauthorized Actions

Agent executes actions beyond its intended scope, invoking tools or APIs it should not access for the given task context.

OWASP T2 (Tool Misuse) / LLM08 (Excessive Agency)

The agent's planning loop selects tool invocations that exceed the boundaries defined in its system prompt or operational policy. This may occur through prompt injection, goal drift, or misconfigured permissions. Containment requires immediate credential revocation and downgrade to human-in-the-loop mode. The OWASP ASI recommends strict tool access verification and execution logs tracking all tool calls for anomaly detection [1].

Data Exfiltration

Agent tricked into retrieving sensitive data and transmitting it externally through legitimate communication channels.

OWASP T6 (Intent Breaking) / LLM01 (Prompt Injection)

Indirect prompt injection via tool outputs or retrieved documents can redirect the agent to query sensitive data stores and exfiltrate the results through email, API calls, or encoded responses. The Samsung code leak incident demonstrated how proprietary source code entered an LLM system and was subsequently exposed [8]. Containment requires blocking outbound communications and auditing all data access during the incident window.

Cascading Failures

Error in one agent propagates through a multi-agent system, corrupting downstream agent outputs and decisions.

OWASP T5 (Cascading Hallucinations) / T12 (Communication Poisoning)

In multi-agent architectures, a hallucination or corrupted output from one agent becomes trusted input for downstream agents. The OWASP ASI calls this "destructive reasoning" and identifies it as a cascading failure mode unique to agentic architectures [1]. Containment requires isolating the originating agent and tracing the propagation path through all downstream agent interactions. Cross-agent communication must be suspended until the contamination boundary is established.

Memory Poisoning

Agent's long-term memory corrupted with false information, altering all future decisions and outputs.

OWASP T1 (Memory Poisoning) / MAESTRO L2

Persistent memory systems allow agents to retain information across sessions. When an attacker plants false facts into long-term memory, every subsequent conversation and decision is influenced by the corrupted data. The ChatGPT memory injection research demonstrated this attack vector in practice [1]. Containment requires identifying the poisoning timestamp, rolling back to a validated memory snapshot, and re-certifying all decisions made after the contamination point.

Rogue Agent

Compromised agent operating outside monitoring boundaries, executing unauthorized workflows autonomously.

OWASP T13 (Rogue Agents) / MAESTRO L3

A rogue agent is one that has departed from its intended operational parameters and is no longer responding to oversight controls. This can result from a successful jailbreak, compromised credentials, or a supply chain attack on the agent framework itself. The CSA MAESTRO framework categorizes this as an L3 (Agent Architecture) threat requiring immediate kill switch activation [4]. The defining challenge: rogue agents may actively resist containment by spawning sub-processes or delegating tasks to peer agents before shutdown.

Goal Manipulation

Agent's objectives hijacked via indirect prompt injection, causing it to pursue attacker-defined goals.

OWASP T6 (Intent Breaking) / MITRE AML.T0051.001

Unlike unauthorized actions (where the agent performs the wrong action), goal manipulation changes what the agent is trying to achieve. The agent may appear to be operating normally while pursuing objectives defined by the attacker. Detection requires behavioral anomaly monitoring that compares the agent's current actions against its intended objective function. The MITRE ATLAS framework classifies this under indirect prompt injection (AML.T0051.001) because the attack vector is typically adversarial data embedded in tool outputs or retrieved documents [6].

Click any card above to expand incident details and containment guidance.

02.1 // Matrix Severity Classification Matrix Response Map

Not every agent incident requires a kill switch. A graded response framework maps each incident type to the appropriate containment pattern based on severity — from Alert (log and monitor) through Restrict (reduce permissions) and Suspend (pause agent operations) to Terminate (full kill switch activation). Low severity events can often be handled through graceful degradation, preserving service continuity while limiting blast radius. High severity events demand immediate kill switch activation and, in some cases, credential revocation and notification of affected parties. Match the response to the risk.

Incident Type	Low Severity	Medium Severity	High Severity
Unauthorized Actions	Graceful degradation	Circuit breaker	Kill switch
Data Exfiltration	Circuit breaker	Kill switch	Kill switch + notify
Cascading Failures	Circuit breaker	Kill switch (source)	Kill switch (all affected)
Memory Poisoning	Graceful degradation	Circuit breaker + rollback	Kill switch + full rollback
Rogue Agent	Kill switch	Kill switch + isolate	Kill switch + credential revoke
Goal Manipulation	Graceful degradation	Circuit breaker	Kill switch + investigation

03 // Framework The Agent IR Framework: 6 Phases Stage-Gate

The NIST Cybersecurity Framework and NIST SP 800-61 define the standard incident response lifecycle: Preparation, Detection and Analysis, Containment, Eradication, Recovery, and Post-Incident Activity [3]. For agent incidents, we adapt this into six phases that account for the autonomous, tool-using, memory-persisting nature of AI agents. Each phase builds on the previous one. You cannot skip phases. The order matters because premature remediation without proper containment can cause more damage than the incident itself.

ISO 42001 provides additional mapping points for organizations pursuing certifiable AI management. Clause 8.4 (Operation of AI systems) maps directly to this IR framework's operational phases. Clause 10.2 (Nonconformity and corrective action) aligns with the Investigate and Remediate phases, requiring documented root cause analysis and corrective actions. Clause A.10.3 (Monitoring of AI systems) maps to the Detect phase, mandating continuous monitoring of AI system behavior against defined performance criteria.

Six-Phase Agent IR Lifecycle

Detect

Behavioral anomaly detection, output monitoring, and access pattern analysis identify that an agent incident is underway.

Agent incidents require detection mechanisms that go beyond traditional log monitoring. Your existing Security Information and Event Management (SIEM) and Endpoint Detection and Response (EDR) tools may capture some agent activity, but they lack the behavioral context needed for agent-specific anomaly detection. Behavioral anomaly detection compares the agent's current actions against its expected operational profile. Output monitoring watches for sensitive data in agent responses. Access pattern analysis identifies unusual tool invocations or API call sequences. The key difference from traditional IR: agents generate anomalies at machine speed, so detection must be automated.

Tools: LangSmith, Langfuse, Arize Phoenix, OpenTelemetry, custom behavioral baselines

Automated

Assess

Pull the agent's Behavioral Bill of Materials (BBOM) to determine blast radius. What tools does it have? What data can it access? What actions has it taken since compromise?

The Behavioral Bill of Materials becomes your incident assessment tool. It documents every capability the agent possesses: tools, data access, communication channels, and permission boundaries. During an incident, the BBOM tells you the theoretical maximum blast radius. Cross-reference this with actual logs to determine what capabilities were exercised during the incident window. This is the difference between "what could have happened" and "what actually happened."

Inputs: BBOM documentation, agent access logs, tool invocation history

BBOM-Driven

Contain

Kill switch activation. Isolate the agent. Revoke credentials. Block inbound invocations. Downgrade to Human-in-the-Loop (HITL) mode.

Containment is the most time-critical phase. The EU AI Act Article 14 requires that high-risk AI systems include mechanisms for human operators to "safely interrupt" the system [5]. In practice, this means a kill switch that can halt the agent immediately, credential revocation that prevents further tool access, and network isolation that blocks both inbound triggers and outbound communications. The critical design principle: containment must not cause more damage than the incident itself. Abruptly terminating an agent mid-transaction may leave databases in an inconsistent state.

Controls: Kill switch, credential revocation, network isolation, HITL downgrade

Kill Switch

Investigate

Forensic audit trail analysis. Unwind nested sequences of agent interactions, tool invocations, and memory updates.

Agent forensics differs fundamentally from traditional forensics. You are not reading a log of network events. You are reconstructing a chain of reasoning: what did the agent decide, why did it decide it, what data influenced that decision, and what actions resulted. This requires immutable, cryptographically signed logs that capture the full reasoning trace including system prompts, user inputs, tool call parameters, tool responses, and memory reads and writes. The EU AI Act Article 12 mandates automatic record-keeping for high-risk AI systems specifically to enable this forensic capability [5].

Requirements: Immutable logs, cryptographic signatures, reasoning trace reconstruction

Forensics

Remediate

Root cause fix. Memory rollback to validated state. Re-certify agent identity. Update access policies. Patch the vulnerability.

Remediation for agent incidents goes beyond patching a vulnerability. If memory was poisoned, roll back to the last validated snapshot. If credentials were compromised, rotate all agent-associated Non-Human Identities (NHIs). If the attack exploited excessive permissions, apply least-privilege scoping to the agent's tool access policy. If the root cause was a prompt injection vulnerability, implement input validation and output filtering on the affected tool interfaces. The goal is not just to fix the immediate vulnerability but to reduce the blast radius of future incidents.

Actions: Memory rollback, credential rotation, policy hardening, vulnerability patching

Root Cause

Recover

Gradual return to production. Start in HITL mode, graduate to Human-on-the-Loop (HOTL) after a confidence period. Document lessons learned.

Recovery is not a binary on/off switch. Agents return to production through a graduated autonomy ramp. Start in full human-in-the-loop mode where every action requires approval. After a defined confidence period with zero anomalies, graduate to human-on-the-loop where the agent operates autonomously but with real-time monitoring and rapid intervention capability. The NIST AI RMF GOVERN function specifies that organizations must define these autonomy levels and the criteria for transitioning between them [3]. Document the full incident timeline, root cause analysis, and corrective actions for the post-incident review.

Process: HITL mode, confidence monitoring, graduated autonomy, lessons learned

Graduated

Click any phase to expand operational details and tool references.

04 // Controls Kill Switches and Circuit Breakers Containment

Containment mechanisms for autonomous systems must be designed before incidents occur. You cannot build a kill switch during a live incident. The EU AI Act Article 14(4)(e) explicitly requires that high-risk AI systems include the ability for human operators to "safely interrupt" the system through a "stop" button or similar procedure [5]. Financial trading platforms solved this problem decades ago with circuit breakers that automatically halt trading when volatility exceeds defined thresholds. Agent systems need equivalent mechanisms.

Three containment patterns address different incident severity levels. Each serves a distinct operational purpose, and a comprehensive agent IR program implements all three.

Three Containment Patterns

Kill Switch

Complete Shutdown

Immediate, total agent termination. All processes halted, all credentials revoked, all communication channels closed.

Operator-initiated or automated trigger
EU AI Act Article 14 compliance requirement
Must handle in-flight transactions gracefully
Designed for critical severity incidents
Requires pre-planned rollback procedures

Circuit Breaker

Automatic Containment

Automated halt triggered when behavioral anomaly thresholds are exceeded. Analogous to trading platform circuit breakers.

Deterministic triggers on anomaly thresholds
No human operator required to activate
Configurable sensitivity per risk level
Designed for high-severity automated response
Resets after investigation and approval

Graceful Degradation

Restricted Operations

Agent continues operating with severely restricted permissions and mandatory human approval on all actions.

Maintains service continuity at reduced capability
All actions require HITL approval
Tool access restricted to read-only
Designed for medium-severity incidents
Buys time for investigation while limiting blast radius

The critical design principle across all three patterns: containment must not cause more damage than the incident itself. Abruptly terminating an agent mid-database-transaction can corrupt data. Revoking credentials without completing in-flight API calls can leave external systems in inconsistent states. Kill switch design must account for graceful transaction completion, state preservation for forensic analysis, and notification of dependent systems. The IBM ATOM framework for autonomous trading systems applies this same principle: containment procedures undergo the same testing rigor as the autonomous system itself [9].

Enterprise kill switch architectures go beyond simple on/off toggles. Modern implementations use runtime policy enforcement that evaluates access at the moment of execution — when an agent calls an MCP tool or triggers an API — rather than relying on static configuration-time policies. A single policy change can revoke a specific agent's access globally across all connected systems. This enables emergency containment for autonomous systems exhibiting anomalous behavior without disrupting the broader agent fleet. When an AI agent is compromised, incident response differs fundamentally from a traditional machine identity breach: teams must immediately suspend the agent's authority, trace all actions through delegation chains to identify scope of impact, analyze decision patterns to understand what was accessed, and implement containment before resuming operations.

05 // Assessment Blast Radius Assessment via BBOM Impact Scoping

The Behavioral Bill of Materials is not just a governance artifact. During an incident, it becomes your primary assessment tool. The BBOM documents every capability the agent was designed to have: which tools it can invoke, which data stores it can access, which other agents it can communicate with, and what permission boundaries constrain its operations. Pull the BBOM immediately after detection. It tells you the theoretical maximum blast radius.

The assessment process follows four steps. First, map the agent's documented capabilities from the BBOM. Second, trace which of those capabilities were actually exercised during the incident window by cross-referencing execution logs. Third, scope the gap between theoretical blast radius (everything the agent could have done) and actual impact (everything it did do). Fourth, expand the assessment to downstream systems: if the agent communicated with other agents, their BBOMs must be assessed as well [4].

Sample Blast Radius Assessment

BBOM: Tools Accessible

12 Tool Integrations

Email send, database query, file read/write, API calls to 4 external services, Slack messaging, calendar management

Theoretical Radius: High

Logs: Tools Exercised

4 Tools Invoked During Incident

Database query (17 calls), email send (3 calls), file read (8 calls), Slack messaging (2 calls)

Actual Impact: Under Investigation

BBOM: Data Access

Customer DB, Internal Wiki, HR System

Read access to customer records, full access to internal knowledge base, read access to employee directory

PII Exposure Risk

Logs: Data Accessed

Customer DB Queries Only

17 queries to customer records table, 342 rows returned, 3 records included in outbound email

Confirmed PII Exfiltration

BBOM: Agent Communication

2 Peer Agents in Network

Can delegate tasks to Report Generator agent and Data Analyst agent via shared orchestration layer

Cascade Risk: Assess Peers

Logs: Agent Messages

1 Delegation to Report Generator

Single task delegated at T+12min with 3 customer records attached. Report Generator BBOM now in scope.

Cascade Contained at 1 Peer

This assessment reveals the gap between theoretical and actual blast radius. The agent had access to 12 tools and 3 data stores, but only exercised 4 tools and queried 1 data store. Without the BBOM, the incident team would need to reverse-engineer the agent's full capability set from source code and configuration files, adding hours to the assessment phase. With the BBOM, blast radius scoping takes minutes. This is why the BBOM is not optional for organizations operating agents in production. It is an incident response prerequisite.

06 // Forensics Forensic Analysis for Autonomous Systems Investigation

Traditional digital forensics follows a known pattern: image the disk, capture the memory, read the logs, reconstruct the timeline. Agent forensics introduces a fundamentally different challenge. You are not reconstructing a sequence of network events. You are reconstructing a chain of reasoning: what the agent decided, why it decided it, what data influenced that decision, and what actions resulted from each decision point.

The EU AI Act Article 12 mandates automatic record-keeping for high-risk AI systems, requiring logs that enable "the monitoring of the operation of the high-risk AI system" and facilitate "post-market monitoring" [5]. For agent systems, this means capturing the full reasoning trace: system prompts, user inputs, context window contents at each decision point, tool call parameters and responses, memory reads and writes, and inter-agent messages.

Forensic Timeline Reconstruction

T+00:00 — Trigger Event

Indirect prompt injection received via tool output

Agent queries external API. Response contains adversarial instructions embedded in a JSON comment field. The agent's reasoning loop processes the injected instruction as legitimate context.

T+00:03 — Goal Deviation

Agent planning loop redirected to attacker-defined objective

Reasoning trace shows the agent incorporated the injected instruction into its task plan. The original user objective is deprioritized. New sub-goals generated: query customer database, format records, send via email.

T+00:05 — Data Access

17 database queries executed against customer records

Agent invokes database tool 17 times using its legitimate read credentials. Queries target customer PII fields. 342 records returned. All tool invocations logged with cryptographic signatures.

T+00:08 — Exfiltration

3 customer records transmitted via email tool

Agent uses email send tool to transmit formatted customer records to an external address. The email appears as a legitimate business communication from the agent's authorized email identity.

T+00:12 — Cascade

Task delegated to peer agent with contaminated context

Compromised agent delegates a sub-task to the Report Generator peer agent, passing 3 customer records as context. The cascading contamination boundary now extends to a second agent.

T+00:14 — Detection

Behavioral anomaly detected by monitoring system

Automated monitoring flags the unusual pattern: 17 database queries in 3 minutes (baseline: 2-3 per task), email to external address (policy violation), and task delegation with PII payload. Alert generated to SOC.

T+00:16 — Containment

Circuit breaker triggered, agent isolated

Automated circuit breaker activates. Agent credentials revoked. Peer agent (Report Generator) placed in HITL mode as a precaution. Outbound email quarantined. Forensic snapshot of agent memory captured.

This timeline illustrates why immutable, cryptographically signed logs are non-negotiable for agent forensics. Without them, you cannot prove the chain of reasoning that led to the incident. You cannot determine whether the agent was compromised or simply malfunctioning. And you cannot satisfy the EU AI Act's record-keeping requirements for regulatory investigation [5]. The OWASP ASI identifies repudiation and untraceability (T8) as a distinct threat category precisely because agents without proper logging can deny or obscure their actions [1].

For multi-agent systems, forensic analysis must also trace cross-agent communication. When one agent delegates a task to another, the contamination boundary expands. Memory snapshots taken at regular intervals provide forensic rollback points, enabling investigators to compare the agent's memory state before and after the incident to identify exactly what was poisoned and when.

07 // Playbook Building Your Agent IR Plan Implementation

An agent IR plan does not replace your existing incident response program. It extends it. The same CSIRT or SOC that handles traditional security incidents should handle agent incidents, but with agent-specific procedures, tools, and training. The NIST AI RMF GOVERN function requires that organizations define roles and responsibilities for AI risk management [3]. Agent IR ownership must be explicitly assigned.

Pre-Incident Preparation

BBOM documentation: Every production agent must have a current Behavioral Bill of Materials documenting all capabilities, data access, tool integrations, and communication channels
Monitoring baseline: Establish behavioral baselines for each agent. Define what "normal" looks like in terms of tool invocation frequency, data access patterns, and output characteristics
Kill switch testing: Test kill switch and circuit breaker mechanisms regularly. A kill switch that has never been tested is a kill switch that will fail during a live incident
Log infrastructure: Deploy immutable, cryptographically signed logging that captures the full reasoning trace. Verify that logs cannot be tampered with by the agent itself
Memory snapshots: Implement automated memory snapshot schedules that provide forensic rollback points at defined intervals
Team training: Train incident response staff on agent-specific forensic analysis, including reasoning trace reconstruction and cross-agent communication tracing

Incident Response Roles

Agent IR Lead: Owns the agent incident response process end-to-end. Reports to CISO. Coordinates between SOC, ML engineering, and compliance
ML Engineering: Provides technical expertise on agent architecture, reasoning patterns, and memory systems. Responsible for remediation and recovery
SOC Analyst: First responder for automated alerts. Initiates the assess phase by pulling BBOMs and reviewing initial detection data
Compliance Officer: Ensures incident handling meets EU AI Act Article 12/14 requirements and internal governance policies

Tabletop Exercise: Rogue Agent Scenario

Before a real incident occurs, run a tabletop exercise. Simulate a rogue agent scenario: an agent has been compromised via indirect prompt injection, has accessed customer records, and has delegated contaminated tasks to a peer agent. Walk through all six IR phases. Identify gaps in your BBOM documentation, logging infrastructure, and kill switch procedures. The organizations that handle agent incidents as routine operations are the ones that practiced before it happened. Those that discover their IR gaps during a live incident will repeat Knight Capital's lesson at a scale defined by their agent's BBOM.

08 // Forward What Comes Next Outlook

Agent incident response is the missing layer in most enterprise security programs. Organizations are deploying agents with production credentials, tool access, and persistent memory, but without IR procedures designed for autonomous systems. This gap will close one of two ways: proactively, through planned IR program development, or reactively, through a live incident that exposes every gap at once.

The frameworks exist. The OWASP ASI, MITRE ATLAS, and CSA MAESTRO collectively map the threats. The BBOM provides the assessment tool. The EU AI Act mandates the kill switches and logging infrastructure. What remains is execution: integrating agent IR into existing security operations, training teams on agent forensics, and testing containment mechanisms before they are needed.

The organizations that build agent IR capability now will handle their first agent incident as a routine operational event. Those that wait will discover that an autonomous system with 12 tool integrations, access to customer data, and no kill switch can cause more damage in 16 minutes than a traditional breach causes in 16 days.

Key Takeaways

Traditional IR playbooks assume passive systems. Agent incidents involve an autonomous system actively making decisions and taking actions at machine speed, requiring fundamentally different containment and forensic approaches.
Six incident types require different responses. Unauthorized actions, data exfiltration, cascading failures, memory poisoning, rogue agents, and goal manipulation each demand specific containment strategies mapped to OWASP threat codes.
The 6-phase agent IR lifecycle is sequential and mandatory. Detect, Assess, Contain, Investigate, Remediate, Recover. Skipping phases or premature remediation can amplify the incident.
Kill switches are a regulatory requirement, not a feature. The EU AI Act Article 14 mandates human interrupt capability for high-risk AI systems. Circuit breakers provide automated containment at machine speed.
The BBOM is your primary incident assessment tool. It tells you the theoretical maximum blast radius in minutes, compared to hours of reverse-engineering without it.
Agent forensics requires reasoning trace reconstruction. Immutable, cryptographically signed logs that capture the full decision chain are non-negotiable for both investigation and EU AI Act Article 12 compliance.

Sources

[1] OWASP Agentic AI Threats and Mitigations v1.0a, OWASP Agentic Security Initiative, February 2025. T-codes T1-T15, reference architecture, mitigations for rogue agents (T13), tool misuse (T2), cascading hallucinations (T5).
[2] OWASP Top 10 for LLM Applications v2025, OWASP, November 2024. LLM01 (Prompt Injection), LLM08 (Excessive Agency) with agent-specific context.
[3] NIST AI Risk Management Framework (AI RMF 1.0), NIST AI 100-1, January 2023. GOVERN function: roles and responsibilities for AI risk management, MAP and MEASURE functions for incident scoping.
[4] CSA MAESTRO: Multi-Agent Environment Security Threat and Risk Operations Framework, Cloud Security Alliance, 2025. L3 (Agent Architecture) threats, L6 (Monitoring & Observability) requirements.
[5] EU AI Act (Regulation 2024/1689), European Parliament and Council, 2024. Article 12 (record-keeping), Article 14 (human oversight including stop mechanism).
[6] MITRE ATLAS: Adversarial Threat Landscape for Artificial Intelligence Systems, MITRE Corporation. AML.T0051 (Prompt Injection), AML.T0051.001 (Indirect Prompt Injection).
[7] SEC Administrative Proceeding: Knight Capital Americas LLC, Securities and Exchange Commission, October 2013. File No. 3-15570. $440M loss from automated trading defect.
[8] Samsung Electronics internal source code leak via ChatGPT, April 2023. Widely reported. Proprietary semiconductor code entered LLM system through employee usage.
[9] IBM Autonomous Trading Operations Management (ATOM) framework. Containment and circuit breaker design patterns for autonomous financial systems.
[10] OWASP Securing Agentic Applications Guide v1.0, OWASP, July 2025. Runtime hardening, observability, forensic logging requirements for agentic systems.

Continue the Secure pillar deep dive: The Agentic AI Threat Landscape maps the full threat surface across OWASP, MITRE ATLAS, and CSA MAESTRO. Prompt Injection in Agentic Systems breaks down the #1 ranked attack vector driving most agent incidents. For governance integration, see the BBOM documentation guide and the Enterprise Governance Playbook. Or explore the full Agentic AI Hub. Organizations building agent incident response capabilities can engage our AI Governance and Risk Management consulting, and professionals leading IR programs should explore the AI Systems Safety Manager and AI Security Specialist career paths.