Pillar C: Cybersecurity of AI SystemsC11

Agentic AI Security

Agent architectures & threat surface, tool/action security, delegation & permission escalation, memory & context poisoning, multi-agent system security.

Practice C11 (311 questions)

Part of Pillar C: Cybersecurity of AI Systems · Cybersecurity of AI Systems groups the disciplines that share methods, tools, and threat models with Agentic AI Security.

What is Agentic AI Security?

Agentic AI security addresses the unique threat landscape that emerges when AI systems operate autonomously — making decisions, calling tools, delegating to sub-agents, and taking actions in the real world with minimal human oversight. Unlike traditional LLM chatbots that generate text responses, AI agents can execute code, browse the web, send emails, modify databases, manage infrastructure, and chain together multi-step workflows, dramatically expanding the blast radius of any vulnerability.

Agent architectures introduce novel attack surfaces beyond prompt injection. Tool security is critical — if an agent can call APIs, execute shell commands, or access file systems, then compromising the agent's decision-making grants the attacker the agent's full permissions. Delegation chains create transitive trust risks where a compromised sub-agent can influence parent agent behavior. Memory poisoning attacks inject malicious instructions into an agent's persistent memory or context, creating time-delayed attacks that activate in future sessions.

Securing agentic systems requires rethinking traditional security models. Least-privilege tool access, sandboxed execution environments, human-in-the-loop approval for high-risk actions, cryptographic verification of delegation chains, and adversarial testing of agent decision-making are all essential. The field is nascent but rapidly becoming critical as organizations deploy AI agents for customer service, code generation, security operations, and business process automation.

Why it matters

AI agents have real-world authority to take actions, not just generate text. A compromised agent is not just an information leak — it's an active threat with the permissions and capabilities of the systems it can access.

Agentic AI security is the frontier of AI security, extending concepts from LLM security, AI infrastructure security, and AI safety into autonomous systems that act in the world. As AI agents become the primary interface between AI models and enterprise systems, securing them becomes existential.

Layer 2

Control Access & Trust

Decide who or what can do what, enforce it cryptographically, constrain AI behaviour.

Other domains in this layer

A6Identity & Access Management A3Zero Trust Architecture A15Cryptography C8AI Safety & Alignment

See how this layer connects to the rest of the domain map →

Standards and frameworks

OWASP Top 10 for LLM Applications — LLM08 Excessive AgencyOWASP

NIST AI 600-1 (Generative AI Risk Profile)NIST

MITRE ATLAS — Agentic AI TacticsMITRE

Curated resources

Authoritative sources we ground Agentic AI Security questions in — frameworks, research, guides, and tools.

OWASPguide

Certifications that signal this domain

Credentials whose blueprint meaningfully covers this domain. Core means centrally covered; also touched means present in the blueprint but not the primary focus.