Threat Modeling the AI Agent: Architecture, Threats & Monitoring

Update: 2025-11-11

Description

Are we underestimating how the agentic world is impacting cybersecurity? We spoke to Mohan Kumar, who did production security at Box for a deep dive into the threats of true autonomous AI agents.

The conversation moves beyond simple LLM applications (like chatbots) to the new world of dynamic, goal-driven agents that can take autonomous actions. Mohan took us through why this shift introduces a new class of threats we aren't prepared for, such as agents developing new, unmonitorable communication methods ("Jibber-link" mode).

Mohan shared his top three security threats for AI agents in production:

Memory Poisoning: How an agent's trusted memory (long-term, short-term, or entity memory) can be corrupted via indirect prompt injection, altering its core decisions.
Tool Misuse: The risk of agents connecting to rogue tools or MCP servers, or having their legitimate tools (like a calendar) exploited for data exfiltration.
Privilege Compromise: The critical need to enforce least-privilege on agents that can shift roles and identities, often through misconfiguration.

Guest Socials -⁠ ⁠⁠⁠⁠⁠⁠⁠⁠Mohan's Linkedin

If you want to watch videos of this LIVE STREAMED episode and past episodes - Check out our other Cloud Security Social Channels:

If you are interested in AI Cybersecurity, you can check out our sister podcast -⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠ AI Security Podcast⁠

Questions asked:

(00:00 ) Introduction(01:30 ) Who is Mohan Kumar? (Production Security at Box)(03:30 ) LLM Application vs. AI Agent: What's the Difference?(06:50 ) "We are totally underestimating" AI agent threats(07:45 ) Software 3.0: When Prompts Become the New Software(08:20 ) The "Jibber-link" Threat: Agents Ditching Human Language(10:45 ) The Top 3 AI Agent Security Threats(11:10 ) Threat 1: Memory Poisoning & Context Manipulation(14:00 ) Threat 2: Tool Misuse (e.g., exploiting a calendar tool)(16:50 ) Threat 3: Privilege Compromise (Least Privilege for Agents)(18:20 ) How Do You Monitor & Audit Autonomous Agents?(20:30 ) The Need for "Observer" Agents(24:45 ) The 6 Components of an AI Agent Architecture(27:00 ) Threat Modeling: Using CSA's MAESTRO Framework(31:20 ) Are Leaks Only from Open Source Models or Closed (OpenAI, Claude) Too?(34:10 ) The "Grandma Trick": Any Model is Susceptible(38:15 ) Where is AI Agent Security Evolving? (Orchestration, Data, Interface)(42:00 ) Fun Questions: Hacking MCPs, Skydiving & Risk, Biryani

Resources mentioned during the episode:

Mohan’s Udemy Course -AI Security Bootcamp: LLM Hacking Basics

Andre Karpathy's "Software 3.0" Concept

"Jibber-link Mode" Video

CrewAI Framework

OWASP Top 10 for LLM Applications

Cloud Security Alliance (CSA) MAESTRO Framework

Comments

In Channel

Threat Modeling the AI Agent: Architecture, Threats & Monitoring

2025-11-1147:20

AI is already breaking the Silos Between AppSec & CloudSec

2025-11-0401:11:37

AI Agents for SOC: Hype Curve vs. Measurable ROI

2025-10-2836:21

Can You Build an AI SOC with Claude Code? The Reality vs. Hype

2025-10-2147:39

Incident Response of Kubernetes and how to Automate Containment

2025-10-1052:22

The Truth About AI in the SOC: From Alert Fatigue to Detection Engineering

2025-10-0345:39

The Security Gaps in AWS Bedrock & Azure AI You Need to Know

2025-09-2355:06

The Evolution of Email Security: From Pre-Breach to Post-Breach Protection

2025-09-1630:02

Using AI to Fix Your Cloud Security Backlog beyond Visibility

2025-09-0948:40

Your SecOps Team Can't Save Your Cloud: A New Blueprint for Security.

2025-08-2747:03

New Identity Blueprint for a Future with Cloud & AI

2025-08-2249:44

AI for SOC Automation: A Blueprint for the New world of Incident Response

2025-08-0852:39

The Truth About Agentic AI in the SOC: Reality vs. Hype

2025-08-0752:39

Understanding a $10B Fraud Vector in Cloud-Native Workflows

2025-07-2244:42

How BT Tackled 180 Years of Legacy to Build a Passwordless Future

2025-07-1719:51

Why Security Can Be Stricter: A Zero Trust Approach to AppSec with AI

2025-07-1545:42

Guide to Hybrid Cloud & Bare Metal Secret Management

2025-07-0932:23

"Escape-Proof" Cloud: How Block built an Automated Approach to Egress Control

2025-07-0140:27

Prioritizing Cloud Security: How to Decide What to Protect First

2025-06-2341:08

Migrating from “Tick Box" Compliance to Automating GRC in a Multi-Cloud World

2025-06-1728:48

00:00

Threat Modeling the AI Agent: Architecture, Threats & Monitoring

Cloud Security Podcast Team

#box-pro-ellipsis-176322247782558{-webkit-line-clamp:2;}Threat Modeling the AI Agent: Architecture, Threats & Monitoring

Threat Modeling the AI Agent: Architecture, Threats & Monitoring

Cloud Security Podcast Team

Threat Modeling the AI Agent: Architecture, Threats & Monitoring