Exploring AI, APIs, and the Social Engineering of LLMs

Update: 2025-10-14

Description

Summary:

Timothy De Block is joined by Keith Hoodlet, Engineering Director at Trail of Bits, for a fascinating, in-depth look at AI red teaming and the security challenges posed by Large Language Models (LLMs). They discuss how prompt injection is effectively a new form of social engineering against machines, exploiting the training data's inherent human biases and logical flaws. Keith breaks down the mechanics of LLM inference, the rise of middleware for AI security, and cutting-edge attacks using everything from emojis and bad grammar to weaponized image scaling. The episode stresses that the fundamental solutions—logging, monitoring, and robust security design—are simply timeless principles being applied to a terrifyingly fast-moving frontier.

Key Takeaways

The Prompt Injection Threat

Social Engineering the AI: Prompt injection works by exploiting the LLM's vast training data, which includes all of human history in digital format, including movies and fiction. Attackers use techniques that mirror social engineering to trick the model into doing something it's not supposed to, such as a customer service chatbot issuing an unauthorized refund.
Business Logic Flaws: Successful prompt injections are often tied to business logic flaws or a lack of proper checks and guardrails, similar to vulnerabilities seen in traditional applications and APIs.
Novel Attack Vectors: Attackers are finding creative ways to bypass guardrails:
- Image Scaling: Trail of Bits discovered how to weaponize image scaling to hide prompt injections within images that appear benign to the user, but which pop out as visible text to the model when downscaled for inference.
- Invisible Text: Attacks can use white text, zero-width characters (which don't show up when displayed or highlighted), or Unicode character smuggling in emails or prompts to covertly inject instructions.
- Syntax & Emojis: Research has shown that bad grammar, run-on sentences, or even a simple sequence of emojis can successfully trigger prompt injections or jailbreaks.

Defense and Design

LLM Security is API Security: Since LLMs rely on APIs for their "tool access" and to perform actions (like sending an email or issuing a refund), security comes down to the same principles used for APIs: proper authorization, access control, and eliminating misconfiguration.
The Middleware Layer: Some companies are using middleware that sits between their application and the Frontier LLMs (like GPT or Claude) to handle system prompting, guard-railing, and filtering prompts, effectively acting as a Web Application Firewall (WAF) for LLM API calls.
Security Design Patterns: To defend against prompt injection, security design patterns are key:
- Action-Selector Pattern: Instead of a text field, users click on pre-defined buttons that limit the model to a very specific set of safe actions.
- Code-Then-Execute Pattern (CaMeL): The first LLM is used to write code (e.g., Pythonic code) based on the natural language prompt, and a second, quarantined LLM executes that safer code.
- Map-Reduce Pattern: The prompt is broken into smaller chunks, processed, and then passed to another model, making it harder for a prompt injection to be maintained across the process.
Timeless Hygiene: The most critical defenses are logging, monitoring, and alerting. You must log prompts and outputs and monitor for abnormal behavior, such as a user suddenly querying a database thousands of times a minute or asking a chatbot to write Python code.

Resources & Links Mentioned

Trail of Bits Research:
- Blog: blog.trailofbits.com
- Company Site: trailofbits.com
- Weaponizing image scaling against production AI systems
Call Me A Jerk: Persuading AI to Comply with Objectionable Requests
Securing LLM Agents Paper: Design Patterns for Securing LLM Agents against Prompt Injections.
Camel Prompt Injection
Defending LLM applications against Unicode character smuggling
Logit-Gap Steering: Efficient Short-Suffix Jailbreaks for Aligned Large Language Models
LLM Explanation: Three Blue One Brown (3Blue1Brown) has a great short video explaining how Large Language Models work.
Lakera Gandalf: Game for learning how to use prompt injection against AI
Keith Hoodlet's Personal Sites:
- Website: securing.dev and thought.dev

Support the Podcast:

Enjoyed this episode? Leave us a review and share it with your network! Subscribe for more insightful discussions on information security and privacy.

Contact Information:

Leave a comment below or reach out via the contact form on the site, email timothy.deblock[@]exploresec[.]com, or reach out on LinkedIn.

Check out our services page and reach out if you see any services that fit your needs.

Social Media Links:

[RSS Feed] [iTunes] [LinkedIn][YouTube]

<label class="newsletter-form-field-label title" for="email-yui_3_17_2_1_1704234756218_68248-field">Email Address</label>
<input autocomplete="email" class="newsletter-form-field-element field-element" id="email-yui_3_17_2_1_1704234756218_68248-field" name="email" type="email" />

<button class="
newsletter-form-button
sqs-system-button
sqs-editable-button-layout
sqs-editable-button-style
sqs-editable-button-shape
sqs-button-element--primary
" type="submit" value="Sign Up">

Sign Up

</button>

We respect your privacy.

Thank you!

</form>

Comments

In Channel

Exploring the Next Frontier of IAM: Shared Signals and Data Analytics

2025-12-0250:57

How to Close the Cybersecurity Skills Gap with a Student Powered SOC

2025-11-2530:44

What is the 2025 State of the API Report From Postman?

2025-11-1847:15

How AI Will Transform Society and Affect the Cybersecurity Field

2025-11-1147:55

[RERELEASE] How Macs get Malware

2025-11-0426:16

[RERELEASE] Why communication in infosec is important - Part 2

2025-10-2826:37

[RERELEASE] Why communication in infosec is important

2025-10-2128:00

Exploring AI, APIs, and the Social Engineering of LLMs

2025-10-1452:13

How to Prepare a Presentation for a Cybersecurity Conference

2025-10-0701:01:09

Exploring the Rogue AI Agent Threat with Sam Chehab

2025-09-2339:01

A conversation with Kyle Andrus on Info Stealers and Supply Chain Attacks

2025-09-1641:29

The Winding Path to CISO: Rob Fuller's Leadership Journey

2025-09-0944:30

Kate Johnson's Winding Path to a Director Role in Cybersecurity

2025-09-0256:05

LIVE: Unraveling the SharePoint Zero-Day Exploit (CVE-2025-53770)

2025-08-2638:27

How to Launch Your Own Cybersecurity Podcast

2025-08-1938:42

How BSides St Louis Can Help Take The Next Step in Cybersecurity

2025-08-1238:27

[RERELEASE] What it's like in the SECTF sound booth

2025-08-0526:33

[RERELEASE] How to network in information security - part 2

2025-07-2914:51

[RERELEASE] How to network in information security - part 1

2025-07-2217:11

[RERELEASE] What are BEC attacks?

2025-07-1527:48

00:00

1.0x

Exploring AI, APIs, and the Social Engineering of LLMs

#box-pro-ellipsis-176502046555777{-webkit-line-clamp:2;}Exploring AI, APIs, and the Social Engineering of LLMs

Summary:

Key Takeaways

The Prompt Injection Threat

Defense and Design

Resources & Links Mentioned

Support the Podcast:

Contact Information:

Social Media Links:

Exploring AI, APIs, and the Social Engineering of LLMs

Timothy De Block

Exploring AI, APIs, and the Social Engineering of LLMs