Diffusion LLMs - The Fastest LLMs Ever Built | Stefano Ermon, cofounder of Inception Labs

Update: 2025-10-09

Description

Stefano Ermon is a cofounder of Inception Labs and an associate professor at Stanford. Inception is developing a new type of AI model called diffusion LLMs.

Stefano's favorite book: If on a Winter's Night a Traveler (Author: Italo Calvino)

(00:01) Introduction
(00:38) What are autoregressive LLMs and how do they work
(02:28) How diffusion LLMs rethink generation
(04:02) The ceiling of autoregressive LLMs: cost, latency, reliability
(06:19) Why diffusion LLMs are commercially viable now
(09:12) Parallel refinement: how diffusion models generate text
(12:05) Understanding diffusion steps and efficiency
(13:49) Hardest engineering challenges at Inception
(15:23) From research to production: the power of data
(16:24) Where diffusion LLMs still lag behind
(18:18) Evaluations and benchmarks for diffusion LLMs
(20:20) Developer experience and OpenAI-compatible API
(21:47) Economics and GPU efficiency
(23:38) Hardware and runtime stack
(24:58) Competition and the evolving diffusion LLM landscape
(27:01) Where diffusion will win first: coding and agentic systems
(30:13) How diffusion changes infra, serving, and hardware design
(33:04) What’s next at Inception: reasoning and multimodality
(35:20) Rapid Fire Round

--------
Where to find Stefano Ermon:

LinkedIn: https://www.linkedin.com/in/ermon/

--------
Where to find Prateek Joshi:

Research column: https://www.infrastartups.com
Newsletter: https://prateekjoshi.substack.com
Website: https://prateekj.com
LinkedIn: https://www.linkedin.com/in/prateek-joshi-infinite
X: https://x.com/prateekvjoshi
