CoreWeave Acquired OpenPipe — Kyle Corbitt on Reinforcement Learning & Reliable AI Agents | E150
Update: 2025-09-25
Description
CoreWeave just announced its acquisition of OpenPipe — a pivotal moment for reinforcement learning and reliable AI agents. Let’s take a step back and watch Kyle Corbitt, Co-founder and CEO of OpenPipe, talk about how reinforcement learning turns prototypes into production-ready systems. In this exclusive Imagine AI Live 25 talk, Kyle explains the “why, when, and how” of RL, walks through a case study of building an email assistant that outperformed frontier models, and shares lessons learned from designing environments and reward functions. With OpenPipe now joining forces with CoreWeave, the AI Hyperscaler™, the mission to scale reliable reinforcement learning is accelerating.
Chapters
(0:00) Introduction to OpenPipe and Reinforcement Learning
(0:38) The Steps to Training a Reliable Agent
(1:19) What is Reinforcement Learning?
(2:07) Why, When, and How to Use Reinforcement Learning
(3:30) How the Email Agent Works
(5:26) Initial Performance and Baselines
(7:42) Is Reinforcement Learning Practical?
(9:11) The First Rule of Fine-Tuning a Model
(10:11) When to Adopt Reinforcement Learning
(10:48) The Two Hard Problems of Reinforcement Learning
(11:14) Problem 1: Building a Realistic Environment
(13:38) Problem 2: The Reward Function
(15:36) The Training Loop
(16:47) Bonus: Optimizing for More Than Accuracy
(18:16) Guardrails: Dealing with Reward Hacking
(20:00) The Takeaway: Expanding the Envelope
(20:40) Final Thoughts and Q&A