Listen Top Shows Blog

New Talk: Building Olmo 3 Think

New Talk: Building Olmo 3 Think

Update: 2025-12-10

Share

Description

It’s finally here! The public (and most complete) version of my talk covering every stage of the process to build Olmo 3 Think (slides are available). I’ve been giving this, improving it, and getting great feedback at other venues such as The Conference on Language Modeling (COLM) & The PyTorch Conference.This involves changes and new considerations of every angle of the stack, from pretraining, evaluation, and of course post-training.

Most of the talk focuses on reinforcement learning infrastructure and evaluating reasoning models, with quick comments on every training stage. I hope you enjoy it, and let us know what to improve in the future!

Chapters

* 00:00:00 Introduction

* 00:06:30 Pretraining Architecture

* 00:09:25 Midtraining Data

* 00:11:08 Long-context Necessity

* 00:13:04 Building SFT Data

* 00:20:05 Reasoning DPO Surprises

* 00:24:47 Scaling RL

* 00:41:05 Evaluation Overview

* 00:48:50 Evaluation Reflections

* 01:00:25 Conclusions

Here’s the YouTube link:

This is a public episode. If you'd like to discuss this with other subscribers or get access to bonus episodes, visit www.interconnects.ai/subscribe

Comments

In Channel

Open models: Hot or Not with Nathan Lambert & Florian Brand

2025-12-1837:36

New Talk: Building Olmo 3 Think

New Talk: Building Olmo 3 Think

2025-12-1001:02:22

Olmo 3: America’s truly open reasoning models

Olmo 3: America’s truly open reasoning models

2025-11-2010:57

Why AI writing is mid

Why AI writing is mid

2025-11-1708:28

Interview: Ant Group's open model ambitions

Interview: Ant Group's open model ambitions

2025-11-1201:17:49

5 Thoughts on Kimi K2 Thinking

5 Thoughts on Kimi K2 Thinking

2025-11-0607:37

Burning out

2025-10-2510:09

How to scale RL

How to scale RL

2025-10-2013:01

The State of Open Models

The State of Open Models

2025-10-1647:04

Thoughts on The Curve

Thoughts on The Curve

2025-10-0711:58

ChatGPT: The Agentic App

ChatGPT: The Agentic App

2025-09-3009:24

Thinking, Searching, and Acting

Thinking, Searching, and Acting

2025-09-2209:22

Coding as the epicenter of AI progress and the path to general agents

Coding as the epicenter of AI progress and the path to general agents

2025-09-1816:18

On China's open source AI trajectory

On China's open source AI trajectory

2025-09-0913:37

Ranking the Chinese Open Model Builders

Ranking the Chinese Open Model Builders

2025-08-1712:41

Contra Dwarkesh on Continual Learning

Contra Dwarkesh on Continual Learning

2025-08-1510:04

GPT-5 and the arc of progress

GPT-5 and the arc of progress

2025-08-0710:41

gpt-oss: OpenAI validates the open ecosystem (finally)

gpt-oss: OpenAI validates the open ecosystem (finally)

2025-08-0513:36

Towards American Truly Open Models: The ATOM Project

Towards American Truly Open Models: The ATOM Project

2025-08-0422:12

Interviewing Ross Taylor on the state of AI: Chinese open models, scaling reasoning, useful tools, and what comes next

Interviewing Ross Taylor on the state of AI: Chinese open models, scaling reasoning, useful tools, and what comes next

2025-07-2901:14:40

00:00

00:00

x

New Talk: Building Olmo 3 Think

New Talk: Building Olmo 3 Think

Nathan Lambert