Sonnet 4.5 & the AI Plateau Myth — Sholto Douglas (Anthropic)
Description
Sholto Douglas, a top AI researcher at Anthropic, discusses the breakthroughs behind Claude Sonnet 4.5—the world's leading coding model—and why we might be just 2-3 years from AI matching human-level performance on most computer-facing tasks.
You'll discover why RL on language models suddenly started working in 2024, how agents maintain coherency across 30-hour coding sessions through self-correction and memory systems, and why the "bitter lesson" of scale keeps proving clever priors wrong.
Sholto shares his path from top-50 world fencer to Google's Gemini team to Anthropic, explaining why great blog posts sometimes matter more than PhDs in AI research. He describes the culture at the big AI labs and why Anthropic is laser-focused on coding (it's the fastest path to both economic impact and AI-assisted AI research). He also explains how the training pipeline is still "held together by duct tape," with massive room to improve, and why every new benchmark shows continuous rapid progress with no plateau in sight.
Bold predictions: individuals will soon manage teams of AI agents working 24/7, robotics is about to experience coding-level breakthroughs, and policymakers should urgently track AI progress on real economic tasks. A clear-eyed look at where AI stands today and where it's headed in the next few years.
Anthropic
Website - https://www.anthropic.com
Twitter - https://x.com/AnthropicAI
Sholto Douglas
LinkedIn - https://www.linkedin.com/in/sholto
Twitter - https://x.com/_sholtodouglas
FIRSTMARK
Website - https://firstmark.com
Twitter - https://twitter.com/FirstMarkCap
Matt Turck (Managing Director)
LinkedIn - https://www.linkedin.com/in/turck/
Twitter - https://twitter.com/mattturck
(00:00) Intro
(01:09) The Rapid Pace of AI Releases at Anthropic
(02:49) Understanding Opus, Sonnet, and Haiku Model Tiers
(04:14) Sholto's Journey: From Australian Fencer to AI Researcher
(12:01) The Growing Pool of AI Talent
(16:16) Breaking Into AI Research Without Traditional Credentials
(18:29) What "Taste" Means in AI Research
(23:05) Moving to Google and Building Gemini's Inference Stack
(25:08) How Anthropic Differs from Other AI Labs
(31:46) Why Anthropic Is Laser-Focused on Coding
(36:40) Inside a 30-Hour Autonomous Coding Session
(38:41) Examples of What AI Can Build in 30 Hours
(43:13) The Breakthroughs That Enabled 30-Hour Runs
(46:28) What's Actually Driving the Performance Gains
(47:42) Pre-Training vs. Reinforcement Learning Explained
(52:11) Test-Time Compute and the New Scaling Paradigm
(55:55) Why RL on LLMs Finally Started Working
(59:38) Are We on Track to AGI?
(01:02:05) Why the "Plateau" Narrative Is Wrong
(01:03:41) Sonnet's Performance Across Economic Sectors
(01:05:47) Preparing for a World of 10–100x Individual Leverage