“Claude Sonnet 4.5 Is A Very Good Model” by Zvi

Update: 2025-10-02

Description

A few weeks ago, Anthropic announced Claude Opus 4.1 and promised larger announcements within a few weeks. Claude Sonnet 4.5 is the larger announcement.

Yesterday I covered the model card and related alignment concerns.

Today's post covers the capabilities side.

We don’t currently have a new Opus, but Mike Krieger confirmed one is being worked on for release later this year. For Opus 4.5, my request is to give us a second version that gets minimal or no RL, isn’t great at coding, doesn’t use tools well except web search, doesn’t work as an agent or for computer use and so on, and if you ask it for those things it suggests you hand your task off to its technical friend or does so on your behalf.

I do my best to include all substantive reactions I’ve seen, positive and negative, because right after model [...]

---

Outline:

(01:14 ) Big Talk

(02:53 ) The Big Takeaways

(04:55 ) On Your Marks

(09:25 ) Huh, Upgrades

(13:08 ) The System Prompt

(20:31 ) Positive Reactions Curated By Anthropic

(23:13 ) Other Systematic Positive Reactions

(27:24 ) Anecdotal Positive Reactions

(32:02 ) Anecdotal Negative Reactions

(40:57 ) Claude Enters Its Non-Sycophantic Era

(42:28 ) So Emotional

(48:25 ) Early Days

---

First published:

October 1st, 2025

Source:

https://www.lesswrong.com/posts/spQh5JfWXqTE5x5Wi/claude-sonnet-4-5-is-a-very-good-model

---

Narrated by TYPE III AUDIO.

---

Images from the article:

Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts, or another podcast app.

Comments

In Channel

“Antisocial media: AI’s killer app?” by David Scott Krueger (formerly: capybaralet)

2025-10-0310:04

“Omelas Is Perfectly Misread” by Tobias H

2025-10-0308:57

“How to Feel More Alive” by Logan Riggs

2025-10-0307:57

[Linkpost] “Eliciting secret knowledge from language models” by Arthur Conmy, Bartosz Cywiński, Sam Marks

2025-10-0305:31

“Checking in on AI-2027” by Baybar

2025-10-0207:18

[Linkpost] “No, That’s Not What the Flight Costs” by Max Niederman

2025-10-0202:56

“Nice-ish, smooth takeoff (with imperfect safeguards) probably kills most ‘classic humans’ in a few decades.” by Raemon

2025-10-0222:00

“</rant> </uncharitable> </psychologizing>” by Raemon

2025-10-0203:14

“AI Safety Research Futarchy: Using Prediction Markets to Choose Research Projects for MARS” by JasonBrown

2025-10-0209:12

“Some biology related things I found interesting” by Morpheus

2025-10-0204:04

[Linkpost] “Lectures on statistical learning theory for alignment researchers” by Vanessa Kosoy

2025-10-0201:14

“Claude Sonnet 4.5 Is A Very Good Model” by Zvi

2025-10-0249:19

“‘Pessimization’ Is just Ordinary Failure” by J Bostock

2025-10-0112:56

“Halfhaven virtual blogger camp” by Viliam

2025-10-0104:58

“Claude Sonnet 4.5: System Card and Alignment” by Zvi

2025-10-0101:01:17

“The famous survivorship bias image is a ‘loose reconstruction’ of methods used on a hypothetical dataset” by Lao Mein

2025-09-3002:50

“Ethical Design Patterns” by AnnaSalamon

2025-09-3038:40

“Why Corrigibility is Hard, and Important [IABED Resources]” by Raemon

2025-09-3028:06

“What SB 53, California’s new AI law, does” by tlevin

2025-09-3008:58

“On Dwarkesh Patel’s Podcast With Richard Sutton” by Zvi

2025-09-2941:53

00:00

1.0x

“Claude Sonnet 4.5 Is A Very Good Model” by Zvi

#box-pro-ellipsis-175947657131523{-webkit-line-clamp:2;}“Claude Sonnet 4.5 Is A Very Good Model” by Zvi

“Claude Sonnet 4.5 Is A Very Good Model” by Zvi

“Claude Sonnet 4.5 Is A Very Good Model” by Zvi