Reasoning Models Sometimes Output Illegible Chains of Thought

Update: 2025-11-24

Description

TL;DR: Models trained with outcome-based RL sometimes have reasoning traces that look very weird. In this paper, I evaluate 14 models and find that many of them often generate pretty illegible CoTs. I show that models seem to find this illegible text useful, with a model's accuracy dropping heavily when given only the legible parts of its CoT, and that legibility goes down when answering harder questions. However, when sampling many responses to the same questions, I find there's no real correlation between illegible reasoning and performance. From these results (and prior work), I think it's likely RL induces meaningful illegible reasoning, but that it may not be significantly more effective than legible reasoning.
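As a rough illustration of the within-question comparison mentioned in the TL;DR, the sketch below computes, for each question, the correlation between how illegible each sampled response is and whether it was correct. This is not the paper's code: the function names, the numeric illegibility score, and the toy data are all hypothetical, and the paper's actual scoring and statistics may differ.

```python
from collections import defaultdict
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation; returns None if either input has zero variance."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        return None
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    return sxy / sqrt(sxx * syy)


def per_question_correlations(samples):
    """Group samples by question and correlate illegibility with correctness.

    samples: iterable of (question_id, illegibility_score, is_correct).
    The score scale is an assumption; any numeric grader output would do.
    """
    by_question = defaultdict(list)
    for qid, score, correct in samples:
        by_question[qid].append((score, 1.0 if correct else 0.0))

    correlations = {}
    for qid, rows in by_question.items():
        r = pearson([s for s, _ in rows], [c for _, c in rows])
        if r is not None:  # skip questions where every sample agrees
            correlations[qid] = r
    return correlations


# Hypothetical toy data: several sampled responses per question.
samples = [
    ("q1", 1.0, True), ("q1", 2.0, False), ("q1", 3.0, True), ("q1", 4.0, False),
    ("q2", 1.0, True), ("q2", 5.0, True),  # always correct, so no correlation is defined
]
corrs = per_question_correlations(samples)
```

Correlating within a question rather than across the whole dataset keeps question difficulty from confounding the result, since harder questions are reported to produce both lower accuracy and less legible reasoning.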

Paper | Tweet thread | Streamlit | Code

Introduction

Reasoning models are LLMs that have been trained with RLVR (Reinforcement Learning from Verifiable Rewards), often to use extended reasoning in chain-of-thought to solve tasks. This could be pretty beneficial: if this reasoning is legible and faithful, then monitoring it would be very useful. There's a lot of prior work on faithfulness, but very little on legibility, which makes sense: until recently there haven't been models with meaningfully illegible reasoning traces.

For some reason, in practice RLVR [...]

---

Outline:

(01:08) Introduction

(04:38) How useful are illegible CoTs?

(06:29) Discussion

(10:46) Acknowledgements

The original text contained 9 footnotes which were omitted from this narration.

---

First published:

November 24th, 2025

Source:

https://www.lesswrong.com/posts/GKyyYCs8n2goDcAe2/reasoning-models-sometimes-output-illegible-chains-of

---

Narrated by TYPE III AUDIO.

---

Images from the article:

Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts, or another podcast app.

