“AI Craziness Mitigation Efforts” by Zvi

Update: 2025-10-28

Description

AI chatbots in general, and OpenAI's ChatGPT in particular (especially GPT-4o, the absurd sycophant), have long had problems around mental health.

I covered various related issues last month.

This post is an opportunity to collect links to previous coverage in the first section, and go into the weeds on some new events in the later sections. A lot of you should likely skip most of the in-the-weeds discussions.

What Are The Problems

There are a few distinct phenomena we have reason to worry about:

1. Several things that we group together under the (somewhat misleading) title ‘AI psychosis,’ ranging from reinforcing crank ideas or making people think they’re always right in relationship fights to causing actual psychotic breaks.
    1. Thebes referred to this as three problem modes: The LLM as a social relation that draws you into madness, as an object relation [...]

---

Outline:

(00:36) What Are The Problems

(03:06) This Week In Crazy

(05:05) OpenAI Updates Its Model Spec

(09:00) Detection Rates

(11:08) Anthropic Says Thanks For The Memories

(12:32) Boundary Violations

(18:41) A Note On Claude Prompt Injections

(20:17) Conclusion

---

First published: October 28th, 2025

Source: https://www.lesswrong.com/posts/vrjM8qLKbiAYKAHTa/ai-craziness-mitigation-efforts

---

Narrated by TYPE III AUDIO.

