DiscoverLessWrong (30+ Karma)“Plans A, B, C, and D for misalignment risk” by ryan_greenblatt
“Plans A, B, C, and D for misalignment risk” by ryan_greenblatt

“Plans A, B, C, and D for misalignment risk” by ryan_greenblatt

Update: 2025-10-08
Share

Description

I sometimes think about plans for how to handle misalignment risk. Different levels of political will for handling misalignment risk result in different plans being the best option. I often divide this into Plans A, B, C, and D (from most to least political will required). See also Buck's quick take about different risk level regimes.


In this post, I'll explain the Plan A/B/C/D abstraction as well as discuss the probabilities and level of risk associated with each plan.


Here is a summary of the level of political will required for each of these plans and the corresponding takeoff trajectory:



  • Plan A: There is enough will for some sort of strong international agreement that mostly eliminates race dynamics and allows for slowing down (at least for some reasonably long period, e.g. 10 years) along with massive investment in security/safety work.

  • Plan B: The US [...]

---

Outline:

(02:34 ) Plan A

(04:24 ) Plan B

(05:24 ) Plan C

(05:47 ) Plan D

(06:27 ) Plan E

(07:20 ) Thoughts on these plans

The original text contained 6 footnotes which were omitted from this narration.

---


First published:

October 8th, 2025



Source:

https://www.lesswrong.com/posts/E8n93nnEaFeXTbHn5/plans-a-b-c-and-d-for-misalignment-risk


---


Narrated by TYPE III AUDIO.

Comments 
In Channel
loading
00:00
00:00
x

0.5x

0.8x

1.0x

1.25x

1.5x

2.0x

3.0x

Sleep Timer

Off

End of Episode

5 Minutes

10 Minutes

15 Minutes

30 Minutes

45 Minutes

60 Minutes

120 Minutes

“Plans A, B, C, and D for misalignment risk” by ryan_greenblatt

“Plans A, B, C, and D for misalignment risk” by ryan_greenblatt