“What do people mean when they say that something will become more like a utility maximizer?” by Nina Panickssery
Description
AI risk arguments often gesture at smarter AIs being "closer to a perfect utility maximizer" (and hence more dangerous), but what does this mean, concretely? Almost anything can be modeled as a maximizer of some utility function.
The only way I can see to salvage this line of reasoning is to restrict the class of utility functions an agent can have, such that the agent's best-fit utility function cannot be maximized until it becomes very capable. The restriction may be justified on the basis of which kinds of agents are unstable under real-world conditions or will get outcompeted by other agents.
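To make the triviality claim above concrete, here is a minimal sketch (an illustration, not from the original post): let $A(h)$ denote the action an agent actually takes after observation history $h$, and define

\[
  u(h, a) =
  \begin{cases}
    1 & \text{if } a = A(h), \\
    0 & \text{otherwise.}
  \end{cases}
\]

Every action the agent takes maximizes this $u$, so "being a utility maximizer" rules out no behavior at all unless the class of admissible utility functions is restricted in some way, which is exactly the move described above.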
What do we mean when we say a person is more or less of a perfect utility maximizer/is more or less of a "rational agent"?
With people, you can appeal to the notion of reasonable vs. unreasonable utility functions, and hence look at their divergence from a maximizer of [...]
---
Outline:
(00:48) What do we mean when we say a person is more or less of a perfect utility maximizer/is more or less of a "rational agent"?
(01:55) Unsatisfactory answers I've seen
(01:59) A1: It's about being able to cause the universe to look more like the way you want it to
(02:24) A2: It's more rational if the implied utility function is simpler
(02:43) A3: It's the degree to which you satisfy the VNM axioms
(02:56) The most promising answers I've seen are ways to formalize the reasonableness restriction
(03:02) A4: It's the degree to which your implied preferences are coherent over time
(03:40) A5: It's the degree to which your implied preferences are robust to arbitrary-seeming perturbations
---
First published:
September 21st, 2025
---
Narrated by TYPE III AUDIO.