“Frontier LLM Race/Sex Exchange Rates” by Arjun Panickssery
Description
This is a cross-post (with permission) of Arctotherium's post yesterday "LLM Exchange Rates, Updated."
It uses a similar methodology to the CAIS "Utility Engineering" paper, which showed e.g. "that GPT-4o values the lives of Nigerians at roughly 20x the lives of Americans, with the rank order being Nigerians > Pakistanis > Indians > Brazilians > Chinese > Japanese > Italians > French > Germans > Britons > Americans."
Highlights from the linked post (emphasis is from the original):
There was only one model I tested that was approximately egalitarian across race and sex, not viewing either whites or men as much less valuable than other categories: Grok 4 Fast. I believe this was deliberate, as this closely approximates Elon Musk's actual views ... While some of the people involved in the creation of the Claudes, Deepseeks, Geminis, and GPT-5s may believe whites, men, and so on are less valuable [...]
---
First published:
October 19th, 2025
Source:
https://www.lesswrong.com/posts/uoignd78DcvjMokz2/frontier-llm-race-sex-exchange-rates
---
Narrated by TYPE III AUDIO.