“Resampling Conserves Redundancy & Mediation (Approximately) Under the Jensen-Shannon Divergence” by David Lorell

Update: 2025-10-31

Description

Audio note: this article contains 86 uses of latex notation, so the narration may be difficult to follow. There's a link to the original text in the episode description.

Around two months ago, John and I published Resampling Conserves Redundancy (Approximately). Fortunately, about two weeks ago, Jeremy Gillen and Alfred Harwood showed us that we were wrong.

This proof achieves, using the Jensen-Shannon divergence ("JS"), what the previous one failed to show using KL divergence ("_D_{KL}_"). In fact, while the previous attempt tried to show only that redundancy is conserved (in terms of _D_{KL}_) upon resampling latents, this proof shows that the redundancy and mediation conditions are conserved (in terms of JS).

Why Jensen-Shannon?

In just about all of our previous work, we have used _D_{KL}_ as our factorization error. (The error meant to capture the extent to which a given distribution fails to factor according to some graphical structure.) In this post I use the Jensen Shannon divergence.

_D_{KL}(U||V) := mathbb{E}_{U}lnfrac{U}{V}_

_JS(U||V) := frac{1}{2}D_{KL}left(U||frac{U+V}{2}right) + frac{1}{2}D_{KL}left(V||frac{U+V}{2}right)_

The KL divergence is a pretty fundamental quantity in information theory, and is used all over the place. (JS is usually defined in terms of _D_{KL}_, as above.) We [...]

---

Outline:

(01:04 ) Why Jensen-Shannon?

(03:04 ) Definitions

(05:33 ) Theorem

(06:29 ) Proof

(06:32 ) (1) _\\epsilon^{\\Gamma}_1 = 0_

(06:37 ) Proof of (1)

(06:52 ) (2) _\\epsilon^{\\Gamma}_2 \\leq (2\\sqrt{\\epsilon_1}+\\sqrt{\\epsilon_2})^2_

(06:57 ) Lemma 1: _JS(S||R) \\leq \\epsilon_1_

(07:10 ) Lemma 2: _\\delta(Q,R) \\leq \\sqrt{\\epsilon_1} + \\sqrt{\\epsilon_2}_

(07:20 ) Proof of (2)

(07:32 ) (3) _\\epsilon^{\\Gamma}_{med} \\leq (2\\sqrt{\\epsilon_1} + \\sqrt{\\epsilon_{med}})^2_

(07:37 ) Proof of (3)

(07:48 ) Results

(08:33 ) Bonus

The original text contained 1 footnote which was omitted from this narration.

---

First published:

October 31st, 2025

Source:

https://www.lesswrong.com/posts/JXsZRDcRX2eoWnSxo/resampling-conserves-redundancy-and-mediation-approximately

---

Narrated by TYPE III AUDIO.

Comments

In Channel

“Why Is Printing So Bad?” by johnswentworth

2025-11-0203:59

“Post title: Why I Transitioned: A Case Study” by Fiora Sunshine

2025-11-0217:22

[Linkpost] “You’re always stressed, your mind is always busy, you never have enough time” by mingyuan

2025-11-0204:18

“Re-rolling environment” by Raemon

2025-11-0102:29

“LLM-generated text is not testimony” by TsviBT

2025-11-0119:41

“Supervillain Monologues Are Unrealistic” by Algon

2025-11-0104:42

“Anthropic’s Pilot Sabotage Risk Report” by dmz

2025-11-0106:17

“OpenAI Moves To Complete Potentially The Largest Theft In Human History” by Zvi

2025-10-3137:36

“Resampling Conserves Redundancy & Mediation (Approximately) Under the Jensen-Shannon Divergence” by David Lorell

2025-10-3109:12

“Steering Evaluation-Aware Models to Act Like They Are Deployed” by Tim Hua, andrq, Sam Marks, Neel Nanda

2025-10-3030:26

[Linkpost] “AISLE discovered three new OpenSSL vulnerabilities” by Jan_Kulveit

2025-10-3002:04

“Sonnet 4.5’s eval gaming seriously undermines alignment evals, and this seems caused by training on alignment evals” by Alexa Pan, ryan_greenblatt

2025-10-3035:58

“ImpossibleBench: Measuring Reward Hacking in LLM Coding Agents” by Ziqian Zhong

2025-10-3008:04

[Linkpost] “Emergent Introspective Awareness in Large Language Models” by Drake Thomas

2025-10-3003:01

“An Opinionated Guide to Privacy Despite Authoritarianism” by TurnTrout

2025-10-2908:00

“The End of OpenAI’s Nonprofit Era” by garrison

2025-10-2917:39

“Please Do Not Sell B30A Chips to China” by Zvi

2025-10-2913:15

“AI Craziness Mitigation Efforts” by Zvi

2025-10-2922:12

“Some data from LeelaPieceOdds” by Jeremy Gillen

2025-10-2914:05

“When Will AI Transform the Economy?” by Andre.Infante

2025-10-2915:43

00:00

“Resampling Conserves Redundancy & Mediation (Approximately) Under the Jensen-Shannon Divergence” by David Lorell

#box-pro-ellipsis-176210042722236{-webkit-line-clamp:2;}“Resampling Conserves Redundancy & Mediation (Approximately) Under the Jensen-Shannon Divergence” by David Lorell

“Resampling Conserves Redundancy & Mediation (Approximately) Under the Jensen-Shannon Divergence” by David Lorell

“Resampling Conserves Redundancy & Mediation (Approximately) Under the Jensen-Shannon Divergence” by David Lorell