36 - Adam Shai and Paul Riechers on Computational Mechanics
Description
Sometimes, people talk about transformers as having "world models" as a result of being trained to predict internet text. But what does this even mean? In this episode, I talk with Adam Shai and Paul Riechers about their work applying computational mechanics, a sub-field of physics that studies how to optimally predict stochastic processes, to neural networks.
Patreon: https://www.patreon.com/axrpodcast
Ko-fi: https://ko-fi.com/axrpodcast
The transcript: https://axrp.net/episode/2024/09/29/episode-36-adam-shai-paul-riechers-computational-mechanics.html
Topics we discuss, and timestamps:
0:00:42 - What computational mechanics is
0:29:49 - Computational mechanics vs other approaches
0:36:16 - What world models are
0:48:41 - Fractals
0:57:43 - How the fractals are formed
1:09:55 - Scaling computational mechanics for transformers
1:21:52 - How Adam and Paul found computational mechanics
1:36:16 - Computational mechanics for AI safety
1:46:05 - Following Adam and Paul's research
Simplex AI Safety: https://www.simplexaisafety.com/
Research we discuss:
Transformers represent belief state geometry in their residual stream: https://arxiv.org/abs/2405.15943
Transformers represent belief state geometry in their residual stream [LessWrong post]: https://www.lesswrong.com/posts/gTZ2SxesbHckJ3CkF/transformers-represent-belief-state-geometry-in-their
Why Would Belief-States Have A Fractal Structure, And Why Would That Matter For Interpretability? An Explainer: https://www.lesswrong.com/posts/mBw7nc4ipdyeeEpWs/why-would-belief-states-have-a-fractal-structure-and-why
Episode art by Hamish Doodles: hamishdoodles.com