Understanding AI Alignment Through Learning Theory | Einar Urdshals | EAGxNordics 2025

Update: 2025-09-02

Description

Join Einar Urdshals as he introduces how Singular Learning Theory (SLT) can help advance AI safety. As AI systems grow more powerful, we need to understand how they learn and generalize to ensure they remain aligned. Einar shares how Timaeus applies mathematical frameworks to connect training data, model structure and behavior. Discover why "you are what you eat" applies to AI systems, and how understanding learning dynamics could be key to building AI that reliably acts according to human values.Einar Urdshals is a Researcher at Timaeus where he applies Singular Learning Theory to AI safety challenges. With a background in theoretical physics and mechanistic and developmental interpretability, his recent work focuses on preventing weight exfiltration by studying theoretical limits of model compression.

Comments

In Channel

Opening remarks | Kelsey Piper | EA Global: Bay Area 2025

2025-09-0244:05

The Moral Circle: insects, AI systems, and other beings who might matter | Jeff Sebo | EAG Bay Area 2025

2025-09-0244:01

Doing impactful research | Marcus Davis | EA Global: Bay Area 2025

2025-09-0254:16

AI-enabled human power grabs, and how to stop them | Tom Davidson | EA Global: Bay Area 2025

2025-09-0251:17

Strategic directions for a digital consciousness | Hayley Clatterbuck & Derek Shiller | EAG BA 2025

2025-09-0254:05

Critical levers: Where to push for farmed animals now | Zoe Sigle | EA Global: Bay Area 2025

2025-09-0246:30

Fireside chat | William MacAskill and Tom Davidson | EA Global: Bay Area 2025

2025-09-0245:39

Evidence to Policy Pipeline | Eva Vivalt | EAGxToronto 2024

2025-09-0250:22

Giving Portfolios Under Moral Uncertainty | Bob Fischer | EAGxToronto 2024

2025-09-0253:04

10,000th 10% Giving Pledge in 2024? | James Rayton | EAGxToronto 2024

2025-09-0249:53

Hepatitis C Human Challenge Studies in Toronto | Jake Eberts | EAGxToronto 2024

2025-09-0258:22

Giving Kids a Fighting Chance for a Very Low Cost | Andrew Pavao | EAGxToronto 2024

2025-09-0226:14

Wild and Farmed Animal Welfare | Kyle Johannsen | EAGxToronto 2024

2025-09-0253:39

Making AI Risk Accessible with Compelling Analogies | Darren McKee | EAGxToronto 2024

2025-09-0253:20

How Not to Waste Your Career | Matt Reardon | EAGxToronto 2024

2025-09-0254:08

Nordic Comparative Advantage | Maria Bækkelie, Laura Kull | EAGxNordics 2025

2025-09-0223:37

The Case for Prioritizing Animals in your EA Journey | Niklas Fjeldberg | EAGxNordics 2025

2025-09-0218:08

Understanding AI Alignment Through Learning Theory | Einar Urdshals | EAGxNordics 2025

2025-09-0222:50

Overcoming Perfectionism and Imposter Syndrome: A CBT Perspective | Tim LeBon | EAGxNordics 2025

2025-09-0250:22

CEPI: 100 Day Mission Vaccine Development; What Does it Take? | Stig Tollefsen | EAGxNordics 2025

2025-09-0247:00

00:00

1.0x

Understanding AI Alignment Through Learning Theory | Einar Urdshals | EAGxNordics 2025

#box-pro-ellipsis-176467622766787{-webkit-line-clamp:2;}Understanding AI Alignment Through Learning Theory | Einar Urdshals | EAGxNordics 2025

Understanding AI Alignment Through Learning Theory | Einar Urdshals | EAGxNordics 2025

Aaron Bergman

Understanding AI Alignment Through Learning Theory | Einar Urdshals | EAGxNordics 2025