DiscoverEAG TalksUnderstanding AI Alignment Through Learning Theory | Einar Urdshals | EAGxNordics 2025
Understanding AI Alignment Through Learning Theory | Einar Urdshals | EAGxNordics 2025

Understanding AI Alignment Through Learning Theory | Einar Urdshals | EAGxNordics 2025

Update: 2025-09-02
Share

Description

Watch on YouTube

Join Einar Urdshals as he introduces how Singular Learning Theory (SLT) can help advance AI safety. As AI systems grow more powerful, we need to understand how they learn and generalize to ensure they remain aligned. Einar shares how Timaeus applies mathematical frameworks to connect training data, model structure and behavior. Discover why "you are what you eat" applies to AI systems, and how understanding learning dynamics could be key to building AI that reliably acts according to human values.Einar Urdshals is a Researcher at Timaeus where he applies Singular Learning Theory to AI safety challenges. With a background in theoretical physics and mechanistic and developmental interpretability, his recent work focuses on preventing weight exfiltration by studying theoretical limits of model compression.

Comments 
loading
In Channel
loading
00:00
00:00
1.0x

0.5x

0.8x

1.0x

1.25x

1.5x

2.0x

3.0x

Sleep Timer

Off

End of Episode

5 Minutes

10 Minutes

15 Minutes

30 Minutes

45 Minutes

60 Minutes

120 Minutes

Understanding AI Alignment Through Learning Theory | Einar Urdshals | EAGxNordics 2025

Understanding AI Alignment Through Learning Theory | Einar Urdshals | EAGxNordics 2025

Aaron Bergman