DiscoverAlignment Newsletter PodcastAlignment Newsletter #161: Creating generalizable reward functions for multiple tasks by learning a model of functional similarity
In Channel
loading
00:00
00:00
1.0x

0.5x

0.8x

1.0x

1.25x

1.5x

2.0x

3.0x

Sleep Timer

Off

End of Episode

5 Minutes

10 Minutes

15 Minutes

30 Minutes

45 Minutes

60 Minutes

120 Minutes

Alignment Newsletter #161: Creating generalizable reward functions for multiple tasks by learning a model of functional similarity

Alignment Newsletter #161: Creating generalizable reward functions for multiple tasks by learning a model of functional similarity

We and our partners use cookies to personalize your experience, to show you ads based on your interests, and for measurement and analytics purposes. By using our website and our services, you agree to our use of cookies as described in our Cookie Policy.