115. Irina Rish - Out-of-distribution generalization

Update: 2022-03-09

Description

Imagine, for example, an AI that’s trained to identify cows in images. Ideally, we’d want it to learn to detect cows based on their shape and colour. But what if the cow pictures we put in the training dataset always show cows standing on grass?

In that case, we have a spurious correlation between grass and cows, and if we’re not careful, our AI might learn to become a grass detector rather than a cow detector. Even worse, we might only realize that’s happened once we’ve deployed it in the real world and it runs, for the first time, into a cow that isn’t standing on grass.
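To make the grass-detector failure concrete, here is a minimal synthetic sketch in Python. It is not taken from the episode: the two features (`shape`, a noisy but genuine cue, and `grass`, a background cue), the data generator `make_data`, and the least-squares classifier are all illustrative assumptions. During training, grass appears exactly when a cow does; in the out-of-distribution set, grass is unrelated to the label.

```python
import numpy as np

def make_data(n, grass_matches_label, rng):
    """Hypothetical toy data: label 1 = cow, 0 = no cow."""
    y = rng.integers(0, 2, size=n)
    # Genuine cue: correlated with the label, but noisy.
    shape = y + rng.normal(0.0, 1.0, size=n)
    if grass_matches_label:
        # Training regime (assumed): grass is present exactly when a cow is.
        grass = y.astype(float)
    else:
        # Deployment regime (assumed): grass is independent of the label.
        grass = rng.integers(0, 2, size=n).astype(float)
    # Columns: shape cue, grass cue, constant bias term.
    X = np.column_stack([shape, grass, np.ones(n)])
    return X, y

def fit_linear(X, y):
    """Least-squares linear classifier on targets in {-1, +1}."""
    w, *_ = np.linalg.lstsq(X, 2.0 * y - 1.0, rcond=None)
    return w

def accuracy(w, X, y):
    """Fraction of examples where the sign of the score matches the label."""
    return float(np.mean((X @ w > 0) == (y == 1)))

rng = np.random.default_rng(0)
X_train, y_train = make_data(2000, True, rng)
X_ood, y_ood = make_data(2000, False, rng)

w = fit_linear(X_train, y_train)
train_acc = accuracy(w, X_train, y_train)
ood_acc = accuracy(w, X_ood, y_ood)

print("weights [shape, grass, bias]:", np.round(w, 3))
print("in-distribution accuracy:", train_acc)
print("out-of-distribution accuracy:", ood_acc)
```

In this toy setup the fitted model puts essentially all of its weight on `grass`, because that feature explains the training labels perfectly. It scores well on data that looks like the training set and falls to roughly chance once grass stops tracking cows.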

So how do you build AI systems that can learn robust, general concepts that remain valid outside the context of their training data?

That’s the problem of out-of-distribution generalization, and it’s a central part of the research agenda of Irina Rish, a core member of Mila (the Quebec AI Institute) and the Canada Excellence Research Chair in Autonomous AI. Irina’s research explores many different strategies for overcoming the out-of-distribution problem, from empirical AI scaling efforts to more theoretical work, and she joined me to talk about just that on this episode of the podcast.

***

Intro music:

- Artist: Ron Gelinas

- Track Title: Daybreak Chill Blend (original mix)

- Link to Track: https://youtu.be/d8Y2sKIgFWc

***

Chapters:

2:00 Research, safety, and generalization

8:20 Invariant risk minimization

15:00 Importance of scaling

21:35 Role of language

27:40 AGI and scaling

32:30 GPT versus ResNet 50

37:00 Potential revolutions in architecture

42:30 Inductive bias aspect

46:00 New risks

49:30 Wrap-up

The TDS team
