DiscoverCursorJohn Schulman on dead ends, scaling RL, and building research institutions
John Schulman on dead ends, scaling RL, and building research institutions

John Schulman on dead ends, scaling RL, and building research institutions

Update: 2025-12-17
Share

Description

A conversation with John Schulman on the first year LLMs could have been useful, building research teams, and where RL goes from here.00:00 - Speedrunning ChatGPT09:22 - Archetypes of research managers11:56 - Was OpenAI inspired by Bell Labs?16:54 - The absence of value functions18:23 - Continual learning21:09 - Brittle generalization24:05 - Co-training generators and verifiers, GANs27:06 - John’s personal use of AI for research28:54 - Day in the life33:01 - Slowdowns in consequential ML ideas36:21 - "Peer review" within the labs39:19 - Distribution shift in researchers43:33 - Future of RL45:33 - Will the labs coordinate if the world needs them to?44:46 - Forecasting ills in AGI and engineering47:53 - Thinking Machines

Comments 
00:00
00:00
x

0.5x

0.8x

1.0x

1.25x

1.5x

2.0x

3.0x

Sleep Timer

Off

End of Episode

5 Minutes

10 Minutes

15 Minutes

30 Minutes

45 Minutes

60 Minutes

120 Minutes

John Schulman on dead ends, scaling RL, and building research institutions

John Schulman on dead ends, scaling RL, and building research institutions

Cursor