Microsoft CTO Kevin Scott on How Far Scaling Laws Will Extend

Update: 2024-07-09

Description

The current LLM era is the result of scaling the size of models in successive waves (and the compute to train them). It is also the result of better-than-Moore’s-Law price vs performance ratios in each new generation of Nvidia GPUs. The largest platform companies are continuing to invest in scaling as the prime driver of AI innovation.

Are they right, or will marginal returns level off soon, leaving hyperscalers with too much hardware and too few customer use cases? To find out, we talk to Microsoft CTO Kevin Scott who has led their AI strategy for the past seven years. Scott describes himself as a “short-term pessimist, long-term optimist” and he sees the scaling trend as durable for the industry and critical for the establishment of Microsoft’s AI platform.

Scott believes there will be a shift across the compute ecosystem from training to inference as the frontier models continue to improve, serving wider and more reliable use cases. He also discusses the coming business models for training data, and even what ad units might look like for autonomous agents.

Hosted by: Pat Grady and Bill Coughran, Sequoia Capital

Mentioned:

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding, the 2018 Google paper that convinced Kevin that Microsoft wasn’t moving fast enough on AI.

Dennard scaling: The scaling law that describes the proportional relationship between transistor size and power use; has not held since 2012 and is often confused with Moore’s Law.

Textbooks Are All You Need: Microsoft paper that introduces a new large language model for code, phi-1, that achieves smaller size by using higher quality “textbook” data.

GPQA and MMLU: Benchmarks for reasoning

Copilot: Microsoft product line of GPT consumer assistants from general productivity to design, vacation planning, cooking and fitness.

Devin: Autonomous AI code agent from Cognition Labs that Microsoft recently announced a partnership with.

Ray Solomonoff: Participant in the 1956 Dartmouth Summer Research Project on Artificial Intelligence that named the field; Kevin admires his prescience about the importance of probabilistic methods decades before anyone else.

00:00 - Introduction

01:20 - Kevin’s backstory

06:56 - The role of PhDs in AI engineering

09:56 - Microsoft’s AI strategy

12:40 - Highlights and lowlights

16:28 - Accelerating investments

18:38 - The OpenAI partnership

22:46 - Soon inference will dwarf training

27:56 - Will the demand/supply balance change?

30:51 - Business models for data

36:54 - The value function

39:58 - Copilots

44:47 - The 98/2 rule

49:34 - Solving zero-sum games

57:13 - Lightning round

Comments

Top Podcasts

The Best New Comedy Podcast Right Now – June 2024 The Best News Podcast Right Now – June 2024 The Best New Business Podcast Right Now – June 2024 The Best New Sports Podcast Right Now – June 2024 The Best New True Crime Podcast Right Now – June 2024 The Best New Joe Rogan Experience Podcast Right Now – June 20 The Best New Dan Bongino Show Podcast Right Now – June 20 The Best New Mark Levin Podcast – June 2024

In Channel

XBOW CEO and GitHub Copilot Creator Oege de Moor: Cracking the Code on Offensive Security With AI

2024-12-1051:37

Ramp CEO Eric Glyman: Using AI to Build “Self-Driving Money”

2024-12-0338:48

Dust’s Gabriel Hubert and Stanislas Polu: Getting the Most From AI With Multiple Custom Agents

2024-11-2601:03:07

Clay’s Kareem Amin on Building the Sales ‘System of Action’ with AI

2024-11-1951:38

Decart’s Dean Leitersdorf on AI-Generated Video Games and Worlds

2024-11-1346:34

How Glean CEO Arvind Jain Solved the Enterprise Search Problem – and What It Means for AI at Work

2024-10-2944:48

OpenAI Researcher Dan Roberts on What Physics Can Teach Us About AI

2024-10-2241:42

Google NotebookLM’s Raiza Martin and Jason Spielman on Creating Delightful AI Podcast Hosts and the Potential for Source-Grounded AI

2024-10-1532:07

Snowflake CEO Sridhar Ramaswamy on Using Data to Create Simple, Reliable AI for Businesses

2024-10-0859:29

OpenAI's Noam Brown, Ilge Akkaya and Hunter Lightman on o1 and Teaching LLMs to Reason Better

2024-10-0245:22

Why Vlad Tenev and Tudor Achim of Harmonic Think AI Is About to Change Math—and Why It Matters

2024-09-2439:45

Jim Fan on Nvidia’s Embodied AI Lab and Jensen Huang’s Prediction that All Robots will be Autonomous

2024-09-1749:13

Founder Eric Steinberger on Magic’s Counterintuitive Approach to Pursuing AGI

2024-09-1051:15

Crucible Moments Returns for S2: The ServiceNow Story ft. CEO Frank Slootman & Founder Fred Luddy

2024-09-0342:53

Sierra Co-Founder Clay Bavor on Making Customer-Facing AI Agents Delightful

2024-08-2701:12:31

Phaidra’s Jim Gao on Building the Fourth Industrial Revolution with Reinforcement Learning

2024-08-2050:33

Fireworks Founder Lin Qiao on How Fast Inference and Small Models Will Benefit Businesses

2024-08-1339:18

GitHub CEO Thomas Dohmke on Building Copilot, and the the Future of Software Development

2024-08-0601:07:34

Meta’s Joe Spisak on Llama 3.1 405B and the Democratization of Frontier Models

2024-07-3042:07

Klarna CEO Sebastian Siemiatkowski on Getting AI to Do the Work of 700 Customer Service Reps

2024-07-2351:35

00:00

Microsoft CTO Kevin Scott on How Far Scaling Laws Will Extend

#box-pro-ellipsis-17352583190389{-webkit-line-clamp:2;}Microsoft CTO Kevin Scott on How Far Scaling Laws Will Extend

Microsoft CTO Kevin Scott on How Far Scaling Laws Will Extend

Sequoia Capital

Microsoft CTO Kevin Scott on How Far Scaling Laws Will Extend