The Uncertain Art of Accelerating ML Models with Sylvain Gugger
Description
Sylvain Gugger is a former math teacher who fell into machine learning via a MOOC and became an expert in the low-level performance details of neural networks. He’s now on the ML infrastructure team at Jane Street, where he helps traders speed up their models. In this episode, Sylvain and Ron go deep on learning rate schedules; the subtle performance bugs PyTorch lets you write; how to keep a hungry GPU well-fed; and lots more, including the foremost importance of reproducibility in training runs. They also discuss some of the unique challenges of doing ML in the world of trading, like the unusual size and shape of market data and the need to do inference at shockingly low latencies.
You can find the transcript for this episode on our website.
Some links to topics that came up in the discussion:
- “Practical Deep Learning for Coders,” a FastAI MOOC by Jeremy Howard, and the book, of which Sylvain is a co-author.
- The Stanford DAWNBench competition that Sylvain participated in.
- HuggingFace, and the Accelerate library that Sylvain wrote there.
- Some of the languages/systems for expressing ML models that were discussed: PyTorch, TensorFlow, Jax, Mojo, and Triton.
- CUDA graphs and streams
- Hogwild!, a lock-free approach to parallel SGD.
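As a taste of the "subtle performance bugs PyTorch lets you write" mentioned above, one classic pitfall is forcing a host-device synchronization on every training step, which leaves the GPU idle while the CPU waits. This is an illustrative sketch (not from the episode); it runs on CPU as well, where the synchronization is harmless:

```python
import torch

# A stand-in training loop. Calling .item() on a device tensor
# blocks the host until the device has finished computing it, so
# doing it every step serializes the pipeline and starves the GPU.
total = torch.zeros(())
steps = 100
for step in range(steps):
    loss = torch.rand(())  # stand-in for a real loss tensor

    # Slow pattern (avoid inside the loop):
    #     running_loss += loss.item()  # syncs host and device each step

    # Faster pattern: accumulate on-device, read back once at the end.
    total += loss.detach()

mean_loss = (total / steps).item()  # single synchronization point
print(f"mean loss: {mean_loss:.3f}")
```

The same reasoning applies to logging, progress bars, and metrics: batch up device-to-host transfers rather than paying for one per step.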