Interviewing Sebastian Raschka on the state of open LLMs, Llama 3.1, and AI education

Update: 2024-08-01

Description

This week, I had the pleasure of chatting with Sebastian Raschka. Sebastian is doing a ton of work on the open language model ecosystem and AI research broadly. He’s been writing the great Ahead of AI newsletter (that has the biggest audience overlap with Interconnects, at 26%, so a lot of you know him) and multiple educational books, all on top of being a full time machine learning engineer at Lightning.ai, where he maintains LitGPT, which he described as being like Karpathy’s NanoGPT, with slightly more abstractions.

This conversation mostly surrounds keeping up with AI research, the state of the open LLM ecosystem post Llama 3.1, and many narrow topics in between. I learned that Sebastian used to be an Arxiv moderator, which gives some simple color on how Arxiv and sifting through thousands of papers works. We cover a lot of ground here, so I hope you enjoy it.

00:00:00 Introduction & Sebastian's background
00:04:28 The state of deep learning and language models in 2018
00:08:02 Sebastian's work at Lightning AI and LitGPT
00:12:23 Distillation and its potential in language model training
00:14:14 Implementing language models and common pitfalls
00:18:45 Modern architectures: Mixture of experts models, early v. late fusion multimodal
00:24:23 Sebastian's book on building language models from scratch
00:27:13 Comparing ChatGPT, Claude, and Google's Gemini for various tasks
00:38:21 Vibing and checking new language models during implementation
00:40:42 Selecting papers to read and moderating Arxiv
00:45:36 Motivation for working on AI education
00:52:46 Llama 3 fine-tuning
00:57:26 The potential impact of AI on jobs in writing and education
01:00:57 The future directions of AI

More details: https://www.interconnects.ai/interviewing-sebastian-raschka

Comments

Top Podcasts

The Best New Comedy Podcast Right Now – June 2024 The Best News Podcast Right Now – June 2024 The Best New Business Podcast Right Now – June 2024 The Best New Sports Podcast Right Now – June 2024 The Best New True Crime Podcast Right Now – June 2024 The Best New Joe Rogan Experience Podcast Right Now – June 20 The Best New Dan Bongino Show Podcast Right Now – June 20 The Best New Mark Levin Podcast – June 2024

In Channel

AI Safety's Crux: Culture vs. Capitalism

2024-10-0210:30

Interviewing Riley Goodside on the science of prompting

2024-09-3001:08:39

Llama 3.2 Vision and Molmo: Foundations for the multimodal open-source ecosystem

2024-09-2714:04

Reverse engineering OpenAI's o1

2024-09-1718:52

Futures of the data foundry business model

2024-09-1111:32

A post-training approach to AI regulation with Model Specs

2024-09-1005:39

OpenAI's Strawberry, LM self-talk, inference scaling laws, and spending more on inference

2024-09-0510:40

OLMoE and the hidden simplicity in training better foundation models

2024-09-0410:31

On the current definitions of open-source AI and the state of the data commons

2024-08-2808:01

Nous Hermes 3 and exploiting underspecified evaluations

2024-08-1608:32

Interviewing Ross Taylor on LLM reasoning, Llama fine-tuning, Galactica, agents

2024-08-0801:02:22

A recipe for frontier model post-training

2024-08-0710:24

Interviewing Sebastian Raschka on the state of open LLMs, Llama 3.1, and AI education

2024-08-0101:03:42

GPT-4o-mini changed ChatBotArena

2024-07-3107:55

Llama 3.1 405b, Meta's AI strategy, and the new open frontier model ecosystem

2024-07-2315:22

SB 1047, AI regulation, and unlikely allies for open models

2024-07-1714:20

Switched to Claude 3.5

2024-07-0306:40

Interviewing Dean Ball on AI policy

2024-06-2756:31

RLHF Roundup: Trying to get good at PPO, charting RLHF's impact, RewardBench retrospective, and a reward model competition

2024-06-2611:52

Frontiers in synthetic data

2024-06-2111:27

00:00

Interviewing Sebastian Raschka on the state of open LLMs, Llama 3.1, and AI education

#box-pro-ellipsis-173478303508824{-webkit-line-clamp:2;}Interviewing Sebastian Raschka on the state of open LLMs, Llama 3.1, and AI education

Interviewing Sebastian Raschka on the state of open LLMs, Llama 3.1, and AI education

Nathan Lambert

Interviewing Sebastian Raschka on the state of open LLMs, Llama 3.1, and AI education