Listen Top Shows Blog

Build LLMs From Scratch with Sebastian Raschka #52

Build LLMs From Scratch with Sebastian Raschka #52

Update: 2024-11-21

Share

Description

Our guest today is Sebastian Raschka, Senior Staff Research Engineer at Lightning AI and bestselling book author.

In our conversation, we first talk about Sebastian's role at Lightning AI and what the platform provides. We also dive into two great open source libraries that they've built to train, finetune, deploy and scale LLMs.: pytorch lightning and litgpt.

In the second part of our conversation, we dig into Sebastian's new book: "Build and LLM from Scratch". We discuss the key steps needed to train LLMs, the differences between GPT-2 and more recent models like Llama 3.1, multimodal LLMs and the future of the field.

If you enjoyed the episode, please leave a 5 star review and subscribe to the AI Stories Youtube channel.

Build a Large Language Model From Scratch Book: https://www.amazon.com/Build-Large-Language-Model-Scratch/dp/1633437167

Blog post on Multimodal LLMs: https://magazine.sebastianraschka.com/p/understanding-multimodal-llms

Lightning AI (with pytorch lightning and litgpt repos): https://github.com/Lightning-AI

Follow Sebastian on LinkedIn: https://www.linkedin.com/in/sebastianraschka/

Follow Neil on LinkedIn: https://www.linkedin.com/in/leiserneil/

---

(00:00 ) - Intro

(02:27 ) - How Sebastian got into Data & AI

(06:44 ) - Regression and Loss Function

(13:32 ) - Academia to Join LightningAI

(21:14 ) - Lightning AI VS other Cloud providers

(26:14 ) - Building PyTorch Lightning & LitGPT

(30:48 ) - Sebastian’s role as Staff Research Engineer

(34:35 ) - Build an LLM From Scratch

(45:00 ) - From GPT2 to Llama 3.1

(48:34 ) - Long Context VS RAG

(56:15 ) - Multimodal LLMs

(01:03:27 ) - Career Advice

Comments

Top Podcasts

The Best New Comedy Podcast Right Now – June 2024 The Best News Podcast Right Now – June 2024 The Best New Business Podcast Right Now – June 2024 The Best New Sports Podcast Right Now – June 2024 The Best New True Crime Podcast Right Now – June 2024 The Best New Joe Rogan Experience Podcast Right Now – June 20 The Best New Dan Bongino Show Podcast Right Now – June 20 The Best New Mark Levin Podcast – June 2024

In Channel

Build LLMs From Scratch with Sebastian Raschka #52

Build LLMs From Scratch with Sebastian Raschka #52

2024-11-2101:06:03

Code Generation & Synthetic Data With Loubna Ben Allal #51

Code Generation & Synthetic Data With Loubna Ben Allal #51

2024-11-0747:06

He Built an AI Football Coach Assistant & Google Maps Algorithm with Petar Veličković #50

He Built an AI Football Coach Assistant & Google Maps Algorithm with Petar Veličković #50

2024-10-2201:06:54

Fine-Tuning LLMs, Hugging Face & Open Source with Lewis Tunstall #49

Fine-Tuning LLMs, Hugging Face & Open Source with Lewis Tunstall #49

2024-06-2001:20:40

MLOps Engineering & Coding Best Practices with Maria Vechtomova #48

MLOps Engineering & Coding Best Practices with Maria Vechtomova #48

2024-05-3059:51

OpenAI, AGI, LLMs Eval & Applied ML with Reah Miyara #47

OpenAI, AGI, LLMs Eval & Applied ML with Reah Miyara #47

2024-05-1601:04:21

Google, Gemini, Cloud & LLMOps with Erwin Huizenga #46

Google, Gemini, Cloud & LLMOps with Erwin Huizenga #46

2024-04-2501:03:32

Deep Learning for Autonomous Driving with Andras Palffy #45

Deep Learning for Autonomous Driving with Andras Palffy #45

2024-04-1058:12

Launching 7-Figures AI Products With Franziska Kirschner #44

Launching 7-Figures AI Products With Franziska Kirschner #44

2024-03-2601:05:28

How He Built The Best 7B Params LLM with Maxime Labonne #43

How He Built The Best 7B Params LLM with Maxime Labonne #43

2024-03-0753:46

From Biostatistician to DevRel at Deci AI with Harpreet Sahota #42

From Biostatistician to DevRel at Deci AI with Harpreet Sahota #42

2024-02-1959:24

Building AI Startups & Raising Funds with Ryan Shannon #41

Building AI Startups & Raising Funds with Ryan Shannon #41

2024-01-2901:11:22

Interpreting Black Box Models with Christoph Molnar #40

Interpreting Black Box Models with Christoph Molnar #40

2024-01-1055:18

From English Teacher to MLOps Leader with Demetrios Brinkmann #39

From English Teacher to MLOps Leader with Demetrios Brinkmann #39

2023-12-1944:39

MLOps & LLMOps with Noah Gift #38

MLOps & LLMOps with Noah Gift #38

2023-11-3001:11:21

Building Over 1000 Models for Uber with Marianne Ducournau #37

Building Over 1000 Models for Uber with Marianne Ducournau #37

2023-11-1601:07:29

World Number 1 on Kaggle with Christof Henkel #36

World Number 1 on Kaggle with Christof Henkel #36

2023-10-2601:08:12

The Story Behind Mosaic ML's $1.3 Billion Acquisition with Davis Blalock #35

The Story Behind Mosaic ML's $1.3 Billion Acquisition with Davis Blalock #35

2023-10-1001:05:45

Kellin Pelrine - How He Crushed A Superhuman Go-Playing AI 14 Games To 1 #34

Kellin Pelrine - How He Crushed A Superhuman Go-Playing AI 14 Games To 1 #34

2023-06-0801:09:56

Chanuki Seresinhe - Head of Data Science at Zoopla - Generative AI & AI for happiness #33

Chanuki Seresinhe - Head of Data Science at Zoopla - Generative AI & AI for happiness #33

2023-05-2557:13

00:00

00:00

x

Build LLMs From Scratch with Sebastian Raschka #52

Build LLMs From Scratch with Sebastian Raschka #52

Neil Leiser