Thinking Machines: AI & Philosophy

Pre-training LLMs: One Model To Rule Them All? with Talfan Evans, DeepMind

Published: 2024-05-18
Description

Talfan Evans is a research engineer at DeepMind, where he focuses on data curation and foundational research for pre-training LLMs and multimodal models like Gemini. I ask Talfan: 

  • Will one model rule them all?
  • What does "high quality data" actually mean in the context of LLM training?
  • Is language model pre-training becoming commoditized?
  • Are companies like Google and OpenAI keeping their AI secrets to themselves?
  • Does the startup or open source community stand a chance next to the giants?

Also check out Talfan's latest paper at DeepMind, Bad Students Make Good Teachers.

Hosted by Daniel Reid Cahn