Interviewing Andrew Trask on how language models should store (and access) information

Update: 2024-10-10

Description

Andrew Trask has been one of the bright spots in my engagement with AI policy over the last year. He is a passionate idealist, trying to create a future for AI that enables privacy, academic research, and government involvement in a rapidly transforming ecosystem. Trask is a leader of OpenMined, an organization facilitating researcher access to non-public data and AI models; a senior research scientist at Google DeepMind; a PhD student at the University of Oxford; and an author and educator on deep learning.

You can find more about Trask on Twitter or Google Scholar. You may want to watch his recent talk at Cohere on the future of AI (and why data breakthroughs dominate), his lecture at MIT on privacy preserving ML, or his book on deep learning that has a substantial GitHub component. Here’s a slide I liked from his recent Cohere talk:

The organization he helps run, OpenMined, has a few principles that say a lot about his ambitions and approaches to modern AI:

We believe we can inspire all data owners to open their data for research by building open-source privacy software that empowers them to receive more benefits (co-authorships, citations, grants, etc.) while mitigating risks related to privacy, security, and IP.

We cover privacy of LLMs, retrieval LLMs, secure enclaves, o1, Apple's new models, and many more topics.

More on Andrew: https://x.com/iamtrask

Transcript and more information: https://www.interconnects.ai/p/interviewing-andrew-trask

Interconnects (https://www.interconnects.ai/)...

... on YouTube: https://www.youtube.com/@interconnects

... on Twitter: https://x.com/interconnectsai

... on Linkedin: https://www.linkedin.com/company/interconnects-ai

... on Spotify: https://open.spotify.com/show/2UE6s7wZC4kiXYOnWRuxGv

We Mention

* Claude 3.5 launch and “pre release testing with UK AISI” (and the US AI Safety Institute)

* OpenMined and PySyft

* CSET (Center for Security and Emerging Technology)

* NAIRR

* The “open data wall”

* Apple’s Secure Enclaves, Nvidia Secure Enclave

* Data-store language models literature

* RETRO: Retrieval-Enhanced Transformer from DeepMind (2021)

* SILO Language Models: Isolating Legal Risk In a Nonparametric Datastore (2023)

* Scaling Retrieval-Based Language Models with a Trillion-Token Datastore (2024)

Chapters

[00:00:00] Introduction

[00:03:12] Secure enclaves and pre-release testing with Anthropic and the UK AI Safety Institute

[00:16:31] Discussion of public AI and government involvement

[00:20:55] Datastore language models and better approaches to “open training data”

[00:42:18] History and development of OpenMined

[00:48:57] Use of language models on air-gapped networks

[00:52:10] Near future of secure enclave technology and industry adoption

[00:58:01] Conclusions and the future trajectory of AI development



Get full access to Interconnects at www.interconnects.ai/subscribe
