Genie: Generative Interactive Environments with Ashley Edwards - #696

Update: 2024-08-05

Description

Today, we're joined by Ashley Edwards, a member of technical staff at Runway, to discuss Genie: Generative Interactive Environments, a system for creating ‘playable’ video environments for training deep reinforcement learning (RL) agents at scale in a completely unsupervised manner. We explore the motivations behind Genie, the challenges of data acquisition for RL, and Genie’s capability to learn world models from videos without explicit action data, enabling seamless interaction and frame prediction. Ashley walks us through Genie’s core components (the latent action model, video tokenizer, and dynamics model) and explains how these elements work together to predict future frames in video sequences. We discuss the model architecture, training strategies, and benchmarks used, as well as the application of spatiotemporal transformers and MaskGIT techniques for efficient token prediction and representation. Finally, we touch on Genie’s practical implications, its comparison to other video generation models like Sora, and potential future directions in video generation and diffusion models.

The complete show notes for this episode can be found at https://twimlai.com/go/696.

In Channel

AI Agents for Data Analysis with Shreya Shankar - #703

2024-10-01, 47:54

Stealing Part of a Production Language Model with Nicholas Carlini - #702

2024-09-23, 01:03:00

Supercharging Developer Productivity with ChatGPT and Claude with Simon Willison - #701

2024-09-16, 01:13:45

Automated Design of Agentic Systems with Shengran Hu - #700

2024-09-03, 59:42

The EU AI Act and Mitigating Bias in Automated Decisioning with Peter van der Putten - #699

2024-08-27, 45:34

The Building Blocks of Agentic Systems with Harrison Chase - #698

2024-08-19, 59:30

Simplifying On-Device AI for Developers with Siddhika Nevrekar - #697

2024-08-12, 46:37

Genie: Generative Interactive Environments with Ashley Edwards - #696

2024-08-05, 47:05

Bridging the Sim2real Gap in Robotics with Marius Memmel - #695

2024-07-30, 57:35

Building Real-World LLM Products with Fine-Tuning and More with Hamel Husain - #694

2024-07-23, 01:20:19

Mamba, Mamba-2 and Post-Transformer Architectures for Generative AI with Albert Gu - #693

2024-07-17, 57:46

Decoding Animal Behavior to Train Robots with EgoPet with Amir Bar - #692

2024-07-09, 43:28

How Microsoft Scales Testing and Safety for Generative AI with Sarah Bird - #691

2024-07-01, 57:12

Long Context Language Models and their Biological Applications with Eric Nguyen - #690

2024-06-25, 45:11

Accelerating Sustainability with AI with Andres Ravinet - #689

2024-06-18, 48:01

Gen AI at the Edge: Qualcomm AI Research at CVPR 2024 with Fatih Porikli - #688

2024-06-10, 01:10:42

Energy Star Ratings for AI Models with Sasha Luccioni - #687

2024-06-03, 48:39

Language Understanding and LLMs with Christopher Manning - #686

2024-05-27, 56:22

Chronos: Learning the Language of Time Series with Abdul Fatir Ansari - #685

2024-05-20, 42:35

Powering AI with the World's Largest Computer Chip with Joel Hestness - #684

2024-05-13, 54:36

Sam Charrington