After LLMs: Spatial Intelligence and World Models — Fei-Fei Li & Justin Johnson, World Labs

Update: 2025-11-25

Description

Fei-Fei Li and Justin Johnson are cofounders of World Labs, which recently launched Marble (https://marble.worldlabs.ai/), a new kind of generative “world model” that can create editable 3D environments from text, images, and other spatial inputs. Marble lets creators generate persistent 3D worlds, precisely control cameras, and interactively edit scenes, making it a powerful tool for games, film, VR, robotics simulation, and more. In this episode, Fei-Fei and Justin share how their journey from ImageNet and Stanford research led to World Labs, why spatial intelligence is the next frontier after LLMs, and how world models could change how machines see, understand, and build in 3D.

We discuss:

The massive compute scaling from AlexNet to today and why world models and spatial data are the most compelling way to “soak up” modern GPU clusters compared to language alone.
What Marble actually is: a generative model of 3D worlds that turns text and images into editable scenes using Gaussian splats, supports precise camera control and recording, and runs interactively on phones, laptops, and VR headsets.
Fei-Fei’s essay (https://drfeifei.substack.com/p/from-words-to-worlds-spatial-intelligence) on spatial intelligence as a form of intelligence distinct from language: from picking up a mug to inferring the 3D structure of DNA, and why language is a lossy, low-bandwidth channel for describing the rich 3D/4D world we live in.
Whether current models “understand” physics or just fit patterns: the gap between predicting orbits and discovering F=ma, and how attaching physical properties to splats and distilling physics engines into neural networks could lead to genuine causal reasoning.
The changing role of academia in AI, why Fei-Fei worries more about under-resourced universities than “open vs closed,” and how initiatives like national AI compute clouds and open benchmarks can rebalance the ecosystem.
Why transformers are fundamentally set models, not sequence models, and how that perspective opens up new architectures for world models, especially as hardware shifts from single GPUs to massive distributed clusters.
Real use cases for Marble today: previsualization and VFX, game environments, virtual production, interior and architectural design (including kitchen remodels), and generating synthetic simulation worlds for training embodied agents and robots.
How spatial intelligence and language intelligence will work together in multimodal systems, and why the goal isn’t to throw away LLMs but to complement them with rich, embodied models of the world.
Fei-Fei and Justin’s long-term vision for spatial intelligence: from creative tools for artists and game devs to broader applications in science, medicine, and real-world decision-making.
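
To make the "set models, not sequence models" point above concrete, here is a minimal sketch (not code from the episode or from World Labs). It assumes a single attention head with identity query/key/value projections and no positional encoding; the function name `self_attention` and the toy sizes are illustrative choices. Under those assumptions, shuffling the input tokens simply shuffles the output rows in the same way, which is the sense in which self-attention treats its input as a set.

```python
import math
import random

def self_attention(tokens):
    """Single-head self-attention with identity projections and no
    positional encoding (a simplifying assumption for illustration)."""
    dim = len(tokens[0])
    outputs = []
    for query in tokens:
        # Scaled dot-product score of this query against every token.
        scores = [sum(a * b for a, b in zip(query, key)) / math.sqrt(dim)
                  for key in tokens]
        # Numerically stable softmax over the scores.
        top = max(scores)
        weights = [math.exp(s - top) for s in scores]
        total = sum(weights)
        # Output is the attention-weighted average of the tokens.
        outputs.append([sum(w / total * value[j] for w, value in zip(weights, tokens))
                        for j in range(dim)])
    return outputs

random.seed(0)
tokens = [[random.gauss(0, 1) for _ in range(4)] for _ in range(5)]

# Feed the same tokens in a different order.
perm = [3, 0, 4, 1, 2]
shuffled = [tokens[i] for i in perm]

out = self_attention(tokens)
out_shuffled = self_attention(shuffled)

# Permutation equivariance: row i of the shuffled output should match
# row perm[i] of the original output, up to floating-point rounding.
max_diff = max(abs(a - b)
               for i, p in enumerate(perm)
               for a, b in zip(out_shuffled[i], out[p]))
print("equivariant:", max_diff < 1e-9)
```

Order only enters a standard transformer through additions such as positional encodings or attention masks, which this sketch deliberately leaves out.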

—

Fei-Fei Li

X: https://x.com/drfeifei
LinkedIn: https://www.linkedin.com/in/fei-fei-li-4541247

Justin Johnson

X: https://x.com/jcjohnss
LinkedIn: https://www.linkedin.com/in/justin-johnson-41b43664

Where to find Latent Space

X: https://x.com/latentspacepod
Substack: https://www.latent.space/

Chapters

00:00:00 Introduction and the Fei-Fei Li & Justin Johnson Partnership
00:02:00 From ImageNet to World Models: The Evolution of Computer Vision
00:12:42 Dense Captioning and Early Vision-Language Work
00:19:57 Spatial Intelligence: Beyond Language Models
00:28:46 Introducing Marble: World Labs' First Spatial Intelligence Model
00:33:21 Gaussian Splats and the Technical Architecture of Marble
00:22:10 Physics, Dynamics, and the Future of World Models
00:41:09 Multimodality and the Interplay of Language and Space
00:37:37 Use Cases: From Creative Industries to Robotics and Embodied AI
00:56:58 Hiring, Research Directions, and the Future of World Labs

Duration: 01:00:38
