Information Theory for Language Models: Jack Morris

Update: 2025-07-02

Description

Our last AI PhD grad student feature was Shunyu Yao, who happened to focus on Language Agents for his thesis and immediately went to work on them for OpenAI. Our pick this year is Jack Morris, who bucks the “hot” trends by -not- working on agents, benchmarks, or VS Code forks, but is rather known for his work on the information theoretic understanding of LLMs, starting from embedding models and latent space representations (always close to our heart).

Jack is an unusual combination of doing underrated research but somehow still being to explain them well to a mass audience, so we felt this was a good opportunity to do a different kind of episode going through the greatest hits of a high profile AI PhD, and relate them to questions from AI Engineering.

Papers and References made

AI grad school: https://x.com/jxmnop/status/1933884519557353716
A new type of information theory: https://x.com/jxmnop/status/1904238408899101014
Embeddings
- Text Embeddings Reveal (Almost) As Much As Text: https://arxiv.org/abs/2310.06816
- Contextual document embeddings https://arxiv.org/abs/2410.02525
  Harnessing the Universal Geometry of Embeddings: https://arxiv.org/abs/2505.12540
Language models
- GPT-style language models memorize 3.6 bits per param: https://x.com/jxmnop/status/1929903028372459909
- Approximating Language Model Training Data from Weights: https://arxiv.org/abs/2506.15553
  - https://x.com/jxmnop/status/1936044666371146076
LLM Inversion
"There Are No New Ideas In AI.... Only New Datasets"
- https://x.com/jxmnop/status/1910087098570338756
- https://blog.jxmo.io/p/there-are-no-new-ideas-in-ai-only
misc reference: https://junyanz.github.io/CycleGAN/

—

for others hiring AI PhDs, Jack also wanted to shout out his coauthor

Zach Nussbaum, his coauthor on Nomic Embed: Training a Reproducible Long Context Text Embedder.

Comments

In Channel

Why RL Won — Kyle Corbitt, OpenPipe (acq. CoreWeave)

2025-10-1601:08:22

DevDay 2025: Apps SDK, Agent Kit, MCP, Codex and why Prompting is More Important than Ever

2025-10-0745:07

Taste is your Moat (Dylan Field of Figma)

2025-10-0201:01:42

Amp: The Emperor Has No Clothes

2025-09-2501:20:12

Context Engineering for Agents - Lance Martin, LangChain

2025-09-1157:32

A Technical History of Generative Media

2025-09-0501:01:09

Better Data is All You Need — Ari Morcos, Datology

2025-08-2901:18:42

Long Live Context Engineering - with Jeff Huber of Chroma

2025-08-1957:00

Greg Brockman on OpenAI's Road to AGI

2025-08-1501:08:36

The RLVR Revolution — with Nathan Lambert (AI2, Interconnects.ai)

2025-07-3101:18:59

AI is Eating Search

2025-07-2356:21

Cline: the open source coding agent that doesn't cut costs

2025-07-16--:--

Personalized AI Language Education — with Andrew Hsu, Speak

2025-07-1101:04:09

AI Video Is Eating The World — Olivia and Justine Moore, a16z

2025-07-09--:--

Information Theory for Language Models: Jack Morris

2025-07-02--:--

Scaling Test Time Compute to Multi-Agent Civilizations — Noam Brown, OpenAI

2025-06-1901:17:46

The Shape of Compute (Chris Lattner of Modular)

2025-06-1301:18:17

The Utility of Interpretability — Emmanuel Amiesen

2025-06-0601:53:01

[AIEWF Preview] Containing Agent Chaos — Solomon Hykes

2025-06-0327:13

[AIEWF Preview] Gemini in 2025 and Realtime Voice AI

2025-06-0224:28

00:00

Information Theory for Language Models: Jack Morris

#box-pro-ellipsis-176113518324170{-webkit-line-clamp:2;}Information Theory for Language Models: Jack Morris

Information Theory for Language Models: Jack Morris

Information Theory for Language Models: Jack Morris