“Announcing Gemma Scope 2” by CallumMcDougall

Update: 2025-12-22

Description

TLDR

DeepMind LMI is releasing Gemma Scope 2: a suite of SAEs & transcoders trained on the Gemma 3 model family
- The Neuronpedia demo, the weights on HuggingFace, and the Colab notebook tutorial are all linked from the original post [1]
Key features of this relative to the previous Gemma Scope release:
- More advanced model family (V3 rather than V2) should enable analysis of more complex forms of behaviour
- More comprehensive release (SAEs on every layer, for all models up to size 27B, plus multi-layer models like crosscoders and CLTs)
- More focus on chat models (every SAE trained on a PT model has a corresponding version finetuned for IT models)
Although we've deprioritized fundamental research on tools like SAEs (see reasoning here), we still hope these will serve as a useful tool for the community

Some example latents

Here are some example latents taken from the residual stream SAEs for Gemma V3 27B IT.

- Layer 53, feature 50705
- Layer 31, feature 23266
- Layer 53, feature 57326
- Layer 53, feature 2878
- Layer 53, feature 57326

What the release contains

This release contains SAEs trained on 3 different sites (residual stream, MLP output and attention output) as well as MLP transcoders (both with and without affine skip [...]
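To make the difference between these model types concrete, here is a minimal sketch in plain Python. It is not the Gemma Scope 2 implementation: the class name `SparseDictionary`, the ReLU activation, the random initialisation, and the toy dimensions are all assumptions chosen for illustration. The idea it shows is that an SAE encodes an activation into latents and reconstructs the same activation, while a transcoder uses the same encode/decode shape to predict a different activation (for example MLP output from MLP input), optionally adding a skip term computed directly from the input.

```python
import random


def matvec(W, x):
    """Multiply matrix W (a list of rows) by vector x."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in W]


def add(a, b):
    """Elementwise sum of two equal-length vectors."""
    return [ai + bi for ai, bi in zip(a, b)]


def relu(v):
    return [max(0.0, vi) for vi in v]


def rand_matrix(rows, cols, rng):
    return [[rng.gauss(0.0, 0.1) for _ in range(cols)] for _ in range(rows)]


class SparseDictionary:
    """Toy encoder/decoder pair (hypothetical, for illustration only).

    Used as an SAE, the target is the input itself.
    Used as a transcoder, the target is a different activation,
    and use_skip=True adds a linear map from input to output.
    """

    def __init__(self, d_in, d_latent, d_out, use_skip=False, seed=0):
        rng = random.Random(seed)
        self.W_enc = rand_matrix(d_latent, d_in, rng)
        self.b_enc = [0.0] * d_latent
        self.W_dec = rand_matrix(d_out, d_latent, rng)
        self.b_dec = [0.0] * d_out
        # The skip weights plus b_dec together form the affine skip path.
        self.W_skip = rand_matrix(d_out, d_in, rng) if use_skip else None

    def encode(self, x):
        # ReLU is a stand-in; the release's actual activation is not stated here.
        return relu(add(matvec(self.W_enc, x), self.b_enc))

    def forward(self, x):
        latents = self.encode(x)
        out = add(matvec(self.W_dec, latents), self.b_dec)
        if self.W_skip is not None:
            out = add(out, matvec(self.W_skip, x))
        return latents, out


x = [0.5, -1.0, 0.25, 2.0]                              # toy activation vector
sae = SparseDictionary(4, 16, 4)                        # reconstructs x
transcoder = SparseDictionary(4, 16, 4, use_skip=True)  # predicts another site
latents, recon = sae.forward(x)
_, predicted = transcoder.forward(x)
```

In both cases the latents are the object of interest for interpretability work: they are non-negative and, after training with a sparsity penalty, mostly zero on any given input.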

---

Outline:

(00:10) TLDR

(01:09) Some example latents

(01:51) What the release contains

(03:34) Which ones should you use?

(04:56) Some useful links

The original text contained 1 footnote which was omitted from this narration.

---

First published:

December 22nd, 2025

Source:

https://www.lesswrong.com/posts/YQro5LyYjDzZrBCdb/announcing-gemma-scope-2

---

Narrated by TYPE III AUDIO.

