AF - QAPR 5: grokking is maybe not that big a deal? by Quintin Pope

Update: 2023-07-23

Description

Welcome to The Nonlinear Library, where we use Text-to-Speech software to convert the best writing from the Rationalist and EA communities into audio. This is: QAPR 5: grokking is maybe not that big a deal?, published by Quintin Pope on July 23, 2023 on The AI Alignment Forum.
[Thanks to support from Cavendish Labs and a Lightspeed grant, I've been able to restart the Quintin's Alignment Papers Roundup sequence.]
Introduction
Grokking refers to an observation by Power et al. (below) that models trained on simple modular arithmetic tasks would first overfit to their training data and achieve nearly perfect training loss, but that training well past the point of overfitting would eventually cause the models to generalize to unseen test data. The rest of this post discusses a number of recent papers on grokking.
Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets
In this paper we propose to study generalization of neural networks on small algorithmically generated datasets. In this setting, questions about data efficiency, memorization, generalization, and speed of learning can be studied in great detail. In some situations we show that neural networks learn through a process of "grokking" a pattern in the data, improving generalization performance from random chance level to perfect generalization, and that this improvement in generalization can happen well past the point of overfitting. We also study generalization as a function of dataset size and find that smaller datasets require increasing amounts of optimization for generalization. We argue that these datasets provide a fertile ground for studying a poorly understood aspect of deep learning: generalization of overparametrized neural networks beyond memorization of the finite training dataset.
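To make "small algorithmically generated datasets" concrete, here is a minimal sketch of one such task: modular addition, with every input pair enumerated and randomly split into train and test sets. The function name, the modulus `p=97`, and the 50% training fraction are illustrative assumptions, not details taken from this post.

```python
import random


def make_modular_addition_dataset(p=97, train_fraction=0.5, seed=0):
    """Enumerate every (a, b, (a + b) mod p) triple and split it at random.

    p, train_fraction and seed are illustrative assumptions.
    """
    # The full dataset is tiny: p * p examples, one per input pair.
    examples = [(a, b, (a + b) % p) for a in range(p) for b in range(p)]

    # Shuffle with a fixed seed so the split is reproducible.
    rng = random.Random(seed)
    rng.shuffle(examples)

    # The held-out pairs are never seen in training, so test accuracy
    # measures generalization rather than memorization.
    n_train = int(len(examples) * train_fraction)
    return examples[:n_train], examples[n_train:]


train, test = make_modular_addition_dataset()
print(len(train), len(test))
```

Because the whole input space is enumerable, a model can reach near-perfect training loss by memorizing the training pairs, which is what makes the later jump in test accuracy noticeable.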
My opinion:
When I first read this paper, I was very excited. It seemed like a pared-down / "minimal" example that could let us study the underlying mechanism behind neural network generalization. You can read more of my initial opinion on grokking in the post Hypothesis: gradient descent prefers general circuits.
I now think I was way too excited about this paper, that grokking is probably a not-particularly-important optimization artifact, and that grokking is no more connected to the "core" of deep learning generalization than, say, the fact that it's possible for deep learning to generalize from an MNIST training set to the testing set.
I also think that using the word "grokking" was anthropomorphizing and potentially misleading (like calling the adaptive information routing component of a transformer model its "attention"). Evocative names risk letting the connotations of the name filter into the analysis of the object being named. E.g.,
"Grokking" brings connotations of sudden realization, despite the fact that the grokking phase in the above plot starts within the first ~5% - 20% of the training process, though it appears much more abrupt due to the use of a base 10 logarithmic scale on the x-axis.
"Grokking" also brings connotations of insight, realization or improvement relative to some previously confused baseline. This leads to the impression that things which grok are better than things which don't.
Humans often use the word "grokking" to mean deeply understanding complex domains that actually matter in the real world. Using the same word in an ML context suggests that ML grokking is relevant to whatever mechanisms might let an ML system deeply understand complex domains that actually matter in the real world.
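The point about the logarithmic x-axis in the first item above can be checked with a little arithmetic. The sketch below assumes a hypothetical run of 10^6 optimization steps plotted on a base 10 log axis starting at step 1 (both numbers are assumptions chosen for illustration) and computes how far along the plotted axis a given fraction of training falls.

```python
import math


def log_axis_position(step, first_step, last_step):
    """Fraction of the way along a base 10 log axis at which `step` is drawn."""
    span = math.log10(last_step) - math.log10(first_step)
    return (math.log10(step) - math.log10(first_step)) / span


# Hypothetical run length, chosen only for illustration.
total_steps = 10**6

for fraction in (0.05, 0.20):
    step = fraction * total_steps
    position = log_axis_position(step, 1, total_steps)
    print(f"{fraction:.0%} of training is drawn {position:.0%} of the way along the axis")
```

Under these assumptions, the point 5% of the way through training is drawn roughly 78% of the way across the plot, so a change that begins early in training looks like a late, abrupt event.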
I've heard several people say things like:
Studying grokking could significantly advance ML capabilities, if doing so were to lead to a deeper understanding of the mechanisms underlying generalization in ML.
Training long enough could eventually result in grokking occurring in ML domains of actual relevance, such as language, and thereby lead to sudden capabilities gains or break alignment properties.
Grokking is an example of how thinking l...
