Representation-Based Exploration for Language Models: From Test-Time to Post-Training

Update: 2025-10-18
Description

This paper investigates whether deliberate exploration enhances the reasoning capabilities of large language models (LLMs) trained with reinforcement learning (RL). The authors propose and evaluate a representation-based exploration (RepExp) strategy, which uses a bonus derived from the LLM's hidden states to encourage the discovery of diverse and novel behaviors. The study employs a two-pronged evaluation methodology: it first tests RepExp in an inference-time setting for selecting diverse responses, and then integrates it into the RL post-training pipeline. Key findings indicate that this exploration method significantly improves verifier efficiency and mitigates the "diversity collapse" observed in standard RL methods, suggesting that the approach goes beyond merely sharpening existing model capabilities. The results show that RepExp yields substantial improvements in pass@k rates and is especially beneficial for stronger models and harder problems on reasoning benchmarks such as MATH and GSM8K.
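
To make the inference-time use case concrete, the sketch below shows one way a hidden-state-derived exploration bonus can drive diverse response selection. It is an illustrative approximation, not the paper's exact formulation: the mean-pooled final-layer representation, the elliptical (inverse-covariance) bonus, the `select_diverse` helper, and the placeholder model name are all assumptions made for this example.

```python
# Minimal sketch: representation-based selection of diverse responses.
# Assumptions (not from the paper): mean-pooled last-layer hidden states as the
# representation, and an elliptical novelty bonus sqrt(phi^T Sigma^{-1} phi).

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

MODEL_NAME = "gpt2"  # placeholder; any causal LM exposing hidden states works

tokenizer = AutoTokenizer.from_pretrained(MODEL_NAME)
model = AutoModelForCausalLM.from_pretrained(MODEL_NAME)
model.eval()


def response_representation(text: str) -> torch.Tensor:
    """Mean-pool the final-layer hidden states of a candidate response."""
    inputs = tokenizer(text, return_tensors="pt")
    with torch.no_grad():
        out = model(**inputs, output_hidden_states=True)
    last_hidden = out.hidden_states[-1]        # shape (1, seq_len, d)
    return last_hidden.mean(dim=1).squeeze(0)  # shape (d,)


def select_diverse(candidates: list[str], k: int, ridge: float = 1.0) -> list[str]:
    """Greedily pick k responses with the largest representation-space novelty.

    bonus(phi) = sqrt(phi^T Sigma^{-1} phi), where Sigma accumulates the outer
    products of representations already selected, so candidates far from the
    covered subspace receive a larger bonus.
    """
    reps = [response_representation(c) for c in candidates]
    d = reps[0].shape[0]
    sigma = ridge * torch.eye(d)
    chosen: list[int] = []
    for _ in range(min(k, len(candidates))):
        sigma_inv = torch.linalg.inv(sigma)
        bonuses = [
            float("-inf") if i in chosen
            else torch.sqrt(phi @ sigma_inv @ phi).item()
            for i, phi in enumerate(reps)
        ]
        best = max(range(len(candidates)), key=lambda i: bonuses[i])
        chosen.append(best)
        phi = reps[best]
        sigma = sigma + torch.outer(phi, phi)  # expand covered representation space
    return [candidates[i] for i in chosen]
```

In a post-training setting, the same kind of bonus could be added to the verifier reward during RL updates, but the specific weighting and integration details should be taken from the paper itself.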

Enoch H. Kang