Provably Learning from Language Feedback
Description
This paper introduces a formal framework called Learning from Language Feedback (LLF), which addresses the challenge of training AI agents, particularly large language models (LLMs), with rich natural-language critiques and guidance rather than traditional scalar rewards. The authors formalize the LLF problem and introduce the transfer eluder dimension, a complexity measure that quantifies how effectively language feedback reduces uncertainty about the latent reward, and they exhibit cases where learning from language feedback is exponentially faster than learning from rewards alone.