Mixture-of-Recursions (MoR)

Updated: 2025-07-18
Description

Mixture-of-Recursions (MoR) is a unified framework built on a Recursive Transformer architecture and designed to make large language models more efficient. It combines three core paradigms: parameter sharing (reusing a shared stack of layers across recursion steps), adaptive computation (dynamically assigning different recursion depths to individual tokens via lightweight routers), and efficient key-value (KV) caching (selectively storing or sharing KV pairs across recursion steps). Together, these let MoR deliver large-model quality with significantly lower compute and memory overhead during both training and inference.
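
Since no code accompanies this episode, here is a minimal sketch of the idea in PyTorch. It is a toy illustration under stated assumptions, not the paper's implementation: the names (SharedBlock, MoRSketch), the single shared block, and the sigmoid-threshold exit rule are illustrative stand-ins for MoR's actual routing schemes.

import torch
import torch.nn as nn

class SharedBlock(nn.Module):
    # One transformer block whose weights are reused at every recursion
    # step (the "parameter sharing" paradigm).
    def __init__(self, d_model: int, n_heads: int):
        super().__init__()
        self.attn = nn.MultiheadAttention(d_model, n_heads, batch_first=True)
        self.norm1 = nn.LayerNorm(d_model)
        self.ff = nn.Sequential(nn.Linear(d_model, 4 * d_model), nn.GELU(),
                                nn.Linear(4 * d_model, d_model))
        self.norm2 = nn.LayerNorm(d_model)

    def forward(self, x):
        h = self.norm1(x)
        a, _ = self.attn(h, h, h, need_weights=False)
        x = x + a
        return x + self.ff(self.norm2(x))

class MoRSketch(nn.Module):
    # Applies the shared block up to max_recursions times; a lightweight
    # linear router decides, per token, whether to take another step
    # (the "adaptive computation" paradigm).
    def __init__(self, d_model=256, n_heads=4, max_recursions=3):
        super().__init__()
        self.block = SharedBlock(d_model, n_heads)
        self.router = nn.Linear(d_model, 1)
        self.max_recursions = max_recursions

    def forward(self, x):
        # active[b, t] is True while token t is still being refined.
        active = torch.ones(x.shape[:2], dtype=torch.bool, device=x.device)
        for _ in range(self.max_recursions):
            y = self.block(x)
            # Tokens the router has already exited keep their current state.
            x = torch.where(active.unsqueeze(-1), y, x)
            # Sigmoid score in (0, 1); tokens scoring <= 0.5 stop recursing.
            # This threshold rule is a simplification of MoR's routers.
            p = torch.sigmoid(self.router(x)).squeeze(-1)
            active = active & (p > 0.5)
            if not active.any():
                break
        return x

x = torch.randn(2, 16, 256)   # (batch, sequence, d_model)
print(MoRSketch()(x).shape)   # torch.Size([2, 16, 256])

For clarity this toy version still runs every token through the block and merely masks the result, so it saves no compute; a real implementation would gather only the still-active tokens into each recursion step and, matching the third paradigm, cache KV pairs only for those tokens.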

In Channel
Kimi K2
2025-07-22 15:30

MeanFlow
2025-07-10 06:47

Mamba
2025-07-10 08:14

LLM Alignment
2025-06-14 20:06

Why We Think
2025-05-20 14:20

Deep Research
2025-05-12 11:35

vLLM
2025-05-04 13:06

DeepSeek-Prover-V2
2025-05-01 11:04

DeepSeek-Prover
2025-05-01 08:37

Agent AI Overview
2025-03-17 21:06

FlashAttention-3
2025-03-07 13:43

FlashAttention-2
2025-03-05 10:50

FlashAttention
2025-03-05 10:55


AI-Talk