DeepSeek-Prover-V2

DeepSeek-Prover-V2

Update: 2025-05-01
Share

Description

DeepSeek-Prover-V2 is an open-source large language model designed for formal theorem proving in Lean 4. Its training relies heavily on synthetic data, generated by using DeepSeek-V3 to decompose problems into subgoals, which are then recursively solved by a smaller 7B prover model. The model uses a two-stage training process, including supervised fine-tuning and reinforcement learning (GRPO), to bridge informal reasoning with formal proofs. It achieves state-of-the-art performance, particularly with its high-precision Chain-of-Thought mode.

Comments 
loading
In Channel
Kimi K2

Kimi K2

2025-07-2215:30

MeanFlow

MeanFlow

2025-07-1006:47

Mamba

Mamba

2025-07-1008:14

LLM Alignment

LLM Alignment

2025-06-1420:06

Why We Think

Why We Think

2025-05-2014:20

Deep Research

Deep Research

2025-05-1211:35

vLLM

vLLM

2025-05-0413:06

DeepSeek-Prover-V2

DeepSeek-Prover-V2

2025-05-0111:04

DeepSeek-Prover

DeepSeek-Prover

2025-05-0108:37

Agent AI Overview

Agent AI Overview

2025-03-1721:06

FlashAttention-3

FlashAttention-3

2025-03-0713:43

FlashAttention-2

FlashAttention-2

2025-03-0510:50

FlashAttention

FlashAttention

2025-03-0510:55

loading
00:00
00:00
1.0x

0.5x

0.8x

1.0x

1.25x

1.5x

2.0x

3.0x

Sleep Timer

Off

End of Episode

5 Minutes

10 Minutes

15 Minutes

30 Minutes

45 Minutes

60 Minutes

120 Minutes

DeepSeek-Prover-V2

DeepSeek-Prover-V2

AI-Talk