AISN #47: Reasoning Models

Update: 2025-02-06

Description

Plus, State-Sponsored AI Cyberattacks.

Listen to the AI Safety Newsletter for free on Spotify or Apple Podcasts.

Reasoning Models

DeepSeek-R1 has been one of the most significant model releases since ChatGPT. After its release, DeepSeek's app quickly rose to the top of Apple's most-downloaded chart, and NVIDIA's stock fell 17%. In this story, we cover DeepSeek-R1, OpenAI's o3-mini and Deep Research, and the policy implications of reasoning models.

DeepSeek-R1 is a frontier reasoning model. DeepSeek-R1 builds on the company's previous model, DeepSeek-V3, by adding reasoning capabilities through reinforcement learning. R1 exhibits frontier-level capabilities in mathematics, coding, and scientific reasoning, comparable to OpenAI's o1. DeepSeek-R1 also scored 9.4% on Humanity's Last Exam, which at the time of its release was the highest score of any publicly available system.

DeepSeek reports spending only about $6 million on the computing power needed to train V3—however, that number doesn’t include the full [...]

---

Outline:

(00:13) Reasoning Models

(04:58) State-Sponsored AI Cyberattacks

(06:51) Links

---

First published:

February 6th, 2025

Source:

https://newsletter.safe.ai/p/ai-safety-newsletter-47-reasoning

---

Want more? Check out our ML Safety Newsletter for technical safety research.

Narrated by TYPE III AUDIO.

---

Images from the article:

Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts, or another podcast app.

