Dan Hendrycks on Catastrophic AI Risks

Update: 2023-11-03

Description

Dan Hendrycks joins the podcast again to discuss X.ai, how AI risk thinking has evolved, malicious use of AI, AI race dynamics between companies and between militaries, making AI organizations safer, and how representation engineering could help us understand AI traits like deception. You can learn more about Dan's work at https://www.safe.ai

Timestamps:
00:00 X.ai - Elon Musk's new AI venture
02:41 How AI risk thinking has evolved
12:58 AI bioengeneering
19:16 AI agents
24:55 Preventing autocracy
34:11 AI race - corporations and militaries
48:04 Bulletproofing AI organizations
1:07:51 Open-source models
1:15:35 Dan's textbook on AI safety
1:22:58 Rogue AI
1:28:09 LLMs and value specification
1:33:14 AI goal drift
1:41:10 Power-seeking AI
1:52:07 AI deception
1:57:53 Representation engineering

Comments

Top Podcasts

The Best New Comedy Podcast Right Now – June 2024 The Best News Podcast Right Now – June 2024 The Best New Business Podcast Right Now – June 2024 The Best New Sports Podcast Right Now – June 2024 The Best New True Crime Podcast Right Now – June 2024 The Best New Joe Rogan Experience Podcast Right Now – June 20 The Best New Dan Bongino Show Podcast Right Now – June 20 The Best New Mark Levin Podcast – June 2024

In Channel

How Close Are We to AGI? Inside Epoch's GATE Model (with Ege Erdil)

2025-03-2801:34:33

Special: Defeating AI Defenses (with Nicholas Carlini and Nathan Labenz)

2025-03-2102:23:12

Keep the Future Human (with Anthony Aguirre)

2025-03-1301:21:03

We Created AI. Why Don't We Understand It? (with Samir Varma)

2025-03-0601:16:15

Why AIs Misbehave and How We Could Lose Control (with Jeffrey Ladish)

2025-02-2701:22:33

Ann Pace on using Biobanking and Genomic Sequencing to Conserve Biodiversity

2025-02-1446:09

Michael Baggot on Superintelligence and Transhumanism from a Catholic Perspective

2025-01-2401:25:56

David Dalrymple on Safeguarded, Transformative AI

2025-01-0901:40:06

Nick Allardice on Using AI to Optimize Cash Transfers and Predict Disasters

2024-12-1901:09:26

Nathan Labenz on the State of AI and Progress since GPT-4

2024-12-0503:20:04

Connor Leahy on Why Humanity Risks Extinction from AGI

2024-11-2201:58:50

Suzy Shepherd on Imagining Superintelligence and "Writing Doom"

2024-11-0801:03:08

Andrea Miotti on a Narrow Path to Safe, Transformative AI

2024-10-2501:28:09

Tamay Besiroglu on AI in 2030: Scaling, Automation, and AI Agents

2024-10-1101:30:29

Ryan Greenblatt on AI Control, Timelines, and Slowing Down Around Human-Level AI

2024-09-2702:08:44

Tom Barnes on How to Build a Resilient World

2024-09-1201:19:41

Samuel Hammond on why AI Progress is Accelerating - and how Governments Should Respond

2024-08-2202:16:11

Anousheh Ansari on Innovation Prizes for Space, AI, Quantum Computing, and Carbon Removal

2024-08-0901:03:10

Mary Robinson (Former President of Ireland) on Long-View Leadership

2024-07-2530:01

Emilia Javorsky on how AI Concentrates Power

2024-07-1101:03:35

00:00

1.0x

Dan Hendrycks on Catastrophic AI Risks

Gus Docker

We and our partners use cookies to personalize your experience, to show you ads based on your interests, and for measurement and analytics purposes. By using our website and our services, you agree to our use of cookies as described in our Cookie Policy.

#box-pro-ellipsis-174368875746245{-webkit-line-clamp:2;}Dan Hendrycks on Catastrophic AI Risks

Dan Hendrycks on Catastrophic AI Risks

Gus Docker

Dan Hendrycks on Catastrophic AI Risks