DiscoverAI Frontiers“How AI Can Degrade Human Performance in High-Stakes Settings” by Dane A. Morey, Mike Rayo, David Woods
“How AI Can Degrade Human Performance in High-Stakes Settings” by Dane A. Morey, Mike Rayo, David Woods

“How AI Can Degrade Human Performance in High-Stakes Settings” by Dane A. Morey, Mike Rayo, David Woods

Update: 2025-07-16
Share

Description

Last week, the AI nonprofit METR published an in-depth study on human-AI collaboration that stunned experts. It found that software developers with access to AI tools took 19% longer to complete their tasks, despite believing they had finished 20% faster. The findings shed important light on our ability to predict how AI capabilities interact with human skills.

Since 2020, we have been conducting similar studies on human-AI collaboration, but in contexts with much higher stakes than software development. Alarmingly, in these safety-critical settings, we found that access to AI tools can cause humans to perform much, much worse.

A 19% slowdown in software development can eat into profits. Reduced performance in safety-critical settings can cost lives.

Safety-Critical Scenarios

Imagine that you’re aboard a passenger jet on its final approach into San Francisco. Everything seems ready for a smooth landing — until an AI-infused weather monitor misses a sudden microburst. [...]

---

Outline:

(01:04 ) Safety-Critical Scenarios

(03:30 ) How Current Safety Frameworks Fail

(05:43 ) AI Influences Humans to Perform Slightly Better... or Much, Much Worse

(08:50 ) A Clear Pattern in Human-AI Collaboration

(09:49 ) Three Rules for Better Evaluations

(11:43 ) Faster, Easier, and Earlier Evaluations

(13:48 ) Toward Responsible Deployments of AI

---


First published:

July 16th, 2025



Source:

https://aifrontiersmedia.substack.com/p/how-ai-can-degrade-human-performance


---


Narrated by TYPE III AUDIO.


---

Images from the article:

Impact of AI-augmentation shown as the percentage change in nurses’ concern relative to how well they distinguished emergency from non-emergency patients without AI. Positive values reflect improved judgment; negative values reflect reduced judgment. Source: npj Digital Medicine

Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts, or another podcast app.

Comments 
In Channel
loading
00:00
00:00
x

0.5x

0.8x

1.0x

1.25x

1.5x

2.0x

3.0x

Sleep Timer

Off

End of Episode

5 Minutes

10 Minutes

15 Minutes

30 Minutes

45 Minutes

60 Minutes

120 Minutes

“How AI Can Degrade Human Performance in High-Stakes Settings” by Dane A. Morey, Mike Rayo, David Woods

“How AI Can Degrade Human Performance in High-Stakes Settings” by Dane A. Morey, Mike Rayo, David Woods