This Machine Kills

Patreon Preview – 358. *Se7en Voice* What’s in the Benchmark?

Update: 2024-08-07
Description

We first get an update on regulatory arbitrage in the weed vape industry, then discuss how the benchmarks used to rank AI models—and make claims about their "intelligence" relative to humans—are largely low quality, out-of-date, not fit for purpose, or just meaningless and deceptive. Yet they are widely treated by industry as authoritative standards. Then we talk a bit about yet another case of a risk scoring algorithm resulting in devastating consequences.

••• Everyone Is Judging AI by These Tests. But Experts Say They’re Close to Meaningless https://themarkup.org/artificial-intelligence/2024/07/17/everyone-is-judging-ai-by-these-tests-but-experts-say-theyre-close-to-meaningless
••• An Algorithm Told Police She Was Safe. Then Her Husband Killed Her. https://www.nytimes.com/interactive/2024/07/18/technology/spain-domestic-violence-viogen-algorithm.html

Subscribe to hear more analysis and commentary in our premium episodes every week! https://www.patreon.com/thismachinekills

Hosted by Jathan Sadowski (www.twitter.com/jathansadowski) and Edward Ongweso Jr. (www.twitter.com/bigblackjacobin). Production / Music by Jereme Brown (www.twitter.com/braunestahl)