Data at scale: Shaping the AI landscape with Scale AI

Update: 2024-11-21

Description

In this episode of Get the Check, the hosts discuss Scale AI's role in the AI ecosystem. With AI models requiring high-quality, labeled data to perform at their best, Scale AI has positioned itself at the forefront of creating quality data. The hosts explore Scale's journey, from using contractors to scale its data labeling operations to long-standing partnerships with the DoD and OpenAI.

The hosts break down the three pillars of AI—compute, data, and algorithms—and take a closer look at Scale’s history, its innovative products, and the controversies surrounding labor in data labeling. They dive into training data such as input-output pairs, reinforcement learning from human feedback (RLHF), and the workflow data needed to power the shift from generative AI to agentic AI. Additionally, they touch on Scale’s new offerings, including expert data labeling, ML ops for enterprises, and the defense-focused LLAMA model, a collaboration with Meta to power U.S. military AI capabilities.

Tune in for insights on how Scale AI is leveraging human expertise to create high-quality datasets that power everything from autonomous vehicles to defense technologies. You can follow @getthecheckpod on all socials. Stay tuned for next week’s episode on Scale AI!

00:00 Pillars of AI

01:30 What is data labeling?

04:46 Can synthetic data replace real data?

07:37 Classes of data

12:04 Founding story

18:39 Scale Donovan

22:33 Using private data for agentic AI

27:19 SEAL LLM Leaderboard

30:30 Expert Match

32:34 Hiring at Scale AI

38:02 Restaurant pick of the week

Comments

Top Podcasts

The Best New Comedy Podcast Right Now – June 2024 The Best News Podcast Right Now – June 2024 The Best New Business Podcast Right Now – June 2024 The Best New Sports Podcast Right Now – June 2024 The Best New True Crime Podcast Right Now – June 2024 The Best New Joe Rogan Experience Podcast Right Now – June 20 The Best New Dan Bongino Show Podcast Right Now – June 20 The Best New Mark Levin Podcast – June 2024

In Channel

2024 tech highlights: Cyberwarfare, Google’s potential breakup, Databricks acquires Tabular, and OpenAI’s evolution

2024-12-1945:17

Coining the future: Coinbase's crypto ecosystem

2024-12-1257:09

From data to dominance: Palantir’s playbook for modern intelligence

2024-12-0538:35

Data at scale: Shaping the AI landscape with Scale AI

2024-11-2140:29

Playing cupid: Match Group’s reign over dating in the digital age

2024-11-1436:36

It’s not rocket science: How SpaceX took off

2024-11-0735:50

Betting on the ballot: Prediction markets on Kalshi

2024-10-3138:59

Enemies to lovers: The Uber and Waymo partnership

2024-10-2430:12

Periods to profit: How Flo hit a billion dollars

2024-10-1728:30

Pilot Episode: BILT for success?

2024-10-0830:26

00:00

Data at scale: Shaping the AI landscape with Scale AI

Anika, Maya, Priya, Vidushi

#box-pro-ellipsis-173510462075231{-webkit-line-clamp:2;}Data at scale: Shaping the AI landscape with Scale AI

Data at scale: Shaping the AI landscape with Scale AI

Anika, Maya, Priya, Vidushi

Data at scale: Shaping the AI landscape with Scale AI