Listen Top Shows Blog

The Science Behind AI Speech Recognition with Deepgram's Andrew Seagraves

The Science Behind AI Speech Recognition with Deepgram's Andrew Seagraves

Update: 2025-08-28

Share

Description

Deepgram's VP of Research Andrew Seagraves joins to explore the science and engineering behind modern speech recognition systems. Hermes and Andrew dive deep into why speech recognition isn't a solved problem, the two-stage training process of speech-to-text models, and the challenges of balancing real-time latency with accuracy. The conversation covers Deepgram's origins from dark matter research, power laws in speech data, buffer-based architectures for real-time transcription, and frontier challenges like multilingual code-switching, emotion detection, and conversational dynamics. Andrew shares insights on model deployment, customer use cases from NASA to food ordering, and the future of self-adapting speech models.

Check out video episodes and subscribe to the Convo AI Newsletter at podcast.convoai.world

Comments

In Channel

From Code to Cosmos: The First AI Astrologer

From Code to Cosmos: The First AI Astrologer

2025-12-1701:03:34

Meet Fuzozo: The Pocket-Sized Robot Ending the Loneliness Crisis

Meet Fuzozo: The Pocket-Sized Robot Ending the Loneliness Crisis

2025-12-1118:50

Agnes AI: This is How You Win Southeast Asia

Agnes AI: This is How You Win Southeast Asia

2025-12-0354:15

Reimagining the Future of Learning through Conversational AI with Physics Wallah's Supreet Singh

Reimagining the Future of Learning through Conversational AI with Physics Wallah's Supreet Singh

2025-11-1934:32

Relatability Over Perfection in Voice AI with Rime's Lily Clifford

Relatability Over Perfection in Voice AI with Rime's Lily Clifford

2025-11-1254:24

Humanizing Learning in the Age of AI with Colabery's Ram Katamaraja

Humanizing Learning in the Age of AI with Colabery's Ram Katamaraja

2025-10-2947:38

Redefining Live Entertainment and the Creator Economy with Eloelo's Sagar Gaonkar

Redefining Live Entertainment and the Creator Economy with Eloelo's Sagar Gaonkar

2025-10-2238:35

Real-Time Avatars, Translation, and Visual Storytelling with Akool's Jeff Lu

Real-Time Avatars, Translation, and Visual Storytelling with Akool's Jeff Lu

2025-10-1539:14

AI at the Edge: 6G, Arabic LLMs & the Middle East’s AI Leap with Mérouane Debbah

AI at the Edge: 6G, Arabic LLMs & the Middle East’s AI Leap with Mérouane Debbah

2025-10-0839:39

The Voice AI and VR Revolution in Heavy Machinery with Carbon Origins' Amogha

The Voice AI and VR Revolution in Heavy Machinery with Carbon Origins' Amogha

2025-10-0151:12

Open-Source Voice Activity Detection with TEN Framework's Ziyi Lin

Open-Source Voice Activity Detection with TEN Framework's Ziyi Lin

2025-09-2433:38

Building AI Community with Voice AI Space

Building AI Community with Voice AI Space

2025-09-1040:42

The Science Behind AI Speech Recognition with Deepgram's Andrew Seagraves

The Science Behind AI Speech Recognition with Deepgram's Andrew Seagraves

2025-08-2801:06:02

AI Content Moderation with Google's Ninny Wan

AI Content Moderation with Google's Ninny Wan

2025-08-1337:01

Interactive Digital Avatars with Trulience's Richard Bowdler

Interactive Digital Avatars with Trulience's Richard Bowdler

2025-07-3044:29

Real-Time Translation with Palabra's Artem Kukharenko and Ivan Kuzin

Real-Time Translation with Palabra's Artem Kukharenko and Ivan Kuzin

2025-07-1530:47

Introduction to Conversational AI with Agora's Ben Weekes

Introduction to Conversational AI with Agora's Ben Weekes

2025-07-1131:08

00:00

00:00

x

The Science Behind AI Speech Recognition with Deepgram's Andrew Seagraves

The Science Behind AI Speech Recognition with Deepgram's Andrew Seagraves

Agora