DiscoverInfinite ML with Prateek JoshiVoice-to-Voice Foundation Models
Voice-to-Voice Foundation Models

Voice-to-Voice Foundation Models

Update: 2024-10-30
Share

Description

Alan Cowen is the cofounder and CEO of Hume, a company building voice-to-voice foundation models. They recently raised their $50M Series B from Union Square Ventures, Nat Friedman, Daniel Gross, and others.

Alan's favorite book: 1984 (Author: George Orwell)

(00:01 ) Introduction
(00:06 ) Defining Voice-to-Voice Foundation Models
(01:26 ) Historical Context: Handling Voice and Speech Understanding
(03:54 ) Emotion Detection in Voice AI Models
(04:33 ) Training Models to Recognize Human Emotion in Speech
(07:19 ) Cultural Variations in Emotional Expressions
(09:00 ) Semantic Space Theory in Emotion Recognition
(12:11 ) Limitations of Basic Emotion Categories
(15:50 ) Recognizing Blended Emotional States
(20:15 ) Objectivity in Emotion Science
(24:37 ) Practical Aspects of Deploying Voice AI Systems
(28:17 ) Real-Time System Constraints and Latency
(31:30 ) Advancements in Voice AI Models
(32:54 ) Rapid-Fire Round

--------
Where to find Prateek Joshi:

Newsletter: https://prateekjoshi.substack.com 
Website: https://prateekj.com 
LinkedIn: https://www.linkedin.com/in/prateek-joshi-91047b19 
Twitter: https://twitter.com/prateekvjoshi 

Comments 
00:00
00:00
x

0.5x

0.8x

1.0x

1.25x

1.5x

2.0x

3.0x

Sleep Timer

Off

End of Episode

5 Minutes

10 Minutes

15 Minutes

30 Minutes

45 Minutes

60 Minutes

120 Minutes

Voice-to-Voice Foundation Models

Voice-to-Voice Foundation Models

Prateek Joshi