DiscoverConvo AI WorldOpen-Source Voice Activity Detection with TEN Framework's Ziyi Lin
Open-Source Voice Activity Detection with TEN Framework's Ziyi Lin

Open-Source Voice Activity Detection with TEN Framework's Ziyi Lin

Update: 2025-09-24
Share

Description

Ziyi Lin, speech engineer on the TEN Framework team, joins the Convo AI World podcast to explore the design and impact of a new open-source Voice Activity Detection (VAD) model. The episode explores the challenges faced with existing VAD solutions, the importance of high-quality training data, and the design choices that led to improved performance metrics. Ziyi explains how VAD functions as a critical component in conversational AI, managing real-time processing and latency, and the advantages of deploying it on edge devices.

Check out video episodes and subscribe to the Convo AI Newsletter at podcast.convoai.world
Comments 
In Channel
loading
00:00
00:00
x

0.5x

0.8x

1.0x

1.25x

1.5x

2.0x

3.0x

Sleep Timer

Off

End of Episode

5 Minutes

10 Minutes

15 Minutes

30 Minutes

45 Minutes

60 Minutes

120 Minutes

Open-Source Voice Activity Detection with TEN Framework's Ziyi Lin

Open-Source Voice Activity Detection with TEN Framework's Ziyi Lin

Agora