Designing Voicebots that Feel Human: Ecosmob’s Approach to Real-Time Conversational AI, Podcast
“If responses aren’t near real-time, the bot won’t feel human.” — Ruchir Brahmbhatt, Co-Founder & CTO, Ecosmob
Ruchir Brahmbhatt, Co-Founder and CTO of Ecosmob, joined Doug Green, Publisher of Technology Reseller News, to discuss the engineering behind human-like voicebots—where milliseconds make the difference between a smooth conversation and a frustrating one.
With more than 18 years in VoIP and AI/ML development, Ecosmob builds custom voicebots for MSPs, ITSPs, and UCaaS/CCaaS providers seeking real-time automation and compliance. Brahmbhatt outlined how Ecosmob’s architecture achieves sub-second latency through:
- Python async orchestration for thousands of concurrent sessions
- Redis in-memory queues for ultra-low-latency streaming
- NVIDIA Canary ASR and Kokoro TTS for fast, natural speech
- llama.cpp LLM engine with dynamic quantization for efficient processing
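The orchestration pattern in the list above can be sketched roughly as follows. This is a minimal illustration, not Ecosmob's implementation: the stage functions `transcribe`, `generate_reply`, and `synthesize` are hypothetical stubs standing in for the ASR, LLM, and TTS engines, and `asyncio.Queue` stands in for the Redis in-memory queues so the example runs with the standard library alone.

```python
import asyncio
import time

# Hypothetical stand-ins for the real engines (ASR, LLM, TTS). Each one only
# sleeps briefly to mimic a non-blocking call to a model server.
async def transcribe(audio_chunk: bytes) -> str:
    await asyncio.sleep(0.01)
    return audio_chunk.decode("utf-8")

async def generate_reply(text: str) -> str:
    await asyncio.sleep(0.01)
    return f"You said: {text}"

async def synthesize(text: str) -> bytes:
    await asyncio.sleep(0.01)
    return text.encode("utf-8")

async def handle_session(session_id: int,
                         inbound: asyncio.Queue,
                         outbound: asyncio.Queue) -> None:
    """Process one caller's audio chunks until a None sentinel arrives."""
    while True:
        chunk = await inbound.get()
        if chunk is None:  # sentinel: the caller hung up
            break
        started = time.perf_counter()
        text = await transcribe(chunk)
        reply = await generate_reply(text)
        audio = await synthesize(reply)
        latency_ms = (time.perf_counter() - started) * 1000
        await outbound.put((session_id, audio, latency_ms))

async def run_sessions(n_sessions: int) -> list:
    """Run n_sessions concurrently on one event loop and collect the replies."""
    outbound = asyncio.Queue()  # assumption: in-process queue instead of Redis
    inbounds = [asyncio.Queue() for _ in range(n_sessions)]
    for i, queue in enumerate(inbounds):
        queue.put_nowait(f"hello from caller {i}".encode("utf-8"))
        queue.put_nowait(None)
    await asyncio.gather(
        *(handle_session(i, q, outbound) for i, q in enumerate(inbounds))
    )
    results = []
    while not outbound.empty():
        results.append(outbound.get_nowait())
    return results

if __name__ == "__main__":
    replies = asyncio.run(run_sessions(100))
    worst = max(latency for _, _, latency in replies)
    print(f"{len(replies)} sessions handled, slowest turn {worst:.1f} ms")
```

Because every stage awaits instead of blocking, a single event loop can interleave many sessions while each waits on its own model call, which is the property that makes async orchestration a plausible fit for thousands of concurrent calls. In a real deployment the per-turn latency would be dominated by the actual ASR, LLM, and TTS engines rather than the stubbed sleeps used here.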
In a live healthcare demo, Ecosmob’s voicebot scheduled an appointment in natural, human-like dialogue—with total round-trip latency under 600 milliseconds.
Brahmbhatt emphasized that modern contact centers are shifting from IVRs to AI-driven self-service, and that on-premises and GDPR-compliant deployments are increasingly essential.
Learn more at ecosmob.com.