While everyone focuses on the "brains" of AI, Russ d'Sa, CEO of LiveKit is solving the critical infrastructure that lets AI systems actually communicate with humans. In this conversation, we dive into the real-time protocols powering voice AI experiences like ChatGPT and Character AI.
Russ explains why voice interaction feels unnatural today and the technical challenges behind "turn detection" - knowing when to speak, listen, or interrupt in a conversation. We explore why milliseconds matter in voice AI, how adding visual context improves conversational dynamics, and the future split between AI "copilots" for creative work versus "autopilots" for routine tasks. Russ also shares candid insights on finding product-market fit before raising capital.
If you're interested in the infrastructure making AI feel more human or the future of human-machine interaction, this conversation offers a rare look behind the curtain. Listen to the full episode to understand what it takes to build the nervous system for AI.
Looking for more tech, data and venture capital intel? Head to WorldofDaaS.com for our podcast, newsletter and events.
We post videos every week on all things tech, data, and venture capital, so make sure you subscribe, and follow @WorldOfDaaS
You can find Auren Hoffman on X at @auren and Russ d’Sa on X at @dsa.