Track: WebRTC and Real-Time Applications |
Voice and Conversational AI In Production with RTT less than 300ms |
Large Language Models (LLMs) are transforming voice interactions, enabling multi-turn conversations that are both engaging and practical. In this talk, we’ll explore how we at Daily integrate WebRTC with LLMs for real-time voice-to-voice communication using our open standard, RTVI-AI. RTVI-AI defines real-time APIs for applications such as voice chats with LLMs, enterprise workflows in healthcare, video avatars, and voice-driven user interfaces. Daily's open-source voice engine combines speech-to-text, LLMs, and text-to-speech and is optimized for low latency, which currently stands at 500ms with a target of 300ms. The presentation will demonstrate how we use these technologies to build seamless, real-time voice interactions for a range of use cases. |
|