Pipeline Overview
1. Your Voice Agent Pipeline
You manage media transport, turn detection, STT, LLM, and TTS components
2. Beyond Presence Speech-to-Video API
Receives audio input from your pipeline
3. Avatar Video Output
Beyond Presence manages avatar generation and video streaming
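The division of responsibility in the three steps above can be sketched in a few lines of Python. This is a conceptual illustration only, not the Beyond Presence SDK: `VoicePipeline`, `SpeechToVideoSink`, `send_audio`, `run_turn`, and the `fake_*` stand-ins are all hypothetical names invented for this sketch, and the real integration goes through the LiveKit plugin or Pipecat service.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class AudioChunk:
    """Hypothetical container for raw audio passed between stages."""
    pcm: bytes
    sample_rate: int


class VoicePipeline:
    """Step 1: the part you own. STT, LLM, and TTS are stand-in callables."""

    def __init__(
        self,
        stt: Callable[[AudioChunk], str],
        llm: Callable[[str], str],
        tts: Callable[[str], AudioChunk],
    ) -> None:
        self.stt = stt
        self.llm = llm
        self.tts = tts

    def respond(self, user_audio: AudioChunk) -> AudioChunk:
        text = self.stt(user_audio)   # speech -> text
        reply = self.llm(text)        # text -> reply text
        return self.tts(reply)        # reply text -> agent speech


@dataclass
class SpeechToVideoSink:
    """Step 2 placeholder: stands in for the speech-to-video API.

    It only records the audio it receives; in the real service, avatar
    generation and video streaming (step 3) happen on the provider side.
    """
    received: List[AudioChunk] = field(default_factory=list)

    def send_audio(self, chunk: AudioChunk) -> None:
        self.received.append(chunk)


def run_turn(
    pipeline: VoicePipeline, sink: SpeechToVideoSink, user_audio: AudioChunk
) -> AudioChunk:
    """Run one conversational turn and hand the agent's audio to the sink."""
    agent_audio = pipeline.respond(user_audio)
    sink.send_audio(agent_audio)
    return agent_audio


# Toy stand-ins so the sketch runs without any external services.
def fake_stt(chunk: AudioChunk) -> str:
    return chunk.pcm.decode("utf-8")


def fake_llm(text: str) -> str:
    return f"You said: {text}"


def fake_tts(text: str) -> AudioChunk:
    return AudioChunk(pcm=text.encode("utf-8"), sample_rate=16000)


if __name__ == "__main__":
    sink = SpeechToVideoSink()
    pipeline = VoicePipeline(fake_stt, fake_llm, fake_tts)
    run_turn(pipeline, sink, AudioChunk(pcm=b"hello", sample_rate=16000))
    print(len(sink.received), "audio chunk(s) handed to the avatar stage")
```

The point of the sketch is the boundary: everything up to the TTS output stays in your code, and only the resulting audio crosses over to the avatar side.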
Supported Frameworks
We support integration with popular voice agent frameworks including LiveKit and Pipecat, allowing you to add avatar video to your existing voice pipelines.
LiveKit Plugin
Add avatars to your LiveKit agents with our plugin
Pipecat Service
Integrate with Pipecat framework for avatar video
When to Use This
Choose speech-to-video when you need:
- Full control: Complete management of turn detection, STT, LLM, and TTS components
- Existing pipelines: Integration with current voice agent infrastructure