Prerequisites
| Service | What You Need |
|---|---|
| Plivo | Auth ID, Auth Token, Voice-enabled phone number |
| Deepgram | API key from console.deepgram.com |
| OpenAI | API key from platform.openai.com |
| ElevenLabs | API key from elevenlabs.io |
Installation
Environment Variables
Pipeline Configuration
Service Details
Deepgram STT
Real-time speech recognition with interim results and language detection.| Option | Description |
|---|---|
DeepgramSTTService | Standard WebSocket transcription |
DeepgramFluxSTTService | Enhanced turn detection for conversations |
OpenAI LLM
Chat completion with GPT-4o supporting streaming responses and function calling.| Model | Description |
|---|---|
gpt-4o | Most capable, multimodal |
gpt-4o-mini | Faster, cost-effective |
gpt-4-turbo | Previous generation |
ElevenLabs TTS
Natural voice synthesis with word-level timing and voice cloning support.| Feature | Description |
|---|---|
| WebSocket streaming | Real-time audio with low latency |
| Word-level timing | Precise synchronization |
| Voice cloning | Create custom voices |
| Multilingual | 29+ languages supported |
ElevenLabsTTSService- WebSocket-based, recommended for real-timeElevenLabsHttpTTSService- HTTP-based, simpler setup
Quick Start
Inbound Calls
Outbound Calls
Related
- Pipecat Overview - Architecture and setup
- Deepgram Docs - STT configuration
- OpenAI Docs - LLM configuration
- ElevenLabs Docs - TTS configuration