Build an AI Voice Agent by Integrating Plivo with Deepgram, OpenAI, and ElevenLabs

Plivo-powered AI voice agents can transform your business operations by enabling natural, intelligent conversations with your customers. By integrating Plivo’s Voice API with Deepgram for speech recognition, OpenAI for conversation processing, and ElevenLabs for natural speech synthesis, you can create sophisticated voice agents that handle customer interactions efficiently.

This integration guide will help you build a voice agent that can:

Transcribe live customer audio in real-time
Generate contextual responses using AI
Convert responses to natural-sounding speech
Maintain fluid conversations with built-in voice activity detection

Get Started with Plivo

Before developing your AI voice agent, Sign up for Plivo or sign in to your existing account. Purchase a number through the Voice API or Plivo console to receive customer calls.

Prerequisites

You’ll need active accounts and API keys from:

Deepgram for speech-to-text conversion
OpenAI for conversational AI
ElevenLabs for text-to-speech synthesis

Clone the Plivo audio stream integration guides repository

git clone https://github.com/plivo/AI-Voice-Agents.git
cd AI-Voice-Agents/Deepgram-openai-elevenlabs

Was this code helpful

Setting Up Your Development Environment

First, create a Python virtual environment and install the required dependencies:

pip install virtualenv
virtualenv -p /usr/bin/python3 venv
source venv/bin/activate

Was this code helpful

Install requirements

1. Install Python Requirements

pip install -r requirements.txt

Was this code helpful

2. Install FFmpeg for audio processing by downloading it from the official website.

Configure Your Environment

Create a config.json file with your API credentials:

{
    "auth_id": "your_plivo_auth_id",
    "auth_token": "your_plivo_auth_token",
    "github_token": "your_github_token",
    "deepgram_api_key": "your_deepgram_api_key",
    "openai_api_key": "your_openai_api_key",
    "elevenlabs_api_key": "your_elevenlabs_api_key"
}

Was this code helpful

Customize Your Voice Agent’s Personality

Define your voice agent’s behavior by updating the LLM prompt. You can update the LLM prompt by updating your system message in server.py:

system_msg = """You are John Doe, a voice agent that helps in resolving 
general queries related to any field. When someone says hello, you will greet them and answer their questions in a polite way."""

Was this code helpful

Set Up Local Development Environment

To test your application locally:

1. Install ngrok:

brew install ngrok  # for macOS
# or download from https://ngrok.com/download for other OS

Was this code helpful

2. Start an ngrok tunnel to expose your local server:

ngrok http 5000

Was this code helpful

3. Launch your application:

python server.py

Was this code helpful

Configure Your Plivo Number

Set up your Plivo number to handle incoming calls using the command:

python number_setup.py your-plivo-number ws://[your-ngrok-subdomain].ngrok.app/stream

Was this code helpful

Replace the placeholders with your actual Plivo number and ngrok URL received from step 1.

Testing Your Voice Agent

Once configured, your voice agent is ready to handle calls. Place a test call to your Plivo number to interact with your AI-powered voice agent. The agent will:

Listen for incoming audio using voice activity detection
Transcribe speech when a pause is detected
Generate an AI response based on the transcription
Convert the response to natural speech
Play the response back to the caller

This integration enables seamless, natural conversations between your customers and AI voice agents, powered by Plivo’s reliable voice infrastructure.

For additional support:

Visit Plivo Documentation.
Contact Plivo Support for technical assistance.

Rate this page

🥳 Thank you! It means a lot to us!