PIOPIY AI

Telephonic-Grade Voice AI — WebRTC-Ready Framework

Piopiy AI is an open-source, telephony-grade framework for building real-time voice agents that blend large language models (LLM), automatic speech recognition (ASR), and text-to-speech (TTS) engines. Purchase numbers, configure agents, and let Piopiy handle call routing, audio streaming, and connectivity while you focus on conversation design. Combine cloud or open-source providers to tailor the voice stack to your latency, privacy, and cost targets.

Installation

Requires Python 3.10+.

pip install piopiy-ai

To install extras for the providers you plan to use:

pip install "piopiy-ai[cartesia,deepgram,openai]"

Set provider API keys in the environment (for example, OPENAI_API_KEY).

Quick Example

import asyncio
import os

from piopiy.agent import Agent
from piopiy.voice_agent import VoiceAgent
from piopiy.services.deepgram.stt import DeepgramSTTService
from piopiy.services.openai.llm import OpenAILLMService
from piopiy.services.cartesia.tts import CartesiaTTSService


async def create_session():
    voice_agent = VoiceAgent(
        instructions="You are an advanced voice AI.",
        greeting="Hello! How can I help you today?",
    )

    stt = DeepgramSTTService(api_key=os.getenv("DEEPGRAM_API_KEY"))
    llm = OpenAILLMService(api_key=os.getenv("OPENAI_API_KEY"))
    tts = CartesiaTTSService(api_key=os.getenv("CARTESIA_API_KEY"))

    await voice_agent.Action(stt=stt, llm=llm, tts=tts)
    await voice_agent.start()


async def main():
    agent = Agent(
        agent_id=os.getenv("AGENT_ID"),
        agent_token=os.getenv("AGENT_TOKEN"),
        create_session=create_session,
    )
    await agent.connect()


if __name__ == "__main__":
    asyncio.run(main())

Advanced Usage & Dynamic Switching

Piopiy AI supports advanced features like switching providers mid-call (e.g., swapping TTS voices or STT models based on user commands).

Check out the Switching Providers Examples to see how to implement dynamic provider switching with ServiceSwitcher.

Supported Providers

Piopiy AI supports 40+ providers. Here are some of the most popular ones:

LLM: OpenAI, Anthropic, Google Gemini, Groq, unsloth (via Ollama)
STT: Deepgram, Speechmatics, Google, Azure, AssemblyAI, Whisper
TTS: ElevenLabs, Cartesia, PlayHT, Azure, Google, Rime

👉 See the full list of Supported Providers

Interruption & Silero VAD

Enable interruption handling with Silero voice activity detection:

pip install "piopiy-ai[silero]"

Silero VAD detects speech during playback, allowing callers to interrupt the agent.

Open-Source Voice Stack (LLM + ASR + TTS)

Pair Piopiy’s realtime orchestration with open-source engines across the full speech stack:

Layer	Default	Alternatives
LLM	Ollama running `llama3.1` (or another local model)	LM Studio, GPT4All via Ollama-compatible APIs
ASR	`WhisperSTTService` with Whisper small/medium models	`mlx-whisper` for Apple silicon
TTS	`ChatterboxTTSService` pointed at a self-hosted Chatterbox TTS server	Piper, XTTS, Kokoro

Install the optional dependencies and runtimes:

pip install "piopiy-ai[whisper]"
# Install and run Ollama separately: https://ollama.ai
# Start the Chatterbox TTS WebSocket server (https://github.com/piopiy-ai/chatterbox-tts)

Example session factory using the open-source trio:

from piopiy.voice_agent import VoiceAgent
from piopiy.services.whisper.stt import WhisperSTTService
from piopiy.services.ollama.llm import OLLamaLLMService
from piopiy.services.opensource.chatterbox.tts import ChatterboxTTSService


async def create_session():
    voice_agent = VoiceAgent(
        instructions="You are a helpful local-first voice assistant.",
        greeting="Hi there! Running fully on open-source models today.",
    )

    stt = WhisperSTTService(model="small")
    llm = OLLamaLLMService(model="llama3.1")  # points to your local Ollama runtime
    tts = ChatterboxTTSService(base_url="ws://localhost:6078")

    await voice_agent.Action(stt=stt, llm=llm, tts=tts, vad=True)
    await voice_agent.start()

Swap in other open-source providers such as Piper, XTTS, or Kokoro for TTS, and adjust the Chatterbox base URL or voice ID for your deployment. You can also run Whisper on Apple silicon with the mlx-whisper extra. Piopiy's abstraction layer lets you mix these with managed services whenever needed.

Telephony Integration

Connect phone calls in minutes using the Piopiy dashboard:

Sign in at dashboard.piopiy.com and purchase a phone number.
Create a voice AI agent to receive AGENT_ID and AGENT_TOKEN.
Use those credentials with the SDK for instant connectivity.

No SIP setup or third-party telephony vendors are required—Piopiy handles the calls so you can focus on your agent logic.

Thanks to Pipecat for making client SDK implementation easy.

Name		Name	Last commit message	Last commit date
Latest commit History 99 Commits
docs		docs
example		example
script		script
src/piopiy		src/piopiy
tests		tests
.gitattributes		.gitattributes
.gitignore		.gitignore
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
README.md		README.md
__init__.py		__init__.py
debug_import.py		debug_import.py
download_kokoro.sh		download_kokoro.sh
pyproject.toml		pyproject.toml
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

PIOPIY AI

Installation

Quick Example

Advanced Usage & Dynamic Switching

Supported Providers

Interruption & Silero VAD

Open-Source Voice Stack (LLM + ASR + TTS)

Telephony Integration

About

Uh oh!

Releases

Packages

Contributors 5

Uh oh!

Languages

License

telecmi/agents

Folders and files

Latest commit

History

Repository files navigation

PIOPIY AI

Installation

Quick Example

Advanced Usage & Dynamic Switching

Supported Providers

Interruption & Silero VAD

Open-Source Voice Stack (LLM + ASR + TTS)

Telephony Integration

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 5

Uh oh!

Languages

Packages