A comprehensive ComfyUI integration for Microsoft's VibeVoice text-to-speech model, enabling high-quality single and multi-speaker voice synthesis directly within your ComfyUI workflows.
Updated Feb 18, 2026 - Python
High-performance Text-to-Speech server with OpenAI-compatible API, 8 voices, emotion tags, and modern web UI. Optimized for RTX GPUs.
ComfyUI custom node for VibeVoice TTS. Generates expressive, long-form, multi-speaker conversational audio.
Rapida is an open-source, end-to-end voice AI orchestration platform for building real-time conversational voice agents with audio streaming, STT, TTS, VAD, multi-channel integration, agent state management, and observability.
🎙️ Speak with AI - Run locally using Ollama, OpenAI, Anthropic or xAI - Speech uses SparkTTS, OpenAI, ElevenLabs, Kokoro or Typecast
ComfyUI node for highly expressive speech and realistic zero-shot voice cloning
Eleven Labs text to speech package for NodeJS. You can use the official package at: https://www.npmjs.com/package/elevenlabs
List of open-source TTS, voice cloning, and music generation models
🦆💰 A bot that uses Uberduck (and FakeYou) AI to make bit donations have an AI voice.
Beautiful voice app: record or upload to train a voice, generate speech from text or files, save & download voices.
Voice AI agent for reactivating cold leads through personalized calls, assessing their interest with AI agents, and syncing insights directly to your CRMs.
Voice-powered AI assistant platform — connect any LLM, any TTS, with a live web canvas, music generation, and agent orchestration using openclaw. Install: npx openvoiceui setup
Voice conversion based on VITS. Focused on simplicity, quality, and performance.
Free voice cloning for creators using Coqui XTTS-v2 on Google Colab. Clone your voice with just a few minutes of audio. Complete guide to build your own notebook.
ComfyUI node for modular, human‑like Kani TTS. Generate natural, high‑quality speech from text
Archive of the official Microsoft VibeVoice repository (7B & 1.5B). Backup of the deleted source code for the open-source TTS models, including the removed 7B version. Try the VibeVoice online service
A framework for AI WhatsApp calls using Whisper, Coqui TTS, GPT-3.5 Turbo, Virtual Audio Cable, and the WhatsApp Desktop App.
Hyper-fast, local, high-quality TTS based on Kokoro-82M. PySide6 GUI included.
Local, portable GUI for Qwen3-TTS. Optimized for NVIDIA RTX 50 Series (CUDA 12.8). One-click install.