🎙️ Discord Voice Bot

Real-time AI voice chat in Discord voice channels. Talk to an AI assistant like a phone call.

[中文] Discord 语音频道 AI 实时语音对话机器人，像打电话一样和 AI 聊天。

How It Works / 工作原理

User speaks → Discord PCM audio → Silence detection → Whisper STT
                                                         ↓
                                                   LLM generates reply
                                                         ↓
Play audio  ←  FFmpeg decode  ←  Edge-TTS synthesis  ←  AI text reply

Features / 功能特性

🎤 Real-time Voice Chat — Speak naturally in Discord voice channels
🧠 LLM-powered Responses — Uses any OpenAI-compatible API
🗣️ Whisper STT — Local speech-to-text with OpenAI Whisper
🔊 Edge-TTS — High-quality text-to-speech with Microsoft Edge voices
🔇 Silence Detection — Automatically detects when you stop talking

Quick Start / 快速开始

Prerequisites

Python 3.10+
FFmpeg installed
A separate Discord Bot Token (cannot share with other bots)

Setup

# Install dependencies
pip install -r requirements.txt

# Configure
cp .env.example .env
# Edit .env with your bot token and API keys

# Run
python bot.py

Discord Bot Setup

Go to Discord Developer Portal
Create New Application
Go to Bot → Copy Token, enable Message Content Intent
OAuth2 → URL Generator: Scopes: bot, Permissions: Connect, Speak, Use Voice Activity, Send Messages
Invite bot to your server with the generated URL

⚠️ Important: You need a dedicated bot token. Discord only allows one Gateway connection per token.

Tech Stack / 技术栈

Language: Python
STT: OpenAI Whisper (local)
TTS: edge-tts (Microsoft Edge voices)
LLM: Any OpenAI-compatible API
Audio: FFmpeg, discord.py voice

Configuration / 配置

Variable	Description
`DISCORD_TOKEN`	Bot token (dedicated, not shared)
`GUILD_ID`	Discord server ID
`AI_BASE_URL`	OpenAI-compatible API endpoint
`AI_API_KEY`	API key for LLM
`AI_MODEL`	Model name
`WHISPER_MODEL`	Whisper model size (tiny/base/small/medium/large)
`TTS_VOICE`	Edge-TTS voice name
`SYSTEM_PROMPT`	AI personality prompt

Contributing / 贡献

PRs welcome! Please test with a real Discord voice channel before submitting.

License / 许可证

MIT License

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
.env.example		.env.example
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
bot.py		bot.py
llm.py		llm.py
requirements.txt		requirements.txt
start.sh		start.sh
stt.py		stt.py
tts.py		tts.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🎙️ Discord Voice Bot

How It Works / 工作原理

Features / 功能特性

Quick Start / 快速开始

Prerequisites

Setup

Discord Bot Setup

Tech Stack / 技术栈

Configuration / 配置

Contributing / 贡献

License / 许可证

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

🎙️ Discord Voice Bot

How It Works / 工作原理

Features / 功能特性

Quick Start / 快速开始

Prerequisites

Setup

Discord Bot Setup

Tech Stack / 技术栈

Configuration / 配置

Contributing / 贡献

License / 许可证

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages