d-scribe

Record Discord voice channel audio and transcribe it with speaker attribution using Whisper.

Features

Capture audio from Discord (loopback + microphone) via WASAPI on Windows
Track speakers using Discord RPC speaking events
Transcribe segments with whisper.cpp
Export to SRT or VTT
Auto-save sessions to a recent folder (configurable retention, default 10 days)
Playback in Remote, Local, or Both mode, with the transcript auto-scrolling during playback
Project list loaded from the default location; click a project to open it, or delete it with optional audio cleanup
Discord auth persisted via refresh token; auto-reconnect on startup

Prerequisites

Node.js & npm
Rust (for Tauri)
Discord running (for RPC connection)
Whisper binary – required for transcription (see below)

Quick Start

1. Install dependencies

npm install

2. Set up Whisper (required for transcription)

Windows (recommended):

.\src-tauri\binaries\download-whisper.ps1

Or see src-tauri/binaries/README.md for manual setup.

3. Download a Whisper model

Run the app, open Settings, and download a model (e.g. ggml-base.en.bin). Models are stored in %APPDATA%/d-scribe/models/.

4. Run the app

npm run tauri dev

5. Connect & record

Open Settings → connect to Discord (Client ID, Client Secret, RPC Origin). Auth is saved; you typically only need to authorize once.
Join a voice channel
Click Start Recording
When done, click Stop Recording; the session is auto-saved to the recent sessions folder
Use Play to listen; transcript auto-scrolls with playback
Click Transcribe to run Whisper on each segment
Click Save Project to move the session to permanent storage, or Export to write SRT/VTT
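
The Export step produces one subtitle cue per transcribed segment. As a rough illustration of the SRT layout with speaker attribution, the sketch below formats cues by hand. The `TranscribedSegment` struct, its field names, and the `speaker: text` line format are assumptions made for this example, not d-scribe's actual types or output.

```rust
/// Hypothetical transcribed segment; field names are assumptions for this sketch.
struct TranscribedSegment {
    speaker: String,
    start_ms: u64,
    end_ms: u64,
    text: String,
}

/// Format milliseconds as an SRT timestamp: HH:MM:SS,mmm.
fn srt_timestamp(ms: u64) -> String {
    let hours = ms / 3_600_000;
    let minutes = (ms / 60_000) % 60;
    let seconds = (ms / 1_000) % 60;
    let millis = ms % 1_000;
    format!("{:02}:{:02}:{:02},{:03}", hours, minutes, seconds, millis)
}

/// Render segments as SRT: a 1-based index, a time range, the text, then a blank line.
fn to_srt(segments: &[TranscribedSegment]) -> String {
    let mut out = String::new();
    for (i, seg) in segments.iter().enumerate() {
        out.push_str(&format!(
            "{}\n{} --> {}\n{}: {}\n\n",
            i + 1,
            srt_timestamp(seg.start_ms),
            srt_timestamp(seg.end_ms),
            seg.speaker,
            seg.text.trim()
        ));
    }
    out
}
```

WebVTT is close to this layout: the file starts with a `WEBVTT` header line and timestamps use `.` instead of `,` before the milliseconds.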

Project Structure

d-scribe/
├── src/                 # React frontend (Vite + TypeScript)
├── src-tauri/
│   ├── src/             # Rust backend
│   │   ├── lib.rs       # Tauri commands, transcription orchestration
│   │   ├── audio/       # WASAPI capture
│   │   ├── discord_rpc/ # Discord RPC, OAuth, token persistence
│   │   ├── session/     # Recording, segments, merge buffer
│   │   ├── project.rs   # Save/load, auto-save, purge, delete
│   │   └── transcription/ # WAV extraction, Whisper CLI
│   └── binaries/        # whisper-cli.exe + DLLs (run download script)
└── docs/

Data Locations

Projects: %APPDATA%/d-scribe/projects/ (permanent saves)
Recent sessions: %APPDATA%/d-scribe/projects/recent/ (auto-saved; purged by retention)
Models: %APPDATA%/d-scribe/models/
Transcription temp: %APPDATA%/d-scribe/transcribe_temp/
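
Recent sessions are purged once they are older than the retention setting. A minimal sketch of that age check follows, assuming retention is measured from the time a session was saved. The function name and the choice to keep sessions whose timestamp lies in the future are assumptions, not necessarily what project.rs does.

```rust
use std::time::{Duration, SystemTime};

/// Returns true when a session saved at `saved_at` is older than
/// `retention_days` as of `now`. Hypothetical helper for illustration.
fn is_expired(saved_at: SystemTime, now: SystemTime, retention_days: u64) -> bool {
    let retention = Duration::from_secs(retention_days * 24 * 60 * 60);
    match now.duration_since(saved_at) {
        Ok(age) => age > retention,
        // `saved_at` is later than `now` (for example, clock skew): keep the session.
        Err(_) => false,
    }
}
```

With the default of 10 days, a session saved 11 days ago would be purged and one saved 9 days ago would be kept.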

Settings

Project name template: Supports the placeholders {guild}, {channel}, {timestamp}, {date}, and {time}; used for session IDs and filenames
Recent sessions retention (days): How long auto-saved sessions are kept (default 10)
Segment merge buffer (ms): Minimum silence required before a new segment starts; shorter pauses are merged into the current segment
Playback mode: Remote, Local, or Both (default Both; persisted)
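
To make the project name template concrete, here is a small sketch of placeholder expansion. The `NameContext` struct, both function names, and the example value formats are assumptions. The README does not specify how {timestamp}, {date}, or {time} are formatted, so the caller supplies them as strings.

```rust
/// Values substituted into the project name template.
/// The struct is hypothetical; value formats are up to the caller.
struct NameContext<'a> {
    guild: &'a str,
    channel: &'a str,
    timestamp: &'a str,
    date: &'a str,
    time: &'a str,
}

/// Replace each documented placeholder with its value; any other text,
/// including unknown placeholders, is left as-is.
fn expand_template(template: &str, ctx: &NameContext) -> String {
    template
        .replace("{guild}", ctx.guild)
        .replace("{channel}", ctx.channel)
        .replace("{timestamp}", ctx.timestamp)
        .replace("{date}", ctx.date)
        .replace("{time}", ctx.time)
}

/// Replace characters that Windows does not allow in file names with '_'.
fn sanitize_filename(name: &str) -> String {
    name.chars()
        .map(|c| match c {
            '<' | '>' | ':' | '"' | '/' | '\\' | '|' | '?' | '*' => '_',
            c if c.is_control() => '_',
            c => c,
        })
        .collect()
}
```

Because the replacements run one after another, a guild name that itself contains text such as `{channel}` would be expanded a second time. A single-pass parser would avoid that if it matters.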

Build

npm run tauri build

Troubleshooting

Enable debug logging (to diagnose Discord RPC, transcription, etc.):

PowerShell: $env:RUST_LOG="d_scribe=debug,wasapi=warn"; npm run tauri dev
Cmd: set "RUST_LOG=d_scribe=debug,wasapi=warn" && npm run tauri dev

(Plain RUST_LOG=debug floods the terminal with WASAPI trace logs.)

Zero segments after recording: Segmentation comes from Discord RPC speaking events. Make sure you are connected in Settings and have joined the voice channel before you start recording. If you still get zero segments, enable debug logging as described above and check the RPC connection and speaking-event subscription.
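
Because segments are derived from speaking events, it can help to picture how start and stop events become segments. The sketch below is one plausible reading of the segment merge buffer, not the code in session/: the `SpeakingEvent` and `SpeechSegment` types, the field names, and the per-speaker merge rule are all assumptions.

```rust
use std::collections::HashMap;

/// Hypothetical speaking event; d-scribe's real event types may differ.
enum SpeakingEvent {
    Start { user_id: String, at_ms: u64 },
    Stop { user_id: String, at_ms: u64 },
}

#[derive(Debug, PartialEq)]
struct SpeechSegment {
    user_id: String,
    start_ms: u64,
    end_ms: u64,
}

/// Build per-speaker segments from time-ordered events. A pause shorter than
/// `merge_buffer_ms` extends that speaker's previous segment instead of
/// starting a new one.
fn build_segments(events: &[SpeakingEvent], merge_buffer_ms: u64) -> Vec<SpeechSegment> {
    let mut open: HashMap<String, u64> = HashMap::new(); // user -> start time
    let mut segments: Vec<SpeechSegment> = Vec::new();

    for event in events {
        match event {
            SpeakingEvent::Start { user_id, at_ms } => {
                // Keep the earliest start if duplicate Start events arrive.
                open.entry(user_id.clone()).or_insert(*at_ms);
            }
            SpeakingEvent::Stop { user_id, at_ms } => {
                let start_ms = match open.remove(user_id) {
                    Some(start) => start,
                    None => continue, // Stop without a matching Start
                };
                // Try to extend this speaker's most recent segment.
                let merged = match segments.iter_mut().rev().find(|s| &s.user_id == user_id) {
                    Some(prev) if start_ms.saturating_sub(prev.end_ms) < merge_buffer_ms => {
                        prev.end_ms = *at_ms;
                        true
                    }
                    _ => false,
                };
                if !merged {
                    segments.push(SpeechSegment {
                        user_id: user_id.clone(),
                        start_ms,
                        end_ms: *at_ms,
                    });
                }
            }
        }
    }
    segments
}
```

In this model, an empty event list yields an empty segment list, which is the zero-segments symptom: if no speaking events reach the app, there is nothing to split the recording on.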

Planned Features

AI summary – Generate summaries of transcripts using AI/LLM
Participant stats – Word counts and speaking time per participant
Meeting notes workflow – Supported flow for creating structured meeting notes from transcripts
Debate/discussion analysis – Workflow for analyzing debates and discussions (arguments, positions, rebuttals)
Custom workflows – User-defined workflows with configurable LLM instructions (e.g. custom prompts, output formats)
Flexible AI/LLM support – Use both local models (e.g. Ollama, llama.cpp) and remote APIs (OpenAI, Anthropic, etc.) for summarization and other tasks
Privacy: mute-aware recording – Listen for Discord mute events; when the current user or others mute, disable the corresponding local audio stream so muted participants are not recorded

License

MIT
