🗣️ Yapper — Voice to Clipboard

Self-contained speech-to-text tool. Press a hotkey to record, press again to stop. Audio is transcribed locally using Whisper and copied to your clipboard. No external server, no cloud, one process. 🔒

🚀 Two Ways to Run

Yapper has two entry points — same engine, different interfaces:

  • yapper — Raw/headless mode. Minimal terminal output, no dependencies beyond the core. Good for scripting, running in the background, or if you just want it to work without any fuss.

  • yapper-tui — Rich terminal dashboard. Live status panel, real-time waveform visualization, activity log, transcription display, and audio feedback sounds (beeps in your headphones on start/stop). Good for when you want to see what the app is doing. 🎧

Both accept the same CLI options. The TUI is a separate layer that wraps the core — it never touches the engine logic.

⚑ How It Works

  1. On launch, loads a Whisper model onto the GPU (or CPU) and keeps it in memory
  2. Press Shift+Space to start recording from your microphone 🎀
  3. Press Shift+Space again to stop
  4. Audio is transcribed locally with VAD filtering (strips silence)
  5. Result is copied to the clipboard and a notification fires 📋 (see the sketch after this list)
  6. Press Ctrl+Shift+; to toggle language (EN ↔ HR)
  7. Ready for the next recording ♻️
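
Under the hood this is a thin loop over a few libraries. A minimal sketch of the same pipeline in plain Python (using sounddevice, faster-whisper, and pyperclip directly, with a fixed recording length instead of Yapper's hotkey toggle; this is illustrative, not Yapper's actual core.py):

# Minimal sketch: record a few seconds, transcribe locally, copy to clipboard.
# Library calls are real; the fixed 5-second recording is purely illustrative.
import numpy as np
import sounddevice as sd
import pyperclip
from faster_whisper import WhisperModel

SAMPLE_RATE = 16000  # Whisper expects 16 kHz mono audio

# Load the model once; it stays in memory between recordings.
model = WhisperModel("base", device="auto", compute_type="auto")

def record(seconds: float) -> np.ndarray:
    """Capture mono float32 audio from the default microphone."""
    audio = sd.rec(int(seconds * SAMPLE_RATE), samplerate=SAMPLE_RATE,
                   channels=1, dtype="float32")
    sd.wait()
    return audio.flatten()

audio = record(5.0)
# vad_filter strips silence before transcription.
segments, _info = model.transcribe(audio, language="en", vad_filter=True)
text = " ".join(seg.text.strip() for seg in segments)
pyperclip.copy(text)
print(f"Copied to clipboard: {text!r}")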

📦 Installation

From GitHub Releases (recommended)

Download the latest release from the Releases page.

From source

🐧 Linux (Ubuntu/Debian)

# System dependencies
sudo apt install libportaudio2 xclip

# (Optional) For Wayland clipboard support instead of xclip:
# sudo apt install wl-clipboard

# Clone and install
git clone https://github.com/dlozina/yapper.git
cd yapper
uv venv
source .venv/bin/activate

# Install PyTorch with CUDA (for GPU acceleration)
uv pip install torch --index-url https://download.pytorch.org/whl/cu124

# Install yapper
uv pip install -e .

🍎 macOS

brew install portaudio

git clone https://github.com/dlozina/yapper.git
cd yapper
uv venv
source .venv/bin/activate
uv pip install -e .

Notes:

  • No CUDA on macOS — uses CPU with int8 quantization. Apple Silicon (M-series) runs the base model in under a second for short clips.
  • macOS will prompt for microphone access and Accessibility permissions (for pynput) on first run.

🪟 Windows

git clone https://github.com/dlozina/yapper.git
cd yapper
uv venv
.venv\Scripts\activate

# Install PyTorch with CUDA
uv pip install torch --index-url https://download.pytorch.org/whl/cu124

# Install yapper
uv pip install -e .

Note: Install Visual C++ Redistributable if not already present.

🎯 Usage

TUI mode (recommended for interactive use)

# Terminal dashboard with waveform, status, and sound feedback
yapper-tui

# With options
yapper-tui --model large-v3-turbo --language en
yapper-tui --compute cpu

Raw mode (headless / minimal)

# Plain terminal output, no rich UI
yapper

# Same options
yapper --model base --compute cpu

Common options

# Set language explicitly (skips detection, faster)
yapper-tui --language en

# Croatian
yapper-tui --language hr

# Custom hotkey
yapper-tui --hotkey "<ctrl>+<space>"

# Save recordings to ~/Recordings/yapper/
yapper-tui --save

# List available microphones
yapper-tui --list-devices

# Pick a specific microphone by index
yapper-tui --device 3

# Force CPU even if CUDA is available
yapper-tui --compute cpu

⚙️ CLI Options

Option           Default            Description
--model          base               Whisper model: tiny, base, small, medium, large-v3, large-v3-turbo
--language       en                 Language code (en, hr, etc.)
--hotkey         <shift>+<space>    Global hotkey in pynput format
--lang-hotkey    <ctrl>+<shift>+;   Hotkey to toggle language
--device         auto               Audio input device index or auto
--compute        auto               Force cuda or cpu (auto detects CUDA)
--save           off                Save audio + transcript to ~/Recordings/yapper/
--list-devices   —                  List audio input devices and exit
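
The hotkey strings are parsed by pynput. A rough sketch of how strings like the defaults above map to global handlers (the callbacks here are placeholders, not Yapper's functions):

# Placeholder callbacks; Yapper wires these to its engine instead.
from pynput import keyboard

def toggle_recording():
    print("toggle recording")

def toggle_language():
    print("toggle language")

# GlobalHotKeys accepts the same "<shift>+<space>" style strings
# that --hotkey and --lang-hotkey take on the command line.
with keyboard.GlobalHotKeys({
    "<shift>+<space>": toggle_recording,
    "<ctrl>+<shift>+;": toggle_language,
}) as listener:
    listener.join()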

🧠 Models

Model            Size     Speed (GPU)  Speed (CPU)  Quality
tiny             ~75MB    instant      fast         basic
base             ~150MB   instant      fast         good ✅
small            ~500MB   instant      moderate     better
medium           ~1.5GB   fast         slow         great
large-v3-turbo   ~1.5GB   fast         slow         best 🏆

Models download automatically on first run and are cached in ~/.cache/huggingface/.
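
Roughly how model choice interacts with hardware, as a sketch rather than Yapper's actual compute.py (pick_compute is a made-up helper name):

import torch
from faster_whisper import WhisperModel

def pick_compute() -> tuple[str, str]:
    """Return (device, compute_type): float16 on CUDA, int8 on CPU."""
    if torch.cuda.is_available():
        return "cuda", "float16"
    return "cpu", "int8"

device, compute_type = pick_compute()
# First run downloads the weights and caches them under ~/.cache/huggingface/;
# later runs load straight from the cache.
model = WhisperModel("base", device=device, compute_type=compute_type)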

🏗️ Architecture

yapper/                  Core package
  cli.py                 CLI entry point (parse_args, main)
  core.py                Engine — recording, transcription, clipboard
  devices.py             Device detection, mic check, paired output
  clipboard.py           Cross-platform clipboard with fallback chain
  compute.py             GPU/CPU detection
  notifications.py       Desktop notifications
  constants.py           Shared constants

yapper_tui.py            TUI entry point — wires core + display + sounds
tui/
  display.py             Rich Live dashboard (status, waveform, log)
  sounds.py              Audio feedback (beeps via dedicated OutputStream)

The core never imports rich or anything from tui/. The TUI subscribes to events via callbacks and adds its own rendering and sound layer (sketched after the list below). This separation means:

  • 🧪 The core can be tested and used without any UI
  • 🔌 Alternative frontends (GUI, web, system tray) can wrap the same engine
  • 🎨 The TUI can evolve independently without risking the transcription logic
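
A rough illustration of that callback boundary (class and method names here are illustrative, not Yapper's actual API):

from dataclasses import dataclass
from typing import Callable

@dataclass
class EngineCallbacks:
    # Subscribers fill in only the events they care about.
    on_record_start: Callable[[], None] = lambda: None
    on_record_stop: Callable[[], None] = lambda: None
    on_transcription: Callable[[str], None] = lambda text: None

class Engine:
    """Core engine: knows nothing about rich, sounds, or the TUI."""
    def __init__(self, callbacks: EngineCallbacks | None = None):
        self.callbacks = callbacks or EngineCallbacks()

    def finish(self, text: str) -> None:
        # The core only fires the event; the subscriber decides whether
        # to render it, log it, or play a chirp.
        self.callbacks.on_transcription(text)

# The TUI (or any other frontend) wires its own handlers into the same engine:
engine = Engine(EngineCallbacks(on_transcription=lambda t: print(f"[TUI] {t}")))
engine.finish("hello world")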

🖥️ TUI Features

  • Status panel — shows idle/recording/transcribing with color-coded borders
  • Live waveform — real-time audio level visualization using Unicode block characters
  • Recording timer — elapsed time shown during recording
  • Activity log — timestamped log of all events
  • Last transcription — always visible for reference
  • Sound feedback — ascending beep on record start, descending on stop, chirp when clipboard is ready 🔔 (see the sketch after this list)
  • Smart audio routing — sounds play through headphones when using a headset mic (e.g. PlayStation Link, Elgato Wave) 🎧
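
The beeps are just short sine tones. A rough standalone equivalent using sounddevice's blocking playback (Yapper's sounds.py uses a dedicated OutputStream and picks the paired headset output; that part is simplified away here):

import numpy as np
import sounddevice as sd

SAMPLE_RATE = 44100

def beep(freqs, duration=0.08, device=None):
    """Play a short sequence of sine tones; pass device= to route output."""
    t = np.linspace(0, duration, int(SAMPLE_RATE * duration), endpoint=False)
    tone = np.concatenate([0.3 * np.sin(2 * np.pi * f * t) for f in freqs])
    sd.play(tone.astype(np.float32), samplerate=SAMPLE_RATE, device=device)
    sd.wait()

beep([660, 880])   # ascending: recording started
beep([880, 660])   # descending: recording stopped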

📝 Platform Notes

🐧 Linux — Wayland

pynput requires X11 or XWayland for global hotkey capture. On a pure Wayland session, the options are:

  • Run under XWayland (most desktop environments still support this)
  • Switch to X11 session
  • Future: evdev-based input backend (planned)

🐧 Linux — PipeWire/PulseAudio

sounddevice works with both PipeWire and PulseAudio. Use --list-devices to see what's available.
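
The same list can be inspected directly from Python with plain sounddevice, which is roughly what --list-devices reports:

import sounddevice as sd

print(sd.query_devices())               # all input/output devices, with indices
print(sd.query_devices(kind="input"))   # the current default input device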

🍎 macOS — Permissions

First run will trigger two permission prompts:

  1. Microphone access — required for recording
  2. Accessibility — required for pynput global hotkey

Both are one-time grants in System Settings.

🪟 Windows — Notes

  • CUDA works if you have an NVIDIA GPU and install PyTorch with CUDA support
  • CPU mode uses int8 quantization — works fine for tiny and base models
  • Clipboard uses pyperclip (which uses win32clipboard under the hood); see the sketch after this list
  • Notifications use win10toast — install with uv pip install yapper[windows]
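
Clipboard handling is the piece that varies most across platforms. The fallback-chain idea behind clipboard.py can be sketched roughly like this (the exact backends and order here are assumptions, not the real implementation):

import shutil
import subprocess

def copy_to_clipboard(text: str) -> bool:
    """Try pyperclip first, then fall back to common CLI tools on Linux."""
    try:
        import pyperclip
        pyperclip.copy(text)
        return True
    except Exception:
        pass
    for cmd in (["wl-copy"], ["xclip", "-selection", "clipboard"]):
        if shutil.which(cmd[0]):
            subprocess.run(cmd, input=text.encode(), check=True)
            return True
    return False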

📄 License

MIT
