Speech

A macOS menu bar app for speech-to-text using local Whisper models. All transcription happens on-device — no cloud services, no API keys, complete privacy.

Features

Hold-to-record: Hold the hotkey to record, release to transcribe and auto-paste
Local processing: Models run entirely on your Mac
Privacy-first: No data leaves your device
Multiple engines: Whisper, Moonshine, Parakeet, SenseVoice
Model profiles: Save different model configurations and switch between them
Multi-language: 25+ languages with auto-detection
Customizable hotkey: Default is ⌥Space (Option + Space)
Auto-update: Checks for updates from GitHub releases

Tech Stack

Tauri 2 — native macOS app shell
Rust — audio recording, transcription, hotkeys, paste injection
Svelte 5 + Tailwind CSS — UI (menu bar panel, overlays, settings)
transcribe-rs — local Whisper inference

Installation

Requirements

macOS 14.0 or later

Download & Install

Download the latest release from the Releases page
Unzip and drag Speech.app to /Applications
Open Speech.app

macOS Gatekeeper: The app is not notarized, so macOS may block it on first launch. Right-click the app and choose Open, then click Open again in the dialog. Alternatively, go to System Settings → Privacy & Security and click Open Anyway.

Build from Source

Building requires Bun (for the frontend dependencies) and a Rust toolchain (for the Tauri backend).

git clone https://github.com/kennetkusk/speech.git
cd speech
bun install
./build_app.sh
cp -R .build/Speech.app /Applications/
open /Applications/Speech.app

Setup

On first launch, grant these permissions (check status in Settings → Permissions):

Microphone — required for recording
Accessibility — required for auto-paste and global hotkey
Input Monitoring — required for keyboard simulation

Then download a model in Settings → Models.

Usage

Hold ⌥Space to start recording
Speak clearly
Release to transcribe and auto-paste

Press Escape while recording to cancel.
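
The hold, release, and cancel flow above can be read as a small state machine. The sketch below is only an illustration of that flow, not the app's actual code: the `State`, `Event`, and `Action` names and the `step` function are all hypothetical.

```rust
/// Input events the recorder reacts to (hypothetical names).
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum Event {
    HotkeyDown,
    HotkeyUp,
    Escape,
    TranscriptionDone,
}

/// Where the app is in the record/transcribe cycle.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum State {
    Idle,
    Recording,
    Transcribing,
}

/// Side effect the caller should perform after a transition.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum Action {
    None,
    StartRecording,
    StopAndTranscribe,
    DiscardRecording,
    PasteText,
}

/// Pure transition function: given the current state and an event,
/// return the next state and the action to run.
pub fn step(state: State, event: Event) -> (State, Action) {
    match (state, event) {
        // Holding the hotkey starts a recording.
        (State::Idle, Event::HotkeyDown) => (State::Recording, Action::StartRecording),
        // Releasing it stops recording and hands the audio to the model.
        (State::Recording, Event::HotkeyUp) => (State::Transcribing, Action::StopAndTranscribe),
        // Escape cancels a recording in progress without transcribing.
        (State::Recording, Event::Escape) => (State::Idle, Action::DiscardRecording),
        // When the text is ready, paste it and return to idle.
        (State::Transcribing, Event::TranscriptionDone) => (State::Idle, Action::PasteText),
        // Anything else (key repeat, stray release, Escape while idle) is ignored.
        (s, _) => (s, Action::None),
    }
}
```

Keeping the transition function free of side effects means the hotkey listener only has to feed events in and execute the returned action, which makes the cancel path easy to test in isolation.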

Settings

Click the menu bar icon → Settings to configure:

General: Hotkey, language, auto-paste, launch at login
Models: Download/manage transcription models
Profiles: Save model configurations for quick switching
Permissions: Check macOS permission status

Privacy

All audio is processed locally. No data is sent to external servers.

License

MIT — see LICENSE.
