Speech to text, your way

Press a shortcut, speak naturally, get text in any app. Go fully local for privacy or bring your own API keys (BYOK) for cloud accuracy.

View on GitHub

Free & open source · macOS 12+

See it in action

Try the Fn key in the demo below to see real-time captioning.

General · Shortcuts · Models · Dictionary · Polish · History

Transcription Models

Select a transcription model or download additional models.

My Models · Library

OpenAI

OpenAI’s cloud speech-to-text API. Fast and accurate with support for 57+ languages.

Verify

Uses a WebSocket connection instead of file upload for lower latency

Language: Auto detect (leave empty to auto detect)
Glossary:
Temperature: 0

Higher values produce more random results (range 0 to 1). Only supported by whisper-1.
Multi-language · Translate

Deepgram

Deepgram’s Nova speech-to-text API. Fast and accurate with 50+ language support.

Multi-language

Parakeet V2

Active

English only. The best model for English speakers.

English Only

API Configuration

Anthropic · claude-sonnet-4-20250514

Prompts

Mild - Correct Transcript
Built-in
Medium - Improve Fluency
Built-in
Aggressive - Restructure & Format
Built-in
Press & Speak

Hold the Fn key, speak naturally, get text instantly

How it works

Pick your engine, choose a post-processing model, set your polish level — done.

Your voice

STT Providers

LLM Providers

Polish Prompt

Polished text
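
The flow above (voice, STT provider, LLM provider, polish prompt, polished text) can be sketched as a two-stage pipeline. This is a minimal illustration, not the app's actual code: `DictationPipeline`, `fake_stt`, `fake_llm`, and the prompt wording in `POLISH_PROMPTS` are all hypothetical names and placeholders. Only the three polish levels (mild, medium, aggressive) come from this page.

```python
from dataclasses import dataclass
from typing import Callable

# Hypothetical interfaces; the app's real provider APIs are not shown on this page.
Transcriber = Callable[[bytes], str]   # audio bytes -> raw transcript
Polisher = Callable[[str, str], str]   # (prompt, transcript) -> polished text

# Placeholder prompt text keyed by the three polish levels named on this page.
POLISH_PROMPTS = {
    "mild": "Correct the transcript.",
    "medium": "Improve fluency.",
    "aggressive": "Restructure and format.",
}


@dataclass
class DictationPipeline:
    transcribe: Transcriber
    polish: Polisher
    level: str = "mild"

    def run(self, audio: bytes) -> str:
        # Stage 1: speech to text. Stage 2: LLM clean-up with the chosen prompt.
        raw = self.transcribe(audio)
        return self.polish(POLISH_PROMPTS[self.level], raw)


def fake_stt(audio: bytes) -> str:
    # Stand-in for a local or cloud STT engine.
    return "um so the meeting is at three"


def fake_llm(prompt: str, transcript: str) -> str:
    # Stand-in for an LLM call: drops filler words, capitalizes, adds a period.
    words = [w for w in transcript.split() if w not in {"um", "uh", "so"}]
    text = " ".join(words)
    return text[:1].upper() + text[1:] + "."


if __name__ == "__main__":
    pipeline = DictationPipeline(transcribe=fake_stt, polish=fake_llm)
    print(pipeline.run(b""))
```

Because each stage is just a callable, swapping a local engine for a cloud one (or one LLM provider for another) would not change the rest of the pipeline in this sketch.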

Everything you need to dictate

Local, cloud, or both. No subscriptions.

Local or Cloud — You Choose

Run transcription fully on-device for privacy, or bring your own API keys for cloud-powered accuracy. Flexible by design.

Press & Speak

One keyboard shortcut to start. Hold, toggle, or always-on — speak naturally, get text instantly in any app.

LLM Polish

Clean up transcriptions with OpenAI, Anthropic, Gemini, Groq, OpenRouter, Cerebras, Z.AI, Apple Intelligence, or your own endpoint. Mild, medium, or aggressive — your call.

Dictionary & History

Add custom terms so your jargon lands right. Full transcription history with daily stats and words-per-minute tracking.
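
Words-per-minute tracking comes down to simple arithmetic: word count divided by speaking time in minutes. The helper below is a rough sketch under the assumption that words are whitespace-separated and that the recording duration is known in seconds. `words_per_minute` is a hypothetical name, not a function from the app.

```python
def words_per_minute(text: str, duration_seconds: float) -> float:
    """Return the dictation rate for one transcription (hypothetical helper)."""
    if duration_seconds <= 0:
        raise ValueError("duration_seconds must be positive")
    word_count = len(text.split())  # assumes whitespace-separated words
    return word_count * 60.0 / duration_seconds


if __name__ == "__main__":
    # Six words spoken in three seconds.
    print(words_per_minute("one two three four five six", 3.0))
```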

Native & Fast

Built with Tauri and Rust. Lightweight, instant startup, native macOS feel. Voice activity detection stops recording when you stop speaking.
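
This page does not say how the app's voice activity detection works. As one illustration of the general idea, the sketch below uses a simple energy threshold: once speech has been heard, recording stops after a run of quiet frames. The function names, the `threshold` value, and `max_silent_frames` are assumptions chosen for the example, not the app's settings.

```python
import math
from typing import Iterable, Sequence


def rms(frame: Sequence[float]) -> float:
    """Root-mean-square energy of one audio frame (samples in -1.0..1.0)."""
    if not frame:
        return 0.0
    return math.sqrt(sum(s * s for s in frame) / len(frame))


def frames_until_silence(
    frames: Iterable[Sequence[float]],
    threshold: float = 0.02,      # assumed energy cutoff between speech and silence
    max_silent_frames: int = 3,   # assumed number of quiet frames that ends recording
) -> int:
    """Return how many frames are consumed before recording would stop."""
    heard_speech = False
    silent_run = 0
    consumed = 0
    for frame in frames:
        consumed += 1
        if rms(frame) >= threshold:
            heard_speech = True
            silent_run = 0
        elif heard_speech:
            # Leading silence is ignored; only silence after speech counts.
            silent_run += 1
            if silent_run >= max_silent_frames:
                break
    return consumed


if __name__ == "__main__":
    loud, quiet = [0.5] * 160, [0.0] * 160
    print(frames_until_silence([quiet, loud, loud, quiet, quiet, quiet, loud]))
```

Production detectors are usually more robust to background noise than a fixed threshold, so treat this only as a picture of the stop-on-silence behavior.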

Ready to go hands-free?

Free, open source, and built for privacy.
