Quick Start Guide

Get OpenWhispr up and running in under 5 minutes. This guide will walk you through installation, setup, and your first successful transcription.

Already installed? Skip to First Transcription.

Installation

Download OpenWhispr

Visit the GitHub Releases page and download the appropriate installer for your platform:

macOS: OpenWhispr-{version}-mac-{arch}.dmg (Apple Silicon or Intel)
Windows: OpenWhispr-{version}-win-x64.exe (NSIS installer)
Linux: Choose from .AppImage, .deb, .rpm, or .tar.gz

Install the application

macOS
Windows
Linux

Open the .dmg file
Drag OpenWhispr.app to your Applications folder
Right-click the app and select Open (first time only)
Click Open in the security dialog

macOS may show a security warning for unsigned apps. Right-click → Open bypasses this.

Run the OpenWhispr-Setup-{version}.exe installer
Follow the installation wizard
Launch from Start Menu or Desktop shortcut

Windows may show SmartScreen warning. Click “More info” → “Run anyway” if needed.

# Make executable
chmod +x OpenWhispr-*.AppImage

# Run directly
./OpenWhispr-*.AppImage

Grant permissions

When you first launch OpenWhispr, your OS will request necessary permissions:Microphone Access (Required)

Needed to record your voice
You’ll see a system prompt on first use

Accessibility Permissions (macOS only, recommended)

Enables automatic text pasting at cursor
Go to System Settings → Privacy & Security → Accessibility
Enable checkbox for OpenWhispr
Without this, text copies to clipboard (manual paste with Cmd+V)

The Control Panel has a “Fix Permission Issues” button if you need to adjust settings later.

First-Time Setup

OpenWhispr guides you through setup with an interactive onboarding flow:

Welcome Screen

Choose your setup path:

Sign In: Use OpenWhispr Cloud (recommended for beginners)
Skip for Now: Set up local or BYOK processing

Choose Processing Method

OpenWhispr Cloud (Easiest)
Bring Your Own Key
Local Processing

Recommended for first-time users

Click “Sign in with Google” or “Sign in with Email”
Complete authentication flow
Verify email if using email sign-in
Start with 2,000 free words/week (7-day Pro trial included)

No API keys, no downloads. Start transcribing immediately after sign-in.

For users with existing API accounts

Select “Cloud (API Key)” as processing method
Choose provider: OpenAI, Groq, Mistral, or AssemblyAI
Enter your API key
Select transcription model

OPENAI_API_KEY=sk-proj-...

API keys are stored securely in your system keychain and never transmitted except to the chosen provider.

For maximum privacy

Select “Local” as processing method
Choose engine: Whisper (quality) or Parakeet (speed)
Select model size
Download model (automatic on first use)

Whisper Models:

base (142MB) - Recommended for most users
small (466MB) - Better quality
tiny (75MB) - Fastest, lower accuracy

Parakeet Models:

parakeet-tdt-0.6b-v3 (680MB) - Fast, 25 languages

Models download to ~/.cache/openwhispr/ and are reused across sessions.

Configure Hotkey

Set your global dictation hotkey:

Default: Backtick (`) on most platforms
macOS: Option to use Globe/Fn key
Custom: Click the input field and press your preferred key combination

Recommended hotkeys:

Simple: `, F8, F9
Compound: Cmd+Shift+D, Ctrl+Alt+V

Choose an unused key that’s easy to reach. You can change this later in Settings.

Select Activation Mode

Choose how the hotkey triggers recording:

Tap to Toggle (Default): Press once to start, press again to stop
Push to Talk (Windows): Hold key to record, release to transcribe

Push-to-talk requires native keyboard hooks and is currently only available on Windows.

Complete Setup

Click “Finish Setup” to save your configuration. OpenWhispr is now ready to use!

Your First Transcription

Now let’s create your first transcription:

Open any text field

Click into any application where you want text to appear:

Email composer
Note-taking app
Text editor
Chat application
Anywhere with a text input!

Press your hotkey

Press the hotkey you configured (default: backtick `).You’ll see:

A small draggable panel appears
Panel shows recording animation (pulsing waves)
Your microphone starts capturing audio

The panel can be dragged anywhere on screen. Its position is saved for next time.

Speak clearly

Say your message naturally. Examples:

“Hello, this is a test of OpenWhispr dictation.”
“Meeting notes: Discussed Q1 roadmap with Sarah and John.”
“Hey OpenWhispr, write a professional email thanking the client for their feedback.”

Speak naturally at a normal pace. Pause briefly between sentences for better punctuation.

Stop recording

Press your hotkey again (or release if using push-to-talk).What happens:

Recording stops immediately
Panel shows processing animation (loading dots)
Audio is transcribed (local or cloud, depending on your setup)
Processing time: 1-10 seconds depending on method and length

See your transcription

Transcribed text appears automatically:

With accessibility permissions: Text pastes at cursor location
Without permissions: Text copies to clipboard (paste with Cmd+V or Ctrl+V)
AI processing: If you addressed your agent (“Hey OpenWhispr…”), the AI-processed result appears

All transcriptions are saved to local history. Access them via the Control Panel.

Testing Different Features

Basic Dictation

Simple transcription without AI processing:

Press hotkey → "This is a simple test message" → Press hotkey
Result: This is a simple test message

AI-Powered Processing

Address your agent for intelligent text transformation:

Press hotkey → "Hey OpenWhispr, make this more professional: thanks for the help" → Press hotkey
Result: Thank you for your assistance. I appreciate your help with this matter.

AI processing requires an AI provider to be configured (OpenWhispr Cloud, OpenAI, Anthropic, Gemini, Groq, or local LLM).

Custom Dictionary

Improve accuracy for technical terms:

Open Control Panel (right-click tray icon)
Go to Settings → Custom Dictionary
Add terms: “Kubernetes”, “OAuth”, “PostgreSQL”
Dictate: “We deployed the app to Kubernetes using OAuth for authentication”
Results are now more accurate!

Accessing the Control Panel

The Control Panel is your hub for settings, history, and model management: How to open:

macOS: Right-click the menu bar icon → “Show Control Panel”
Windows: Right-click the system tray icon → “Show Control Panel”
Linux: Right-click the system tray icon → “Show Control Panel”

What you can do:

View and search transcription history
Switch between local and cloud processing
Download and manage Whisper/Parakeet models
Configure API keys for different providers
Customize hotkey and activation mode
Add words to custom dictionary
Check system permissions
View usage statistics (for cloud users)

Common First-Time Issues

No audio detected

Solutions:

Check microphone permissions in system settings
Select correct input device in Control Panel → Settings
Test microphone with another app to verify hardware
Restart OpenWhispr after granting permissions

Open sound settings:

macOS: Control Panel → “Open Sound Settings” button
Windows: Control Panel → “Open Sound Settings” button
Linux: Use pavucontrol or system sound settings

Hotkey doesn't work

Solutions:

Check if another app is using the same hotkey
Try a different key (F8, F9, or compound hotkey)
On GNOME Wayland: Check Settings → Keyboard → Shortcuts for conflicts
Restart the app after changing hotkeys

OpenWhispr will automatically fall back to F8 if your chosen hotkey is unavailable.

Text doesn't paste automatically

macOS:

Grant accessibility permissions in System Settings
Go to Privacy & Security → Accessibility
Enable OpenWhispr checkbox
Restart app if needed

Windows:

No special permissions needed
Text should paste automatically via SendInput API
If failing, text copies to clipboard (Ctrl+V to paste)

Linux:

Install paste tool: sudo apt install xdotool (X11)
For Wayland: sudo apt install wtype or ydotool
Text always copies to clipboard as fallback

Model download fails (local processing)

Solutions:

Check internet connection
Ensure sufficient disk space (75MB to 3GB depending on model)
Try downloading manually from Control Panel → Settings → Models
Check ~/.cache/openwhispr/ permissions
Try a smaller model first (tiny or base)

Models download from HuggingFace and may take several minutes on slow connections.

Transcription is inaccurate

Improve accuracy:

Use a better quality model (small, medium, or large)
Add specialized terms to Custom Dictionary
Speak clearly and at a moderate pace
Reduce background noise
Use a better quality microphone
Set your language explicitly (Settings → Language) instead of auto-detect

The base model offers the best balance for most users. Upgrade to small or medium for technical content.

Next Steps

Now that you have OpenWhispr working:

Explore Features

Learn about AI processing, custom dictionary, and advanced options

Configuration Guide

Customize settings, hotkeys, and processing preferences

Platform Guides

Platform-specific features and setup

Troubleshooting

Get help with common issues and error messages

Documentation

Return to main documentation

Configuration Guide

Customize settings, hotkeys, and processing preferences

Keyboard Shortcuts

Master all keyboard shortcuts and productivity tips

Troubleshooting

Solutions for common issues and platform-specific problems

Getting Help

GitHub Issues

Report bugs or request features

Documentation

Browse complete documentation

Community

Ask questions and share tips

Get Started

Core Features

Configuration

Platform Guides

Advanced

Quick Start Guide

Quick Start Guide

Installation

First-Time Setup

Your First Transcription

Testing Different Features

Basic Dictation

AI-Powered Processing

Custom Dictionary

Accessing the Control Panel

Common First-Time Issues

Next Steps

Explore Features

Configuration Guide

Platform Guides

Troubleshooting

Documentation

Configuration Guide

Keyboard Shortcuts

Troubleshooting

Getting Help

GitHub Issues

Documentation

Community

Build docs developers (and LLMs) love

Get Started

Core Features

Configuration

Platform Guides

Advanced

​Quick Start Guide

​Installation

​First-Time Setup

​Your First Transcription

​Testing Different Features

​Basic Dictation

​AI-Powered Processing

​Custom Dictionary

​Accessing the Control Panel

​Common First-Time Issues

​Next Steps

Explore Features

Configuration Guide

Platform Guides

Troubleshooting

Documentation

Configuration Guide

Keyboard Shortcuts

Troubleshooting

​Getting Help

GitHub Issues

Documentation

Community

Build docs developers (and LLMs) love

Quick Start Guide

Installation

First-Time Setup

Your First Transcription

Testing Different Features

Basic Dictation

AI-Powered Processing

Custom Dictionary

Accessing the Control Panel

Common First-Time Issues

Next Steps

Getting Help