Quick Start Guide
Get OpenWhispr up and running in under 5 minutes. This guide will walk you through installation, setup, and your first successful transcription.Already installed? Skip to First Transcription.
Installation
Download OpenWhispr
Visit the GitHub Releases page and download the appropriate installer for your platform:
- macOS:
OpenWhispr-{version}-mac-{arch}.dmg(Apple Silicon or Intel) - Windows:
OpenWhispr-{version}-win-x64.exe(NSIS installer) - Linux: Choose from
.AppImage,.deb,.rpm, or.tar.gz
Install the application
- macOS
- Windows
- Linux
- Open the
.dmgfile - Drag OpenWhispr.app to your Applications folder
- Right-click the app and select Open (first time only)
- Click Open in the security dialog
macOS may show a security warning for unsigned apps. Right-click → Open bypasses this.
Grant permissions
When you first launch OpenWhispr, your OS will request necessary permissions:Microphone Access (Required)
- Needed to record your voice
- You’ll see a system prompt on first use
- Enables automatic text pasting at cursor
- Go to System Settings → Privacy & Security → Accessibility
- Enable checkbox for OpenWhispr
- Without this, text copies to clipboard (manual paste with Cmd+V)
First-Time Setup
OpenWhispr guides you through setup with an interactive onboarding flow:Welcome Screen
Choose your setup path:
- Sign In: Use OpenWhispr Cloud (recommended for beginners)
- Skip for Now: Set up local or BYOK processing
Choose Processing Method
- OpenWhispr Cloud (Easiest)
- Bring Your Own Key
- Local Processing
Recommended for first-time users
- Click “Sign in with Google” or “Sign in with Email”
- Complete authentication flow
- Verify email if using email sign-in
- Start with 2,000 free words/week (7-day Pro trial included)
No API keys, no downloads. Start transcribing immediately after sign-in.
Configure Hotkey
Set your global dictation hotkey:
- Default: Backtick (`) on most platforms
- macOS: Option to use Globe/Fn key
- Custom: Click the input field and press your preferred key combination
- Simple: `, F8, F9
- Compound:
Cmd+Shift+D,Ctrl+Alt+V
Select Activation Mode
Choose how the hotkey triggers recording:
- Tap to Toggle (Default): Press once to start, press again to stop
- Push to Talk (Windows): Hold key to record, release to transcribe
Push-to-talk requires native keyboard hooks and is currently only available on Windows.
Your First Transcription
Now let’s create your first transcription:Open any text field
Click into any application where you want text to appear:
- Email composer
- Note-taking app
- Text editor
- Chat application
- Anywhere with a text input!
Press your hotkey
Press the hotkey you configured (default: backtick `).You’ll see:
- A small draggable panel appears
- Panel shows recording animation (pulsing waves)
- Your microphone starts capturing audio
The panel can be dragged anywhere on screen. Its position is saved for next time.
Speak clearly
Say your message naturally. Examples:
- “Hello, this is a test of OpenWhispr dictation.”
- “Meeting notes: Discussed Q1 roadmap with Sarah and John.”
- “Hey OpenWhispr, write a professional email thanking the client for their feedback.”
Stop recording
Press your hotkey again (or release if using push-to-talk).What happens:
- Recording stops immediately
- Panel shows processing animation (loading dots)
- Audio is transcribed (local or cloud, depending on your setup)
- Processing time: 1-10 seconds depending on method and length
See your transcription
Transcribed text appears automatically:
- With accessibility permissions: Text pastes at cursor location
- Without permissions: Text copies to clipboard (paste with Cmd+V or Ctrl+V)
- AI processing: If you addressed your agent (“Hey OpenWhispr…”), the AI-processed result appears
All transcriptions are saved to local history. Access them via the Control Panel.
Testing Different Features
Basic Dictation
Simple transcription without AI processing:AI-Powered Processing
Address your agent for intelligent text transformation:AI processing requires an AI provider to be configured (OpenWhispr Cloud, OpenAI, Anthropic, Gemini, Groq, or local LLM).
Custom Dictionary
Improve accuracy for technical terms:- Open Control Panel (right-click tray icon)
- Go to Settings → Custom Dictionary
- Add terms: “Kubernetes”, “OAuth”, “PostgreSQL”
- Dictate: “We deployed the app to Kubernetes using OAuth for authentication”
- Results are now more accurate!
Accessing the Control Panel
The Control Panel is your hub for settings, history, and model management: How to open:- macOS: Right-click the menu bar icon → “Show Control Panel”
- Windows: Right-click the system tray icon → “Show Control Panel”
- Linux: Right-click the system tray icon → “Show Control Panel”
- View and search transcription history
- Switch between local and cloud processing
- Download and manage Whisper/Parakeet models
- Configure API keys for different providers
- Customize hotkey and activation mode
- Add words to custom dictionary
- Check system permissions
- View usage statistics (for cloud users)
Common First-Time Issues
No audio detected
No audio detected
Solutions:
- Check microphone permissions in system settings
- Select correct input device in Control Panel → Settings
- Test microphone with another app to verify hardware
- Restart OpenWhispr after granting permissions
- macOS: Control Panel → “Open Sound Settings” button
- Windows: Control Panel → “Open Sound Settings” button
- Linux: Use
pavucontrolor system sound settings
Hotkey doesn't work
Hotkey doesn't work
Solutions:
- Check if another app is using the same hotkey
- Try a different key (F8, F9, or compound hotkey)
- On GNOME Wayland: Check Settings → Keyboard → Shortcuts for conflicts
- Restart the app after changing hotkeys
OpenWhispr will automatically fall back to F8 if your chosen hotkey is unavailable.
Text doesn't paste automatically
Text doesn't paste automatically
macOS:
- Grant accessibility permissions in System Settings
- Go to Privacy & Security → Accessibility
- Enable OpenWhispr checkbox
- Restart app if needed
- No special permissions needed
- Text should paste automatically via SendInput API
- If failing, text copies to clipboard (Ctrl+V to paste)
- Install paste tool:
sudo apt install xdotool(X11) - For Wayland:
sudo apt install wtypeorydotool - Text always copies to clipboard as fallback
Model download fails (local processing)
Model download fails (local processing)
Solutions:
- Check internet connection
- Ensure sufficient disk space (75MB to 3GB depending on model)
- Try downloading manually from Control Panel → Settings → Models
- Check
~/.cache/openwhispr/permissions - Try a smaller model first (tiny or base)
Transcription is inaccurate
Transcription is inaccurate
Improve accuracy:
- Use a better quality model (small, medium, or large)
- Add specialized terms to Custom Dictionary
- Speak clearly and at a moderate pace
- Reduce background noise
- Use a better quality microphone
- Set your language explicitly (Settings → Language) instead of auto-detect
Next Steps
Now that you have OpenWhispr working:Explore Features
Learn about AI processing, custom dictionary, and advanced options
Configuration Guide
Customize settings, hotkeys, and processing preferences
Platform Guides
Platform-specific features and setup
Troubleshooting
Get help with common issues and error messages
Documentation
Return to main documentation
Configuration Guide
Customize settings, hotkeys, and processing preferences
Keyboard Shortcuts
Master all keyboard shortcuts and productivity tips
Troubleshooting
Solutions for common issues and platform-specific problems
Getting Help
GitHub Issues
Report bugs or request features
Documentation
Browse complete documentation
Community
Ask questions and share tips