Transcription Modes - OpenWhispr

OpenWhispr offers three transcription modes to balance privacy, speed, cost, and offline capabilities. Choose the mode that best fits your workflow.

Overview of Modes

OpenWhispr Cloud

Instant transcription with no API keys needed. Free and Pro plans available.

BYOK (Bring Your Own Key)

Use your own API keys from OpenAI, Groq, or Mistral for full control.

Local Processing

Complete privacy — your voice never leaves your device. Works offline.

Mode Comparison

Feature	OpenWhispr Cloud	BYOK	Local Processing
Privacy	⚠️ Audio sent to cloud	⚠️ Audio sent to provider	✅ Audio stays on device
Speed	⚡ Very fast	⚡ Fast	🐢 Slower (depends on hardware)
Cost	Free tier or $9/mo Pro	Pay per use	Free (after model download)
Offline	❌ Requires internet	❌ Requires internet	✅ Works offline
Setup	Email/Google sign-in	API key required	Download models (75MB - 3GB)
Accuracy	⭐⭐⭐⭐⭐ Excellent	⭐⭐⭐⭐⭐ Excellent	⭐⭐⭐⭐ Very good
Model Choice	Pre-selected	Multiple providers	6 Whisper models + Parakeet

OpenWhispr Cloud

The easiest way to get started — no API keys, no configuration, just sign in and start dictating.

Features

Free Tier: 2,000 words per week
Pro Tier: Unlimited transcriptions for $9/month
7-Day Free Trial: Try Pro features risk-free
Sign-In Options: Google OAuth or email/password
Managed Infrastructure: No API key management needed

When to Use

Getting Started

Perfect for trying OpenWhispr without any setup or configuration.

Casual Users

Ideal if you dictate occasionally and want a simple experience.

High Volume

Pro plan offers unlimited transcriptions for heavy users.

Team Usage

Share referral links to earn free Pro months for your team.

How to Set Up

Open Settings

Go to Settings → Transcription

Select OpenWhispr Cloud

Choose the OpenWhispr Cloud option (should be selected by default if signed in)

If not already signed in, click Sign In and choose Google or email/password

Start Dictating

Your transcriptions are automatically counted toward your weekly quota or Pro plan

Check your usage at any time in Settings → Account to see how many words you’ve used this week.

Referral Program

Earn free Pro months by inviting friends and colleagues to OpenWhispr Cloud. How it works:

Go to Settings → Referrals to get your unique referral link
Share your link via email or copy it to share anywhere
When someone signs up using your link and uses 2,000 words, you both earn rewards
Track your referrals and rewards in the Referrals dashboard

Rewards:

You earn: 1 free Pro month for each completed referral
They earn: Extended free trial or bonus credits
No limit on referrals — invite as many people as you want!

A referral is “completed” when the person you invited transcribes at least 2,000 words using OpenWhispr Cloud.

BYOK (Bring Your Own Key)

Bring Your Own Key mode lets you use your own API keys from supported providers, giving you full control over billing and model selection.

Supported Providers

OpenAI
Groq
Mistral
Custom Endpoint

Models Available:

gpt-4o-transcribe — Most accurate
gpt-4o-mini-transcribe — Fast and accurate
whisper-1 — Original Whisper model

Cost: ~$0.006 per minuteGet API Key: OpenAI Platform

Models Available:

whisper-large-v3-turbo — 216x real-time speed
whisper-large-v3 — High accuracy

Cost: Very low (see Groq pricing)Get API Key: Groq Console

Groq offers extremely fast inference — often transcribing in under 1 second.

Models Available:

voxtral-mini-latest — Fast multilingual transcription

Cost: See Mistral pricingGet API Key: Mistral AI Platform

When to Use

High Volume

If you transcribe heavily and want to manage your own costs.

Enterprise

Organizations with existing API contracts or custom endpoints.

Speed Priority

Groq’s ultra-fast inference is perfect for real-time workflows.

Custom Infrastructure

You’re running your own Whisper API or using alternative providers.

How to Set Up

Get API Key

Open Settings

Navigate to Settings → Transcription

Select BYOK Mode

Choose Bring Your Own Key (BYOK) from the mode selector

Choose Provider

Select your provider (OpenAI, Groq, Mistral, or Custom)

Enter API Key

Paste your API key into the appropriate field

Select Model

Choose your preferred model from the dropdown

Save

Your settings are saved automatically

Keep your API keys secure! They are stored locally in your system keychain and never sent to OpenWhispr servers.

Local Processing

Local processing keeps your voice data completely private by running speech recognition directly on your device using Whisper or NVIDIA Parakeet models.

Available Engines

Whisper (whisper.cpp)
NVIDIA Parakeet (sherpa-onnx)

Overview: OpenAI’s Whisper model via high-performance C++ implementationModels:

Tiny (75MB) — Fastest, lower quality
Base (142MB) — ⭐ Recommended balance
Small (466MB) — Better quality, slower
Medium (1.5GB) — High quality
Large (3GB) — Best quality, slowest
Turbo (1.6GB) — Fast with good quality

Languages: 58 languages supportedHardware: Runs on CPU (no GPU required)

When to Use

Privacy-First

Your audio never leaves your device — perfect for sensitive content.

Offline Work

Continue dictating without internet connection (flights, remote areas).

Cost Savings

After downloading models, transcription is completely free.

Custom Control

Choose exactly which model and parameters to use.

How to Set Up

Open Settings

Navigate to Settings → Transcription

Select Local Processing

Toggle Use Local Processing to ON

Choose Engine

Select Whisper or NVIDIA Parakeet as your transcription engine

Download Model

For Whisper: Select model size (Base recommended for most users)
For Parakeet: Select Parakeet TDT 0.6B
Click Download Model and wait for completion

Start Dictating

Your audio is now processed locally on your device

Model Storage

macOS/Linux: ~/.cache/openwhispr/whisper-models/ or ~/.cache/openwhispr/parakeet-models/
Windows: %USERPROFILE%\.cache\openwhispr\whisper-models\ or parakeet-models\

You can remove downloaded models in Settings → Transcription → Local Model Storage → Remove Downloaded Models.

Start with the Base Whisper model (142MB) — it offers the best balance of speed and accuracy for most users.

Performance Tips

Choose the Right Model

Quick dictation: Use Tiny or Base
General use: Use Base or Small (recommended)
High accuracy: Use Medium or Large
Fast multilingual: Use Parakeet TDT 0.6B

Hardware Considerations

Larger models require more RAM (Large needs ~4GB free)
Modern CPUs (2018+) handle Base/Small models well
Older hardware should stick to Tiny or Base
Parakeet is optimized for CPU inference

Processing Time

Tiny: ~0.1x real-time (10-second audio = 1 second)
Base: ~0.3x real-time (10-second audio = 3 seconds)
Small: ~0.8x real-time (10-second audio = 8 seconds)
Medium/Large: 1-3x real-time (can be slower than audio length)
Parakeet: ~0.2x real-time (very fast)

Switching Between Modes

You can switch modes at any time:

Open Settings → Transcription
Select your desired mode
Configure any required API keys or download models
Start dictating — the new mode takes effect immediately

Your mode preference is saved automatically and persists across app restarts.

Which Mode Should You Choose?

Start with OpenWhispr Cloud

If you’re new to OpenWhispr, begin with the cloud mode — it’s the easiest way to get started and includes a generous free tier.

Try Local for Privacy

If you handle sensitive information (medical, legal, confidential), switch to local processing for complete privacy.

Use BYOK for Control

If you need specific models, custom endpoints, or want to manage your own costs, use BYOK mode.

Next Steps

AI Processing

Enhance your transcriptions with AI formatting and cleanup

Custom Dictionary

Add technical terms and names for better accuracy

Notes System

Create organized voice notes with transcription

Basic Dictation

Learn the fundamentals of using OpenWhispr

Get Started

Core Features

Configuration

Platform Guides

Advanced

​Overview of Modes

OpenWhispr Cloud

BYOK (Bring Your Own Key)

Local Processing

​Mode Comparison

​OpenWhispr Cloud

​Features

​When to Use

Getting Started

Casual Users

High Volume

Team Usage

​How to Set Up

​Referral Program

​BYOK (Bring Your Own Key)

​Supported Providers

​When to Use

High Volume

Enterprise

Speed Priority

Custom Infrastructure

​How to Set Up

​Local Processing

​Available Engines

​When to Use

Privacy-First

Offline Work

Cost Savings

Custom Control

​How to Set Up

​Model Storage

​Performance Tips

​Switching Between Modes

​Which Mode Should You Choose?

​Next Steps

AI Processing

Custom Dictionary

Notes System

Basic Dictation

Build docs developers (and LLMs) love

Overview of Modes

Mode Comparison

OpenWhispr Cloud

Features

When to Use

How to Set Up

Referral Program

BYOK (Bring Your Own Key)

Supported Providers

When to Use

How to Set Up

Local Processing

Available Engines

When to Use

How to Set Up

Model Storage

Performance Tips

Switching Between Modes

Which Mode Should You Choose?

Next Steps