Overview
OpenWhispr supports NVIDIA Parakeet models for local transcription via the sherpa-onnx runtime. Parakeet models offer:
- Fast inference with INT8 quantization
- Multilingual support (25 European languages)
- Automatic language detection
- Privacy-first local processing
Model Technology
sherpa-onnx (ONNX Runtime for cross-platform inference)
INT8 (8-bit integer quantization for efficient CPU inference)
NVIDIA NeMo Transducer (streaming ASR architecture)
Storage Location
Models are cached locally. Each model directory contains:
- encoder.int8.onnx
- decoder.int8.onnx
- joiner.int8.onnx
- tokens.txt
Available Models
Parakeet TDT 0.6B v3 (680MB) — Recommended
Parakeet TDT 0.6B
parakeet-tdt-0.6b-v3
680MB
Multilingual (automatic detection)
25 languages:
- Germanic: English, German, Dutch, Swedish, Danish
- Romance: French, Spanish, Portuguese, Italian, Romanian
- Slavic: Russian, Ukrainian, Polish, Czech, Slovak, Bulgarian, Croatian, Slovenian
- Baltic: Latvian, Lithuanian, Estonian
- Other: Greek, Hungarian, Maltese, Finnish
https://github.com/k2-fsa/sherpa-onnx/releases/download/asr-models/sherpa-onnx-nemo-parakeet-tdt-0.6b-v3-int8.tar.bz2
sherpa-onnx-nemo-parakeet-tdt-0.6b-v3-int8
true
Supported Languages (Full List)
View all 25 supported languages
| Code | Language | Code | Language |
|---|---|---|---|
| bg | Bulgarian | it | Italian |
| hr | Croatian | lv | Latvian |
| cs | Czech | lt | Lithuanian |
| da | Danish | mt | Maltese |
| nl | Dutch | pl | Polish |
| en | English | pt | Portuguese |
| et | Estonian | ro | Romanian |
| fi | Finnish | sk | Slovak |
| fr | French | sl | Slovenian |
| de | German | es | Spanish |
| el | Greek | sv | Swedish |
| hu | Hungarian | ru | Russian |
| — | — | uk | Ukrainian |
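The table can also be expressed as a lookup, for example to validate a language code before handing audio to Parakeet. This is a minimal TypeScript sketch; the names `PARAKEET_LANGUAGES` and `isParakeetLanguage` are hypothetical and are not part of OpenWhispr's API.

```typescript
// Hypothetical helper: the codes and names are copied from the table above.
const PARAKEET_LANGUAGES: ReadonlyMap<string, string> = new Map([
  ["bg", "Bulgarian"], ["hr", "Croatian"], ["cs", "Czech"],
  ["da", "Danish"], ["nl", "Dutch"], ["en", "English"],
  ["et", "Estonian"], ["fi", "Finnish"], ["fr", "French"],
  ["de", "German"], ["el", "Greek"], ["hu", "Hungarian"],
  ["it", "Italian"], ["lv", "Latvian"], ["lt", "Lithuanian"],
  ["mt", "Maltese"], ["pl", "Polish"], ["pt", "Portuguese"],
  ["ro", "Romanian"], ["sk", "Slovak"], ["sl", "Slovenian"],
  ["es", "Spanish"], ["sv", "Swedish"], ["ru", "Russian"],
  ["uk", "Ukrainian"],
]);

// Normalizes the input (trim + lowercase) before checking the table.
function isParakeetLanguage(code: string): boolean {
  return PARAKEET_LANGUAGES.has(code.trim().toLowerCase());
}
```

A caller could fall back to Whisper whenever `isParakeetLanguage` returns `false`.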
When to Use Parakeet vs Whisper
Parakeet vs Whisper Comparison
Use Parakeet When:
- You need fast multilingual transcription (25 European languages)
- You want automatic language detection
- You’re transcribing European languages (especially Slavic/Baltic)
- You prefer lighter resource usage
Use Whisper When:
- You need support for 58+ languages including Asian languages
- You require highest accuracy for English
- You’re working with technical/medical terminology
- You need fine-grained model size options (6 models vs 1)
Downloading Models
Via UI
Models can be downloaded from Settings → Models → Parakeet Models.
Programmatically
Parakeet models are distributed as .tar.bz2 archives and are automatically extracted after download. The extraction process requires the tar command to be available on your system.
Checking Model Status
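One way to reason about model status is to check that the four files listed under Storage Location exist in a model directory. The sketch below assumes exactly that; `missingModelFiles` and `isModelDownloaded` are hypothetical names, not OpenWhispr functions, and the directory path is left to the caller because the real cache location is not shown here.

```typescript
import { existsSync } from "node:fs";
import { join } from "node:path";

// File names taken from the Storage Location section.
const REQUIRED_MODEL_FILES = [
  "encoder.int8.onnx",
  "decoder.int8.onnx",
  "joiner.int8.onnx",
  "tokens.txt",
] as const;

// Returns the required files that are absent from modelDir.
// modelDir is supplied by the caller; no cache path is assumed.
function missingModelFiles(modelDir: string): string[] {
  return REQUIRED_MODEL_FILES.filter(
    (name) => !existsSync(join(modelDir, name)),
  );
}

// A model counts as downloaded only when every required file is present.
function isModelDownloaded(modelDir: string): boolean {
  return missingModelFiles(modelDir).length === 0;
}
```

Reporting the missing file names, rather than a bare boolean, makes a partially extracted archive easier to spot.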
Listing Models
Deleting Models
Transcription
Server Management
Starting the Server
Checking Server Status
Stopping the Server
Server Pre-warming
Like Whisper, Parakeet supports server pre-warming on startup: when LOCAL_TRANSCRIPTION_PROVIDER=nvidia is set in .env, the Parakeet server automatically starts on app launch, eliminating cold-start delays.
Provider Configuration
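As a rough illustration of how the LOCAL_TRANSCRIPTION_PROVIDER setting could drive startup behavior, the sketch below checks the variable and reports whether Parakeet should be pre-warmed. Only the variable name and the value `nvidia` come from the text above; the helper name and the exact-match comparison are assumptions, not OpenWhispr's actual implementation.

```typescript
// Hypothetical helper; only the variable name and "nvidia" come from the docs.
type Env = Record<string, string | undefined>;

// Assumes an exact, case-sensitive match on the provider value.
function shouldPrewarmParakeet(env: Env): boolean {
  return env["LOCAL_TRANSCRIPTION_PROVIDER"] === "nvidia";
}
```

In a Node.js or Electron main process this could be called as `shouldPrewarmParakeet(process.env)` once the .env file has been loaded.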
Set Parakeet as your transcription provider:
Binary Location
The sherpa-onnx binary is bundled with the app:
- macOS: arm64, x64
- Linux: x64, arm64
- Windows: x64
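The platform list above can be checked at runtime against Node.js's `process.platform` and `process.arch`. This is a sketch under the assumption that the bundled targets map to Node's naming (`darwin`, `linux`, `win32`); `BUNDLED_TARGETS` and `hasBundledBinary` are hypothetical names.

```typescript
// Platform/architecture pairs from the list above, in Node.js naming.
const BUNDLED_TARGETS: ReadonlyMap<string, readonly string[]> = new Map([
  ["darwin", ["arm64", "x64"]], // macOS
  ["linux", ["x64", "arm64"]],
  ["win32", ["x64"]], // Windows
]);

// True when a bundled sherpa-onnx binary is listed for this platform/arch pair.
function hasBundledBinary(platform: string, arch: string): boolean {
  return BUNDLED_TARGETS.get(platform)?.includes(arch) ?? false;
}
```

Calling `hasBundledBinary(process.platform, process.arch)` before starting the server gives an early, readable failure on unlisted targets such as Windows on arm64.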
Installation Check
Parakeet models require the tar command for extraction. On Windows, ensure you have tar available (included in Windows 10+ by default).
Diagnostics
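Because extraction depends on tar, a quick diagnostic is to confirm the command can be run at all. The sketch below uses Node.js's `spawnSync` to run `tar --version`; the helper name `isCommandAvailable` is hypothetical, and this is not necessarily how OpenWhispr performs its own check.

```typescript
import { spawnSync } from "node:child_process";

// Returns true when `<command> --version` starts successfully and exits with 0.
function isCommandAvailable(command: string): boolean {
  const result = spawnSync(command, ["--version"], { stdio: "ignore" });
  // result.error is set when the process could not be spawned (e.g. ENOENT).
  return result.error === undefined && result.status === 0;
}

const tarAvailable = isCommandAvailable("tar");
console.log(
  tarAvailable
    ? "tar found"
    : "tar missing: Parakeet archives cannot be extracted",
);
```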
Model Registry
All Parakeet models are defined in src/models/modelRegistryData.json.