Skip to main content

Documentation Index

Fetch the complete documentation index at: https://mintlify.com/KoljaB/RealtimeSTT/llms.txt

Use this file to discover all available pages before exploring further.

RealtimeSTT itself is released under the MIT license. Optional transcription, VAD, and wake-word engines pull in their own packages, model weights, service terms, and model cards — and those upstream terms can change independently from this repository. This page is a practical summary, not legal advice. Before shipping a product, redistributing model files, bundling wheels, or offering hosted transcription, review the upstream license files and terms for the exact package, model, revision, API, and distribution path you use.

Important Limits

  • “Commercial use” in the tables below means the cited upstream license or model card does not obviously prohibit commercial use. It does not cover privacy, consent, biometrics, export controls, trademarks, data-processing agreements, or platform-specific terms.
  • Runtime code and model weights can have different licenses.
  • Hosted APIs and model hubs can add access terms even when the model weights are open-licensed.
  • If you redistribute binaries, wheels, model files, or Docker images, preserve upstream copyright notices and license texts.
  • For Hugging Face models, verify the license on the exact model repository and revision you download. Derived, converted, quantized, or ONNX/GGUF versions may carry the upstream model license plus additional notices.

ASR Engine Licenses

RealtimeSTT engineRuntime / package licenseDefault or typical model licenseCommercial-use note
faster_whisperfaster-whisper is MIT. Its inference runtime, CTranslate2, is MIT.OpenAI Whisper code and model assets are published in the openai/whisper repository under MIT.Generally permissive, subject to preserving MIT notices when redistributing.
whisper_cppwhisper.cpp is MIT. RealtimeSTT uses the pywhispercpp package; check that package metadata before redistributing wheels.Uses OpenAI Whisper-family model files; check whether you use original OpenAI files or third-party conversions.Generally permissive for original Whisper assets, but converted model files should be checked individually.
openai_whisperopenai-whisper is MIT.OpenAI Whisper assets in the same repository are MIT.Generally permissive, subject to MIT notice preservation.
sherpa_onnx_moonshine, sherpa_onnx_parakeetsherpa-onnx is Apache-2.0. ONNX Runtime is MIT.The model bundle determines the model license. Useful Sensors Moonshine Streaming is MIT; NVIDIA Parakeet is CC-BY-4.0.Keep the license and attribution files that ship with the selected sherpa-onnx model bundle. Do not assume every bundle has the same terms.
kroko_onnxkroko-onnx is Apache-2.0.Kroko Community models are CC-BY-SA; Commercial/OEM models are separately licensed.CC-BY-SA-style licenses generally require attribution and share-alike. Verify commercial fit with Kroko before using Community models commercially; free license keys are described as non-commercial.
parakeetNVIDIA NeMo source is Apache-2.0; some NeMo Framework containers and deployment artifacts have separate NVIDIA product terms.The default nvidia/parakeet-tdt-0.6b-v3 model card lists CC-BY-4.0 and is ready for commercial/non-commercial use.CC-BY-4.0 requires attribution. Review NVIDIA container/product terms if deploying through NIM, Riva, NeMo containers, or other NVIDIA services.
omnilingual_asrMeta’s omnilingual-asr package and model suite are documented as Apache-2.0.The Meta Omnilingual ASR code and models are Apache-2.0; Meta documents the accompanying corpus as CC-BY-4.0.The model license is permissive. If you use or redistribute corpus data, handle the corpus license separately.
moonshine, moonshine_streamingUses Hugging Face Transformers in RealtimeSTT; check the installed Transformers version and any model custom code.The default UsefulSensors/moonshine-streaming-medium model card lists MIT.English Moonshine Streaming weights are permissive. Non-default Moonshine variants may differ; check the exact model card.
granite_speech, graniteUses Hugging Face Transformers in RealtimeSTT.The default ibm-granite/granite-speech-4.1-2b model card lists Apache-2.0; IBM’s Granite docs describe Granite Speech as Apache-2.0.Generally permissive. Preserve Apache-2.0 notices when redistributing.
qwen3_asr, qwen_asrUses the qwen-asr package and, optionally, vLLM. Check those package licenses for redistribution.The default Qwen/Qwen3-ASR-1.7B model card lists Apache-2.0.Generally permissive for the default model. vLLM, Docker, ModelScope, or hosted serving paths may add their own terms.
cohere_transcribe, cohereUses Hugging Face Transformers and model-specific code from the model repository.The default CohereLabs/cohere-transcribe-03-2026 model card lists Apache-2.0 and describes the model as an open-source release.Generally permissive for the model card license. If access is gated or served through a provider, follow that provider’s access terms too.
openai_apiPlaceholder engine in this repository; not wired as a local ASR backend.N/AIf implemented later, usage would be governed by OpenAI API/service terms, not an open-source model license.

VAD and Wake-Word Licenses

ComponentUpstream termsPractical note
WebRTC VAD via webrtcvad-wheelsPyPI lists MIT.Permissive wrapper; preserve notices if redistributing.
Silero VADsilero-vad is MIT.Permissive for the packaged VAD assets, subject to MIT notices.
OpenWakeWordOpenWakeWord is documented as Apache-2.0.Permissive wake-word path. Verify any third-party wake-word model you add.
Porcupine / PicovoicePicovoice publishes separate service and SDK terms.Treat Porcupine as a provider-licensed dependency. Free trials or free plans may not cover all commercial deployment scenarios.
Porcupine commercial use requires a paid plan. Picovoice’s free tier is suitable for personal or non-commercial projects but has explicit limits on commercial deployments. Review the Picovoice terms of use before shipping any product that uses the Porcupine wake-word backend.

Key Callouts

Whisper Family

Faster-whisper, whisper.cpp, and openai-whisper all use OpenAI Whisper model weights published under MIT. Generally permissive for commercial use, subject to preserving MIT copyright notices.

OpenWakeWord

OpenWakeWord is Apache 2.0. Permissive for commercial use. Verify any third-party wake-word model file you add beyond the defaults.

Kroko / Banafo Models

Community models are CC-BY-SA (attribution and share-alike required). Pro and Commercial/OEM models are separately licensed and require a license key. Do not store license keys in code or committed files.

Porcupine

Porcupine is a commercial product with a free tier. The free tier may not cover production or commercial deployments. Review Picovoice’s terms before using it in a product.
The following upstream sources were checked when compiling this page (last researched: 2026-05-21):

Build docs developers (and LLMs) love