KittenTTS provides three model families at different sizes, letting you trade off quality, speed, and footprint depending on your target environment.Documentation Index
Fetch the complete documentation index at: https://mintlify.com/KittenML/KittenTTS/llms.txt
Use this file to discover all available pages before exploring further.
Model comparison
| Model | Parameters | Size | Quality | Best for |
|---|---|---|---|---|
| kitten-tts-mini | 80M | 80 MB | Highest | Production apps, quality-first use cases |
| kitten-tts-micro | 40M | 41 MB | Balanced | Most applications |
| kitten-tts-nano (fp32) | 15M | 56 MB | Good | Edge devices, real-time inference |
| kitten-tts-nano (int8) | 15M | 25 MB | Good | Embedded systems, mobile |
Choosing a model
kitten-tts-mini
80M params — 80 MBHighest output quality. Use when size is not a constraint and you need the best results for production applications.
kitten-tts-micro
40M params — 41 MBBalanced quality and speed. A good default for most applications where you want a smaller footprint without sacrificing too much quality.
kitten-tts-nano
15M params — 56 MB (fp32) / 25 MB (int8)Fastest inference. Best for edge devices, embedded systems, and real-time synthesis on constrained hardware.
kitten-tts-nano (int8)
15M params — 25 MBSmallest footprint via int8 quantization. Targets mobile and embedded deployments. See the nano page for known issues.
Switching between models
Pass the Hugging Face repo ID toKittenTTS to select a model. All models are downloaded automatically on first use.
Model caching
By default, models are cached in the platform’s standard cache directory. Use thecache_dir parameter to specify a custom location, which is useful for shared environments or when managing disk space explicitly.