KittenTTS requires Python 3.8 or higher and runs on Linux, macOS, and Windows. Models run on CPU only; no GPU is required.
System requirements
- Python 3.8+
- pip
- 25–80 MB disk space per model
- Linux, macOS, or Windows
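The pip, disk space, and operating system items above can be checked before installing. The following is a minimal sketch using only the standard library; `check_requirements`, `SUPPORTED_SYSTEMS`, and `MAX_MODEL_MB` are hypothetical names chosen for illustration, and the 80 MB threshold is simply the upper end of the per-model range listed above.

```python
import importlib.util
import platform
import shutil

# platform.system() reports macOS as "Darwin".
SUPPORTED_SYSTEMS = {"Linux", "Darwin", "Windows"}
# Upper end of the 25-80 MB per-model range (assumed worst case).
MAX_MODEL_MB = 80


def check_requirements(path=".", models=1):
    """Return a dict mapping each requirement name to True or False."""
    free_mb = shutil.disk_usage(path).free / (1024 * 1024)
    return {
        "os": platform.system() in SUPPORTED_SYSTEMS,
        "pip": importlib.util.find_spec("pip") is not None,
        "disk": free_mb >= MAX_MODEL_MB * models,
    }


if __name__ == "__main__":
    for name, ok in check_requirements().items():
        print("{}: {}".format(name, "ok" if ok else "NOT satisfied"))
```

Pass `models=N` if you plan to cache several models, so the disk check scales with the number of downloads.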
Install KittenTTS
Create a virtual environment
Using a virtual environment keeps your project dependencies isolated.
- venv
- conda
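For the venv option, the environment can also be created from Python itself through the standard library `venv` module, which is what `python -m venv` uses internally. This is a sketch rather than the project's documented procedure; the function name `create_env` and the `.venv` directory name are assumptions made for illustration.

```python
import os
import venv
from pathlib import Path


def create_env(env_dir=".venv", with_pip=True):
    """Create a virtual environment and return the expected interpreter path."""
    env_path = Path(env_dir)
    venv.create(env_path, with_pip=with_pip)
    # Windows places executables under Scripts/, other platforms under bin/.
    if os.name == "nt":
        return env_path / "Scripts" / "python.exe"
    return env_path / "bin" / "python"


if __name__ == "__main__":
    print("Environment interpreter:", create_env())
```

Activate the environment afterwards in the usual way for your shell before installing KittenTTS into it.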
Install KittenTTS
Install directly from the GitHub release wheel with pip.
This installs KittenTTS and all required dependencies: onnxruntime, soundfile, numpy, spacy, misaki, espeakng_loader, num2words, and huggingface_hub.
Models are downloaded automatically from Hugging Face the first time you load them and are cached locally for subsequent runs.
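After installation, one way to confirm that the dependencies were picked up is to check that each one can be located on the import path. The sketch below assumes that every package's import name matches the name given in the dependency list, which may not hold for every release; `missing_modules` is a hypothetical helper, not part of KittenTTS.

```python
import importlib.util

# Names taken from the dependency list; assumed to match the import names.
DEPENDENCIES = [
    "onnxruntime", "soundfile", "numpy", "spacy",
    "misaki", "espeakng_loader", "num2words", "huggingface_hub",
]


def missing_modules(names=DEPENDENCIES):
    """Return the names that cannot be found on the current import path."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_modules()
    if missing:
        print("Missing: " + ", ".join(missing))
    else:
        print("All dependencies found.")
```

`find_spec` only locates each module without importing it, so the check stays fast even for heavy packages such as onnxruntime.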
Troubleshooting
espeakng dependency error
KittenTTS uses espeakng_loader to handle phoneme conversion. On some Linux systems, you may need to install the eSpeak NG system library. On Windows and macOS, espeakng_loader typically bundles the required binaries, so no additional steps are needed.
spaCy model errors
If you see errors related to spaCy language models, install the required English model.
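spaCy models are installed as regular Python packages, and spaCy provides a `python -m spacy download <model>` command to fetch them. The sketch below builds that command only when the model is not already importable. The model name `en_core_web_sm` is an assumption used for illustration (this page does not name the model), so substitute whichever model the error message mentions; `spacy_download_command` is a hypothetical helper.

```python
import importlib.util
import sys

# Assumption: illustrative model name; use the one named in your error message.
ASSUMED_MODEL = "en_core_web_sm"


def spacy_download_command(model=ASSUMED_MODEL):
    """Return the download command for `model`, or None if it is already installed."""
    if importlib.util.find_spec(model) is not None:
        return None
    # Use the running interpreter so the model lands in the active environment.
    return [sys.executable, "-m", "spacy", "download", model]


if __name__ == "__main__":
    command = spacy_download_command()
    if command is None:
        print("Model already installed.")
    else:
        print("Run: " + " ".join(command))
```

The returned list can be passed to `subprocess.run(command, check=True)` if you prefer to run the download from a script.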
pip SSL or network errors
If pip cannot reach the GitHub release URL, download the wheel manually, then install it by passing the local file path to pip install instead of the URL.
Python version not supported
KittenTTS requires Python 3.8 or higher. Check your version by running python --version. If you are on an older version, install a supported Python release from python.org.
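The same check can be done from inside Python, which is useful when several interpreters are installed and it is unclear which one a script runs under. This is a minimal sketch; `is_supported` and `MIN_PYTHON` are hypothetical names, and the 3.8 minimum comes from the requirement stated above.

```python
import sys

MIN_PYTHON = (3, 8)


def is_supported(version=None):
    """Return True if `version` (default: the running interpreter) meets the minimum."""
    if version is None:
        version = sys.version_info
    # Compare only the major and minor components.
    return tuple(version[:2]) >= MIN_PYTHON


if __name__ == "__main__":
    current = ".".join(str(part) for part in sys.version_info[:3])
    status = "supported" if is_supported() else "too old, 3.8 or higher is required"
    # str.format is used instead of an f-string so the script still parses on old interpreters.
    print("Python {}: {}".format(current, status))
```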