Models overview

Model	Parameters	Size	Quality	Best for
kitten-tts-mini	80M	80 MB	Highest	Production apps, quality-first use cases
kitten-tts-micro	40M	41 MB	Balanced	Most applications
kitten-tts-nano (fp32)	15M	56 MB	Good	Edge devices, real-time inference
kitten-tts-nano (int8)	15M	25 MB	Good	Embedded systems, mobile

Model

Parameters

Size

Quality

Best for

kitten-tts-mini

80M

80 MB

Highest

Production apps, quality-first use cases

kitten-tts-micro

40M

41 MB

Balanced

Most applications

kitten-tts-nano (fp32)

15M

56 MB

Good

Edge devices, real-time inference

kitten-tts-nano (int8)

15M

25 MB

Good

Embedded systems, mobile

Choosing a model

kitten-tts-mini

80M params — 80 MBHighest output quality. Use when size is not a constraint and you need the best results for production applications.

kitten-tts-micro

40M params — 41 MBBalanced quality and speed. A good default for most applications where you want a smaller footprint without sacrificing too much quality.

kitten-tts-nano

15M params — 56 MB (fp32) / 25 MB (int8)Fastest inference. Best for edge devices, embedded systems, and real-time synthesis on constrained hardware.

kitten-tts-nano (int8)

15M params — 25 MBSmallest footprint via int8 quantization. Targets mobile and embedded deployments. See the nano page for known issues.

Switching between models

Pass the Hugging Face repo ID to KittenTTS to select a model. All models are downloaded automatically on first use.

from kittentts import KittenTTS

# Highest quality (80M params, 80MB)
model = KittenTTS("KittenML/kitten-tts-mini-0.8")

# Balanced (40M params, 41MB)
model = KittenTTS("KittenML/kitten-tts-micro-0.8")

# Fastest (15M params, 56MB)
model = KittenTTS("KittenML/kitten-tts-nano-0.8")

# Smallest (15M params, 25MB, int8 quantized)
model = KittenTTS("KittenML/kitten-tts-nano-0.8-int8")

Model caching

By default, models are cached in the platform’s standard cache directory. Use the cache_dir parameter to specify a custom location, which is useful for shared environments or when managing disk space explicitly.

model = KittenTTS("KittenML/kitten-tts-mini-0.8", cache_dir="/data/models")

Set cache_dir to a persistent volume when running in containers so the model is not re-downloaded on every restart.

Get Started

Concepts

Guides

Models

Model comparison

Choosing a model

kitten-tts-mini

kitten-tts-micro

kitten-tts-nano

kitten-tts-nano (int8)

Switching between models

Model caching

Build docs developers (and LLMs) love

Get Started

Concepts

Guides

Models

Documentation Index

​Model comparison

​Choosing a model

kitten-tts-mini

kitten-tts-micro

kitten-tts-nano

kitten-tts-nano (int8)

​Switching between models

​Model caching

Build docs developers (and LLMs) love

Model comparison

Choosing a model

Switching between models

Model caching