On-Device Speech AI for Apple Silicon
Deploy state-of-the-art speech-to-text and text-to-speech models directly on iOS, macOS, watchOS, and visionOS with real-time streaming, voice activity detection, and more.
Quick start
Get up and running with WhisperKit in minutes
Install via Swift Package Manager
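A minimal manifest sketch for this step. The package name `YourApp`, the platform minimums, and the version requirement `0.9.0` are assumptions for illustration; check the WhisperKit releases for the version you actually want to pin.

```swift
// swift-tools-version:5.9
import PackageDescription

let package = Package(
    name: "YourApp", // hypothetical package name
    // Assumed deployment targets; adjust to match your app and the WhisperKit release you use.
    platforms: [.macOS(.v14), .iOS(.v17)],
    dependencies: [
        // Version requirement is an assumption, not taken from this page.
        .package(url: "https://github.com/argmaxinc/WhisperKit.git", from: "0.9.0"),
    ],
    targets: [
        .target(
            name: "YourApp",
            dependencies: [
                .product(name: "WhisperKit", package: "WhisperKit"),
            ]
        ),
    ]
)
```

In an Xcode app project, the same dependency can be added through File > Add Package Dependencies using the repository URL above.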
Add WhisperKit as a dependency in your Package.swift file.
Initialize WhisperKit
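A short sketch of initialization, assuming the default initializer with no arguments. The helper name `makePipeline` is hypothetical, and which model is selected by default depends on the WhisperKit release and the device.

```swift
import WhisperKit

/// Hypothetical helper that creates a WhisperKit pipeline with default settings.
/// Initialization is async and can throw, for example if model files
/// cannot be fetched or loaded.
func makePipeline() async throws -> WhisperKit {
    // Default settings; the library is assumed to pick a suitable model itself.
    let pipe = try await WhisperKit()
    return pipe
}
```

Creating the pipeline once and reusing it is usually the better design, since loading models is likely to be the most expensive step.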
Transcribe audio
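A sketch of file transcription, assuming a WhisperKit release in which `transcribe(audioPath:)` returns an array of `TranscriptionResult` values, each with a `text` property. The function name `transcribeFile` and the example path are hypothetical.

```swift
import WhisperKit

/// Hypothetical helper: transcribes one audio file and returns the combined text.
func transcribeFile(at path: String) async throws -> String {
    let pipe = try await WhisperKit()

    // Assumption: this overload returns one result per processed audio segment.
    let results: [TranscriptionResult] = try await pipe.transcribe(audioPath: path)

    // Join the text of every result into a single string.
    return results.map(\.text).joined(separator: " ")
}

// Example call from a non-async context; the path is a placeholder.
Task {
    do {
        let text = try await transcribeFile(at: "path/to/audio.wav")
        print(text)
    } catch {
        print("Transcription failed: \(error)")
    }
}
```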
Use the transcribe method to convert audio files to text.
Core features
Everything you need for on-device speech AI
Speech-to-Text
Text-to-Speech
Voice Activity Detection
Multi-Platform
Local Server
Model Management
Explore by topic
Deep dive into WhisperKit capabilities
Real-Time Streaming
Stream audio from microphone with live transcription using AudioStreamTranscriber
Streaming Playback
Real-time audio streaming with intelligent buffering strategies for smooth playback
Custom Models
Fine-tune and deploy your own Whisper models with whisperkittools
Performance Optimization
Optimize inference speed and memory usage with compute unit selection
Resources
Additional resources to help you succeed
Model Catalog
Benchmarks
Contributing
Discord Community
Ready to get started?
Build powerful on-device speech applications with WhisperKit today