Overview
SuperCmd supports multiple AI providers for different features:
- LLM Chat: OpenAI, Anthropic, Google Gemini, Ollama, OpenAI-compatible
- Speech-to-Text: Native macOS, OpenAI Whisper, ElevenLabs
- Text-to-Speech: Edge TTS (free), ElevenLabs, Native macOS
- Memory: Supermemory integration
Settings are stored in ~/Library/Application Support/SuperCmd/settings.json.
AI features are optional. SuperCmd works perfectly fine without any AI provider configured.
LLM Providers
OpenAI
Models supported:
- gpt-4o (recommended)
- gpt-4o-mini (faster, cheaper)
- gpt-4-turbo
- gpt-3.5-turbo
Get API Key
- Go to platform.openai.com
- Create an account or sign in
- Navigate to API Keys
- Click Create new secret key
- GPT-4o: $15 per 1M output tokens
- GPT-4o-mini: $0.60 per 1M output tokens
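To get a feel for what these rates mean in practice, the arithmetic can be sketched as below. This is only an illustration: the function and table names are hypothetical, the figures are assumed to be US dollars, and input-token cost is left out because it is not listed here.

```typescript
// Output-token prices per 1M tokens, copied from the list above
// (assumed to be US dollars).
const OUTPUT_PRICE_PER_MILLION: Record<string, number | undefined> = {
  "gpt-4o": 15,
  "gpt-4o-mini": 0.6,
};

// Estimate the output-token cost of a response.
// Input-token cost is NOT included because it is not listed above.
function estimateOutputCostUsd(model: string, outputTokens: number): number {
  const price = OUTPUT_PRICE_PER_MILLION[model];
  if (price === undefined) {
    throw new Error(`No output price listed for model: ${model}`);
  }
  return (outputTokens / 1000000) * price;
}
```

For example, `estimateOutputCostUsd("gpt-4o", 2000000)` evaluates to 30, while the same volume on gpt-4o-mini comes to about 1.20.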
Anthropic (Claude)
Models supported:
- claude-sonnet-4-20250514 (recommended)
- claude-3-5-sonnet-20241022
- claude-3-5-haiku-20241022
- claude-3-opus-20240229
Get API Key
- Go to console.anthropic.com
- Create an account
- Navigate to API Keys
- Click Create Key
- Claude Sonnet: $15 per 1M output tokens
- Claude Haiku: $1.25 per 1M output tokens
Google Gemini
Models supported:
- gemini-2.0-flash-exp (recommended)
- gemini-1.5-flash
- gemini-1.5-pro
Get API Key
- Go to aistudio.google.com
- Click Get API Key
- Create a new key or use existing
- Gemini Flash: Free tier available, then $0.075 per 1M tokens
- Gemini Pro: $1.25 per 1M input tokens
Ollama (Local)
Models available:
- llama3.2 (3B, 1B)
- qwen2.5 (7B, 14B, 32B)
- mistral (7B)
- Any model from ollama.com/library
Ollama runs models locally, so there are no API costs, but it requires significant RAM and CPU.
- 8GB RAM minimum (16GB recommended)
- Apple Silicon Mac recommended for best performance
- 4-10GB disk space per model
OpenAI-Compatible APIs
Supports any API that follows the OpenAI chat completions format:
- OpenRouter
- Together AI
- Groq
- Local LLMs (LM Studio, LocalAI)
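As a rough illustration of what "OpenAI chat completions format" means, the sketch below builds such a request for an arbitrary base URL. The helper and type names are hypothetical and this is not SuperCmd's own code; it only shows the request shape these providers share (a POST to `/chat/completions` with a bearer token and a `model` plus `messages` body).

```typescript
interface ChatMessage {
  role: "system" | "user" | "assistant";
  content: string;
}

interface ChatRequest {
  url: string;
  headers: Record<string, string>;
  body: string;
}

// Build a chat completions request for any OpenAI-compatible endpoint.
// `baseUrl` is the provider's API root, e.g. one ending in "/v1".
function buildChatRequest(
  baseUrl: string,
  apiKey: string,
  model: string,
  messages: ChatMessage[],
): ChatRequest {
  // Drop trailing slashes so the joined URL has exactly one separator.
  const root = baseUrl.replace(/\/+$/, "");
  return {
    url: `${root}/chat/completions`,
    headers: {
      "Content-Type": "application/json",
      Authorization: `Bearer ${apiKey}`,
    },
    body: JSON.stringify({ model, messages }),
  };
}
```

The result can be sent with any HTTP client as a POST. Only the base URL, key, and model name should need to change between providers; check each provider's documentation for the exact values.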
Speech-to-Text (STT)
Native macOS (Default)
Free, on-device, no API key required.
- English: en-US, en-GB, en-AU
- Spanish: es-ES, es-MX
- French: fr-FR
- German: de-DE
- Italian: it-IT
- Japanese: ja-JP
- Korean: ko-KR
- Chinese: zh-CN, zh-TW
- And 50+ more via SFSpeechRecognizer
Implementation: src/native/speech-recognizer.swift
OpenAI Whisper
Models: whisper-1 (hosted by OpenAI)
ElevenLabs
Models: eleven_multilingual_v2
Text-to-Speech (TTS)
Edge TTS (Default)
Free, cloud-based, no API key required.
- English: en-US-EricNeural, en-US-JennyNeural, en-GB-SoniaNeural
- Spanish: es-ES-AlvaroNeural, es-MX-DaliaNeural
- French: fr-FR-DeniseNeural
- German: de-DE-ConradNeural
- Japanese: ja-JP-NanamiNeural
Implementation: src/main/ai-provider.ts (using node-edge-tts)
ElevenLabs
High-quality, natural-sounding voices.
Setup:
Get API Key
- Go to elevenlabs.io
- Sign up for an account
- Navigate to Profile → API Keys
- Free tier: 10,000 characters/month
- Starter: $5/month for 30,000 characters
- Creator: $22/month for 100,000 characters
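To decide which plan fits a given reading volume, the tiers above can be compared with a few lines of code. This is a sketch with hypothetical names, using only the three plans listed here.

```typescript
interface Tier {
  name: string;
  monthlyUsd: number;
  charsPerMonth: number;
}

// Plans from the list above, ordered from cheapest to most expensive.
const ELEVENLABS_TIERS: Tier[] = [
  { name: "Free", monthlyUsd: 0, charsPerMonth: 10000 },
  { name: "Starter", monthlyUsd: 5, charsPerMonth: 30000 },
  { name: "Creator", monthlyUsd: 22, charsPerMonth: 100000 },
];

// Return the cheapest listed plan whose quota covers the expected usage,
// or undefined if none of the listed plans is large enough.
function cheapestTierFor(charsPerMonth: number): Tier | undefined {
  for (const tier of ELEVENLABS_TIERS) {
    if (charsPerMonth <= tier.charsPerMonth) {
      return tier;
    }
  }
  return undefined;
}
```

Reading about 25,000 characters a month, for instance, fits the Starter plan, while anything above 100,000 characters exceeds every plan listed here.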
Native macOS TTS
On-device, free.
NSSpeechSynthesizer: decent quality but less natural than Edge TTS or ElevenLabs.
Supermemory Integration
Add long-term memory to AI chat. Supermemory stores context from previous conversations and retrieves relevant information automatically.
Setup:
Create Supermemory account
- Go to supermemory.ai and sign up
Implementation: src/main/memory.ts
Speech Correction
Automatically fix speech-to-text errors using an LLM.
- Fix capitalization
- Add punctuation
- Correct common speech-to-text errors
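One way such a correction pass could be set up is to wrap the raw transcript in a short instruction before sending it to the configured LLM. The sketch below is an assumption about the general approach, not SuperCmd's actual prompt; the prompt text and all names are hypothetical.

```typescript
interface ChatMessage {
  role: "system" | "user";
  content: string;
}

// Hypothetical instruction covering the three goals listed above.
const CORRECTION_PROMPT =
  "You clean up speech-to-text transcripts. Fix capitalization, add punctuation, " +
  "and correct common speech-to-text errors. Do not add or remove information. " +
  "Return only the corrected text.";

// Build the message list for a correction request.
function buildCorrectionMessages(transcript: string): ChatMessage[] {
  return [
    { role: "system", content: CORRECTION_PROMPT },
    { role: "user", content: transcript.trim() },
  ];
}
```

Asking for "only the corrected text" keeps the reply usable as a drop-in replacement for the raw transcript.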
Feature Toggles
You can enable/disable individual AI features:
| Setting | Feature | Default |
|---|---|---|
| enabled | Master AI toggle | true |
| llmEnabled | AI chat (? prefix) | true |
| whisperEnabled | Voice input (hold-to-speak) | true |
| readEnabled | Text-to-speech (Read) | true |
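The sketch below shows how these toggles might combine, assuming the master `enabled` toggle gates every other feature and that missing keys fall back to the defaults in the table. Both points are assumptions rather than confirmed behavior, and the type and function names are hypothetical.

```typescript
interface AiToggles {
  enabled: boolean;
  llmEnabled: boolean;
  whisperEnabled: boolean;
  readEnabled: boolean;
}

type FeatureKey = "llmEnabled" | "whisperEnabled" | "readEnabled";

// Defaults from the table above.
const DEFAULT_TOGGLES: AiToggles = {
  enabled: true,
  llmEnabled: true,
  whisperEnabled: true,
  readEnabled: true,
};

// A feature counts as active only when the master toggle and its own
// toggle are both on (assumed semantics). Keys missing from the saved
// settings fall back to the defaults.
function isFeatureActive(saved: Partial<AiToggles>, feature: FeatureKey): boolean {
  const merged: AiToggles = { ...DEFAULT_TOGGLES, ...saved };
  return merged.enabled && merged[feature];
}
```

Under these assumptions, `isFeatureActive({ enabled: false }, "readEnabled")` is false even though `readEnabled` defaults to true.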
Troubleshooting
AI chat doesn't work
- Check that ai.enabled is true
- Verify your API key is correct
- Test the API key with curl
- Check console logs (Cmd+Option+I) for errors
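For OpenAI keys, a quick check is an authenticated GET to the models endpoint: a 200 response means the key was accepted and a 401 means it was rejected. The sketch below only builds the request and interprets the status code; the helper names are hypothetical, and other providers use different endpoints and headers.

```typescript
interface KeyCheckRequest {
  url: string;
  method: "GET";
  headers: Record<string, string>;
}

// Describe a minimal authenticated request against the OpenAI API.
function buildOpenAiKeyCheck(apiKey: string): KeyCheckRequest {
  return {
    url: "https://api.openai.com/v1/models",
    method: "GET",
    // Trim the key: stray whitespace from copy and paste is a common cause of 401s.
    headers: { Authorization: `Bearer ${apiKey.trim()}` },
  };
}

// Turn the HTTP status of that request into a troubleshooting hint.
function explainStatus(status: number): string {
  if (status === 200) return "API key accepted";
  if (status === 401) return "API key rejected: check for typos or a revoked key";
  if (status === 429) return "Rate limited or quota exceeded";
  return `Unexpected HTTP status ${status}`;
}
```

The request can be sent with any HTTP client, for example `fetch(req.url, { method: req.method, headers: req.headers })`, passing the response status to `explainStatus`.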
Voice input doesn't transcribe
- Ensure microphone permission is granted
- Test microphone with Voice Memos app
- Check ai.speechToTextModel setting
- For Whisper: verify openaiApiKey
- Try switching to native model
API rate limit errors
OpenAI free tier limits:
- 3 requests per minute
- 200 requests per day
Solutions:
- Upgrade to a paid plan
- Switch to gpt-4o-mini (cheaper)
- Use Ollama (local, no limits)
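To see how a per-minute limit such as "3 requests per minute" plays out, the sketch below computes how long a client would have to wait before its next request fits in a sliding 60-second window. It is a simplified model with hypothetical names; the provider's actual accounting may differ.

```typescript
// Milliseconds to wait before another request fits under `limit`
// requests per `windowMs`, given the timestamps of earlier requests.
function msUntilAllowed(
  sentAt: number[],
  now: number,
  limit: number = 3,
  windowMs: number = 60000,
): number {
  // Keep only the requests that still count against the current window.
  const recent = sentAt
    .filter((t) => now - t < windowMs)
    .sort((a, b) => a - b);
  if (recent.length < limit) {
    return 0;
  }
  // A slot frees up when the oldest of the last `limit` requests
  // leaves the window.
  const blocking = recent[recent.length - limit];
  return blocking + windowMs - now;
}
```

With requests sent at 0 s, 10 s, and 20 s, a fourth request at 30 s would need to wait another 30 s; once the first request is more than 60 s old, the wait drops to zero.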
Ollama connection refused
- Ensure Ollama is running: ollama serve
- Check base URL: http://localhost:11434
- Test with: curl http://localhost:11434/api/tags
- Verify model is pulled: ollama list
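The /api/tags check can also be done in code. The sketch below inspects the JSON that endpoint returns and reports whether a given model has been pulled. The helper name is hypothetical, and only the `models[].name` field of the response is used.

```typescript
// The part of the /api/tags response this check relies on.
interface OllamaTagsResponse {
  models: { name: string }[];
}

// Report whether `wanted` appears in the list of locally pulled models.
// Ollama lists names with a tag suffix (for example "llama3.2:latest"),
// so a bare model name is treated as the ":latest" tag.
function isModelPulled(tags: OllamaTagsResponse, wanted: string): boolean {
  const target = wanted.indexOf(":") === -1 ? `${wanted}:latest` : wanted;
  return tags.models.some((m) => m.name === target);
}
```

If the request to http://localhost:11434/api/tags fails outright, Ollama is most likely not running. If it succeeds but this check returns false, the model still needs to be pulled.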
Next Steps
- Voice Input: Use hold-to-speak dictation
- Text-to-Speech: Read selected text aloud
- AI Integration: Learn about AI features
- Settings: All configuration options