Skip to main content

Documentation Index

Fetch the complete documentation index at: https://mintlify.com/lvndry/jazz/llms.txt

Use this file to discover all available pages before exploring further.

Jazz is provider-agnostic—you can use OpenAI, Anthropic, Google, local models with Ollama, or access hundreds of models through OpenRouter. Switch providers mid-conversation, use different models for different agents, or route to the cheapest/fastest option for each task.

Supported providers

Jazz supports 15+ LLM providers:
ProviderModelsTransport
AnthropicClaude 4.5 Sonnet, Opus, HaikuAPI
OpenAIGPT-4, GPT-4 Turbo, GPT-3.5, o1, o3API
GoogleGemini Pro, Gemini Flash, Gemini NanoAPI
xAIGrok, Grok VisionAPI
DeepSeekDeepSeek Chat, DeepSeek CoderAPI
MistralMistral Large, Medium, SmallAPI
GroqLlama, Mixtral (fast inference)API
CerebrasLlama 3.1 (ultra-fast inference)API
FireworksMultiple open modelsAPI
TogetherAIMultiple open modelsAPI
OpenRouter200+ models from all providersAPI
OllamaAny local modelLocal
AI GatewayCustom OpenAI-compatible endpointsAPI
Source: models.ts:18-89, create-agent.ts:118

Provider configuration

Setting API keys

During agent creation, Jazz prompts for missing API keys:
jazz agent create
Which LLM provider would you like to use?
❯ Anthropic
  OpenAI
  Google
  ...

⚠ API key not set in config file for Anthropic.
Please paste your API key below:

Anthropic API Key: sk-ant-...
✓ API key saved to config file.
Keys are stored in ~/.jazz/config.json:
{
  "llm": {
    "anthropic": {
      "api_key": "sk-ant-..."
    },
    "openai": {
      "api_key": "sk-..."
    }
  }
}
Source: create-agent.ts:413-447

Manual configuration

Edit ~/.jazz/config.json directly:
{
  "llm": {
    "anthropic": {
      "api_key": "sk-ant-api03-..."
    },
    "openai": {
      "api_key": "sk-proj-...",
      "base_url": "https://api.openai.com/v1"  // Optional
    },
    "google": {
      "api_key": "AIza..."
    },
    "ollama": {
      "base_url": "http://localhost:11434/api"  // Optional, defaults to localhost
    },
    "openrouter": {
      "api_key": "sk-or-v1-..."
    }
  }
}

Provider details

Anthropic (Claude)

Best for: complex reasoning, long-form writing, code analysis Models:
  • claude-4.5-sonnet - Balanced performance and cost
  • claude-4.5-opus - Maximum capability
  • claude-3-haiku - Fast and economical
Configuration:
{
  "llm": {
    "anthropic": {
      "api_key": "sk-ant-..."
    }
  }
}
Get API key: console.anthropic.com Source: models.ts:19-22

OpenAI (GPT)

Best for: general tasks, function calling, structured output Models:
  • gpt-4 - General purpose
  • gpt-4-turbo - Faster, cheaper GPT-4
  • gpt-3.5-turbo - Fast and economical
  • o1-preview - Advanced reasoning (slow)
  • o3-mini - Reasoning with cost optimization
Configuration:
{
  "llm": {
    "openai": {
      "api_key": "sk-proj-...",
      "base_url": "https://api.openai.com/v1"  // Optional
    }
  }
}
Get API key: platform.openai.com Source: models.ts:23-26

Google (Gemini)

Best for: multimodal tasks, large context windows Models:
  • gemini-2.0-flash-exp - Latest, fastest
  • gemini-1.5-pro - Large context (2M tokens)
  • gemini-1.5-flash - Fast and economical
Configuration:
{
  "llm": {
    "google": {
      "api_key": "AIza..."
    }
  }
}
Get API key: aistudio.google.com Source: models.ts:27-30

OpenRouter (Multi-provider)

Best for: accessing many models with one API key, comparing models 200+ models including:
  • All Anthropic Claude models
  • All OpenAI GPT models
  • Google Gemini
  • Meta Llama
  • Mistral, DeepSeek, and more
Configuration:
{
  "llm": {
    "openrouter": {
      "api_key": "sk-or-v1-..."
    }
  }
}
Free tier: Use the Free Models Router for no-cost access:
jazz agent create
# Provider: OpenRouter
# Model: openrouter/free  # Automatically routes to free models
Get API key: openrouter.ai/keys Source: models.ts:35-38, README.md:49

Ollama (Local models)

Best for: privacy, offline use, no API costs Run any model locally:
# Install Ollama
curl -fsSL https://ollama.com/install.sh | sh

# Pull a model
ollama pull llama3.1
ollama pull qwen2.5-coder
ollama pull mistral
Configuration:
{
  "llm": {
    "ollama": {
      "base_url": "http://localhost:11434/api"
    }
  }
}
No API key required - Ollama is optional auth. Source: models.ts:79-83, create-agent.ts:430-431

Groq (Fast inference)

Best for: speed-critical tasks, real-time applications Models:
  • Llama models with extremely fast inference
  • Mixtral models
Configuration:
{
  "llm": {
    "groq": {
      "api_key": "gsk_..."
    }
  }
}
Get API key: console.groq.com Source: models.ts:62-66

DeepSeek (Specialized models)

Best for: coding tasks, technical analysis Models:
  • deepseek-chat - General purpose
  • deepseek-coder - Code generation and analysis
Configuration:
{
  "llm": {
    "deepseek": {
      "api_key": "sk-..."
    }
  }
}
Get API key: platform.deepseek.com Source: models.ts:54-57

Reasoning models

Some models support extended reasoning with configurable effort:

OpenAI o1/o3 series

jazz agent create
# Provider: OpenAI
# Model: o1-preview

What reasoning effort level would you like?
 Medium - Balanced speed and reasoning depth (recommended)
  Low - Faster responses, basic reasoning
  High - Deep reasoning, slower responses
  Disable - No reasoning effort (fastest)
Reasoning effort controls how much “thinking time” the model uses:
  • Low - Fast, basic reasoning
  • Medium - Balanced (recommended)
  • High - Deep analysis, slower
  • Disable - Standard completion mode
Source: create-agent.ts:496-522

Dynamic model selection

Some providers fetch models dynamically from their API:
ProviderModel SourceRefresh
OpenRouterAPI (/api/v1/models)On agent creation
GroqAPI (/models)On agent creation
OllamaAPI (/tags)On agent creation
CerebrasAPI (/v1/models)On agent creation
FireworksAPI (/v1/accounts/fireworks/models)On agent creation
TogetherAIAPI (/v1/models)On agent creation
Static providers (Anthropic, OpenAI, Google) have hardcoded model lists. Source: models.ts:12-89

Switching models mid-conversation

In any chat session:
/model
Select a different model:
Switch model:
❯ claude-4.5-sonnet
  gpt-4-turbo
  gemini-2.0-flash-exp
  llama3.1 (ollama)
  ...
Conversation continues with the new model, preserving context.

Model selection by task

Code review and analysis

Best: claude-4.5-sonnet, gpt-4, deepseek-coder
jazz agent create
# Provider: Anthropic
# Model: claude-4.5-sonnet
# Persona: coder

Research and writing

Best: claude-4.5-opus, gpt-4-turbo, gemini-1.5-pro
jazz agent create
# Provider: Google
# Model: gemini-1.5-pro
# Persona: researcher

Fast iterations

Best: claude-3-haiku, gpt-3.5-turbo, gemini-1.5-flash, groq/llama3
jazz agent create
# Provider: Groq
# Model: llama3-70b-8192

Local/offline

Best: Ollama with any model
ollama pull llama3.1

jazz agent create
# Provider: Ollama
# Model: llama3.1

Budget-conscious

Best: OpenRouter free tier, Ollama
jazz agent create
# Provider: OpenRouter
# Model: openrouter/free

Custom OpenAI-compatible endpoints

Use the ai_gateway provider for custom endpoints:
{
  "llm": {
    "ai_gateway": {
      "api_key": "your-key",
      "base_url": "https://your-gateway.com/v1"
    }
  }
}
Works with:
  • Azure OpenAI
  • Custom LLM proxies
  • Self-hosted OpenAI-compatible APIs
  • Enterprise gateways
Source: models.ts:40-43

Provider comparison

By use case

Use CaseRecommended ProviderModel
Code reviewAnthropicclaude-4.5-sonnet
Long-form writingAnthropicclaude-4.5-opus
ResearchGooglegemini-1.5-pro
Fast iterationsGroqllama3-70b
Privacy/offlineOllamallama3.1
BudgetOpenRouterFree Models Router
ReasoningOpenAIo1-preview
General purposeOpenAIgpt-4-turbo

By cost

TierProvidersNotes
FreeOpenRouter (free tier), OllamaNo cost, rate limited (OpenRouter) or local (Ollama)
LowClaude 3 Haiku, GPT-3.5 Turbo, Gemini Flash$0.25-0.50 per 1M tokens
MediumClaude 4.5 Sonnet, GPT-4 Turbo, Gemini Pro$3-15 per 1M tokens
HighClaude 4.5 Opus, GPT-4, o1$15-60 per 1M tokens

By speed

SpeedProvidersModels
Ultra-fastGroq, CerebrasLlama3, Mixtral
FastAllHaiku, GPT-3.5, Flash
StandardAllSonnet, GPT-4 Turbo, Pro
SlowOpenAIo1, o3 (reasoning)

Troubleshooting

API key not working

Verify the key is correct:
jazz config show
Check llm.<provider>.api_key value. Update if needed:
# Edit config
vim ~/.jazz/config.json

# Or re-run agent creation to prompt for new key
jazz agent create

Model not available

For dynamic providers (OpenRouter, Ollama), models are fetched at agent creation time. If a model is missing:
  1. Ollama: Pull the model first
    ollama pull model-name
    
  2. OpenRouter: Check if model exists at openrouter.ai/models
  3. Other providers: Model may be deprecated or renamed

Rate limit errors

Switch to a different provider or upgrade your plan:
/model
# Select alternative provider

Connection timeout

For local Ollama:
# Verify Ollama is running
ollama list

# Check base URL in config
cat ~/.jazz/config.json | grep ollama
For remote APIs:
# Test connectivity
curl https://api.anthropic.com/v1/messages \
  -H "x-api-key: $ANTHROPIC_API_KEY" \
  -H "anthropic-version: 2023-06-01"

Best practices

Use different models for different tasks - Fast models for iteration, powerful models for complex analysis.
Start with free options - OpenRouter free tier or Ollama for experimentation.
Configure multiple providers - Have fallback options when one provider has issues.
Secure API keys - Never commit API keys to version control. Use environment variables or config files with restricted permissions.
Monitor costs - Use cheaper models for high-volume workflows. Reserve expensive models for critical tasks.

Environment variables

Override config with environment variables:
# Provider-specific
export ANTHROPIC_API_KEY=sk-ant-...
export OPENAI_API_KEY=sk-proj-...
export GOOGLE_API_KEY=AIza...

# Run Jazz
jazz agent chat my-agent
Environment variables take precedence over config file values.

Next steps

Build docs developers (and LLMs) love