Supported Providers
Prism integrates with the following providers:Anthropic
Claude models with extended thinking, prompt caching, and citations
OpenAI
GPT models with reasoning, image generation, and audio processing
Google Gemini
Multimodal models with search grounding and video analysis
Ollama
Local model deployment for privacy and offline use
Mistral
Magistral reasoning models with OCR capabilities
Groq
Ultra-fast LPU inference for real-time applications
DeepSeek
Efficient models optimized for coding tasks
xAI
Grok models with extended thinking capabilities
OpenRouter
Access multiple providers through a single API
ElevenLabs
Speech-to-text with diarization and event tagging
VoyageAI
Specialized embeddings for search and retrieval
Feature Comparison
Compare capabilities across providers to find the best fit for your use case:Core Features
| Provider | Text Generation | Streaming | Structured Output | Tool Calling | Embeddings |
|---|---|---|---|---|---|
| Anthropic | ✅ | ✅ | ✅ (Native + Tool) | ✅ | ❌ |
| OpenAI | ✅ | ✅ | ✅ (Strict mode) | ✅ | ✅ |
| Gemini | ✅ | ✅ | ✅ (Native) | ✅ | ✅ |
| Ollama | ✅ | ✅ | ✅ (Prompt-based) | ✅ | ✅ |
| Mistral | ✅ | ✅ | ✅ | ✅ | ✅ |
| Groq | ✅ | ✅ | ✅ | ✅ | ❌ |
| DeepSeek | ✅ | ✅ | ✅ | ✅ | ❌ |
| xAI | ✅ | ✅ | ✅ (Strict mode) | ✅ | ❌ |
| OpenRouter | ✅ | ✅ | ✅ | ✅ | ❌ |
| ElevenLabs | ❌ | ❌ | ❌ | ❌ | ❌ |
| VoyageAI | ❌ | ❌ | ❌ | ❌ | ✅ |
Multimodal Support
| Provider | Images | Audio (STT) | Audio (TTS) | Video | Documents | Image Generation |
|---|---|---|---|---|---|---|
| Anthropic | ✅ | ❌ | ❌ | ❌ | ✅ (PDF) | ❌ |
| OpenAI | ✅ | ✅ (Whisper) | ✅ | ❌ | ❌ | ✅ (DALL-E) |
| Gemini | ✅ | ✅ | ❌ | ✅ (YouTube) | ✅ | ✅ (Imagen) |
| Ollama | ✅ | ❌ | ❌ | ❌ | ❌ | ❌ |
| Mistral | ✅ | ✅ (Voxtral) | ❌ | ❌ | ✅ (URL) | ❌ |
| Groq | ✅ | ✅ (Whisper) | ✅ (PlayAI) | ❌ | ❌ | ❌ |
| DeepSeek | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ |
| xAI | ✅ | ❌ | ❌ | ❌ | ❌ | ❌ |
| OpenRouter | ✅ | ❌ | ❌ | ✅ | ✅ | ❌ |
| ElevenLabs | ❌ | ✅ (Scribe) | ❌ | ❌ | ❌ | ❌ |
| VoyageAI | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ |
Advanced Capabilities
| Provider | Reasoning/Thinking | Prompt Caching | Citations | Code Execution | Search Grounding | Moderation |
|---|---|---|---|---|---|---|
| Anthropic | ✅ (Extended) | ✅ (Ephemeral) | ✅ | ✅ | ❌ | ❌ |
| OpenAI | ✅ (GPT-5) | ✅ (Auto) | ❌ | ✅ | ❌ | ✅ |
| Gemini | ✅ (Thinking) | ✅ | ✅ | ❌ | ✅ (Google) | ❌ |
| Ollama | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ |
| Mistral | ✅ (Magistral) | ❌ | ❌ | ❌ | ❌ | ❌ |
| Groq | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ |
| DeepSeek | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ |
| xAI | ✅ (Extended) | ❌ | ❌ | ❌ | ❌ | ❌ |
| OpenRouter | ✅ (Model-dependent) | ❌ | ❌ | ❌ | ❌ | ❌ |
| ElevenLabs | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ |
| VoyageAI | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ |
Configuration
All providers are configured inconfig/prism.php:
Choosing a Provider
For Production Applications
- Best Overall
- Multimodal
- Speed
- Privacy
Anthropic Claude - Industry-leading reasoning, long context windows, and reliable structured output with prompt caching for cost optimization.
For Specific Use Cases
Complex Reasoning Tasks
Complex Reasoning Tasks
Use providers with reasoning/thinking capabilities:
- Anthropic Claude 3.7 - Extended thinking with budget control
- OpenAI GPT-5 - Reasoning effort levels (low/medium/high)
- Mistral Magistral - Efficient reasoning for complex problems
- xAI Grok - Extended thinking mode
Audio Processing
Audio Processing
Speech-to-Text:
- OpenAI Whisper (via OpenAI/Groq) - Industry standard
- Mistral Voxtral - Multilingual with 30min support
- ElevenLabs Scribe - Diarization and event tagging
- OpenAI TTS - Natural voices with HD quality
- Groq PlayAI - Fast TTS with Arabic support
Image Generation
Image Generation
- OpenAI DALL-E 3 - Highest quality with prompt revision
- OpenAI GPT-Image-1 - Advanced editing with masks
- Gemini Imagen 4 - HD generation with aspect ratio control
Embeddings
Embeddings
- VoyageAI - Specialized for search/retrieval tasks
- OpenAI - General purpose embeddings
- Gemini - Task-specific embeddings
Cost Optimization
Cost Optimization
- Anthropic - Prompt caching (5m/1h TTL)
- OpenAI - Automatic caching for structured output
- Gemini - Content caching with custom TTL
Multiple Providers
Multiple Providers
OpenRouter - Access multiple providers through a single API with automatic fallback routing.
Provider Reliability
Structured Output Reliability
Native Support (Most Reliable):- Anthropic (Claude Sonnet 4.5+, Opus 4.1+)
- OpenAI (with strict mode)
- Gemini (native structured output)
- xAI (strict mode)
- Anthropic (non-native models)
- OpenRouter
- Ollama
- DeepSeek
Message Order Requirements
Quick Start by Provider
Next Steps
Provider Details
Explore detailed documentation for each provider
Core Concepts
Learn about text generation, streaming, and tools
Structured Output
Generate type-safe structured data
Multimodal Input
Work with images, audio, video, and documents