Major providers
OpenAI
Chat, completions, embeddings, image generation, audio speech, transcription, translation, files, batches, fine-tuning, and realtime WebSocket APIs.
Anthropic
Chat completions and completions via the native Messages API or the OpenAI-compatible interface.
Google Gemini
Chat completions and embeddings via the Gemini API.
Google Vertex AI
Chat completions, embeddings, image generation, fine-tuning, batches, and Anthropic Messages on Vertex.
Azure OpenAI
Chat, completions, embeddings, image generation, audio, batches, and fine-tuning through Azure-hosted OpenAI models.
AWS Bedrock
Chat, completions, embeddings, image generation, batches, fine-tuning, and Anthropic Messages on Bedrock.
Mistral AI
Chat completions and embeddings.
Cohere
Chat, completions, embeddings, files, and batches.
Using a provider
Setprovider to the provider’s identifier and pass its API key as Authorization. The request shape is identical across all providers.
All providers
The table below lists every provider available in the gateway. Feature support is derived from the handler files present in each provider’s source directory.| Provider | ID | Chat | Completions | Embeddings | Images | Audio | Batches | Fine-tuning |
|---|---|---|---|---|---|---|---|---|
| OpenAI | openai | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
| Azure OpenAI | azure-openai | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
| Anthropic | anthropic | ✅ | ✅ | — | — | — | — | — |
| Google Gemini | google | ✅ | — | ✅ | — | — | — | — |
| Google Vertex AI | vertex-ai | ✅ | — | ✅ | ✅ | — | ✅ | ✅ |
| AWS Bedrock | bedrock | ✅ | ✅ | ✅ | ✅ | — | ✅ | ✅ |
| AWS SageMaker | sagemaker | ✅ | — | — | — | — | — | — |
| Azure AI Inference | azure-ai-inference | ✅ | — | — | — | — | — | — |
| Mistral AI | mistral-ai | ✅ | — | ✅ | — | — | — | — |
| Cohere | cohere | ✅ | ✅ | ✅ | — | — | ✅ | — |
| Groq | groq | ✅ | — | — | — | — | — | — |
| Together AI | together-ai | ✅ | ✅ | ✅ | — | — | — | — |
| Fireworks AI | fireworks-ai | ✅ | ✅ | ✅ | ✅ | — | — | ✅ |
| Perplexity AI | perplexity-ai | ✅ | — | — | — | — | — | — |
| DeepSeek | deepseek | ✅ | — | — | — | — | — | — |
| DeepInfra | deepinfra | ✅ | — | — | — | — | — | — |
| Ollama | ollama | ✅ | — | ✅ | — | — | — | — |
| OpenRouter | openrouter | ✅ | — | — | — | — | — | — |
| Anyscale | anyscale | ✅ | — | — | — | — | — | — |
| Hugging Face | huggingface | ✅ | ✅ | — | — | — | — | — |
| Replicate | replicate | ✅ | — | — | — | — | — | — |
| Stability AI | stability-ai | — | — | — | ✅ | — | — | — |
| Cloudflare Workers AI | workers-ai | ✅ | ✅ | ✅ | ✅ | — | — | — |
| Novita AI | novita-ai | ✅ | — | — | — | — | — | — |
| Cerebras | cerebras | ✅ | — | — | — | — | — | — |
| X AI (Grok) | x-ai | ✅ | — | — | — | — | — | — |
| Hyperbolic | hyperbolic | ✅ | — | — | — | — | — | — |
| SambaNova | sambanova | ✅ | — | — | — | — | — | — |
| Nebius | nebius | ✅ | — | — | — | — | — | — |
| Lambda | lambda | ✅ | — | — | — | — | — | — |
| Modal | modal | ✅ | — | — | — | — | — | — |
| Predibase | predibase | ✅ | — | — | — | — | — | — |
| Lepton AI | lepton | ✅ | — | — | — | — | — | — |
| Moonshot | moonshot | ✅ | — | — | — | — | — | — |
| Voyage AI | voyage | — | — | ✅ | — | — | — | — |
| Nomic | nomic | — | — | ✅ | — | — | — | — |
| Jina | jina | — | — | ✅ | — | — | — | — |
| AI21 | ai21 | ✅ | — | — | — | — | — | — |
| Reka AI | reka-ai | ✅ | — | — | — | — | — | — |
| Upstage | upstage | ✅ | — | — | — | — | — | — |
| OVHcloud | ovhcloud | ✅ | — | — | — | — | — | — |
| Oracle Cloud | oracle | ✅ | — | — | — | — | — | — |
| Kluster AI | kluster-ai | ✅ | — | — | — | — | — | — |
| nScale | nscale | ✅ | — | — | — | — | — | — |
| nCompass | ncompass | ✅ | — | — | — | — | — | — |
| Featherless AI | featherless-ai | ✅ | — | — | — | — | — | — |
| Inference.net | inference-net | ✅ | — | — | — | — | — | — |
| IO Intelligence | iointelligence | ✅ | — | — | — | — | — | — |
| Segmind | segmind | — | — | — | ✅ | — | — | — |
| Recraft AI | recraft-ai | — | — | — | ✅ | — | — | — |
| Meshy | meshy | — | — | — | ✅ | — | — | — |
| Tripo3D | tripo3d | — | — | — | ✅ | — | — | — |
| SiliconFlow | siliconflow | ✅ | — | — | — | — | — | — |
| Dashscope (Alibaba) | dashscope | ✅ | — | — | — | — | — | — |
| Zhipu AI | zhipu | ✅ | — | — | — | — | — | — |
| Lingyi (01.AI) | lingyi | ✅ | — | — | — | — | — | — |
| Krutrim | krutrim | ✅ | — | — | — | — | — | — |
| Monsterapi | monsterapi | ✅ | — | — | — | — | — | — |
| Triton | triton | ✅ | — | — | — | — | — | — |
| Cortex | cortex | ✅ | — | — | — | — | — | — |
| Palm | palm | ✅ | — | — | — | — | — | — |
| Nextbit | nextbit | ✅ | — | — | — | — | — | — |
| Bytez | bytez | ✅ | — | — | — | — | — | — |
| CometAPI | cometapi | ✅ | — | — | — | — | — | — |
| Deepbricks | deepbricks | ✅ | — | — | — | — | — | — |
| LemonFox AI | lemonfox-ai | ✅ | — | — | — | — | — | — |
| Matterai | matterai | ✅ | — | — | — | — | — | — |
| Hyperbolic | hyperbolic | ✅ | — | — | — | — | — | — |
| Qdrant | qdrant | — | — | ✅ | — | — | — | — |
| Milvus | milvus | — | — | ✅ | — | — | — | — |
| AI Badger | aibadgr | ✅ | — | — | — | — | — | — |
| 302.AI | 302ai | ✅ | — | — | — | — | — | — |
| Z AI | z-ai | ✅ | — | — | — | — | — | — |
The gateway also proxies any OpenAI-compatible endpoint via the
openai-base and anthropic-base providers. Point custom_host at any self-hosted or third-party API that speaks the OpenAI or Anthropic wire format.Provider categories
- OpenAI-compatible
- Cloud providers
- Open source / self-hosted
- Specialized
These providers expose an OpenAI-compatible API. The gateway routes to them with no request transformation.
- OpenAI —
openai - Azure OpenAI —
azure-openai - Groq —
groq - OpenRouter —
openrouter - DeepSeek —
deepseek - Cerebras —
cerebras - X AI (Grok) —
x-ai - SambaNova —
sambanova - Nebius —
nebius - Lambda —
lambda - Hyperbolic —
hyperbolic - Together AI —
together-ai - Fireworks AI —
fireworks-ai - Anyscale —
anyscale - Perplexity AI —
perplexity-ai