Skip to main content

Major providers

OpenAI

Chat, completions, embeddings, image generation, audio speech, transcription, translation, files, batches, fine-tuning, and realtime WebSocket APIs.

Anthropic

Chat completions and completions via the native Messages API or the OpenAI-compatible interface.

Google Gemini

Chat completions and embeddings via the Gemini API.

Google Vertex AI

Chat completions, embeddings, image generation, fine-tuning, batches, and Anthropic Messages on Vertex.

Azure OpenAI

Chat, completions, embeddings, image generation, audio, batches, and fine-tuning through Azure-hosted OpenAI models.

AWS Bedrock

Chat, completions, embeddings, image generation, batches, fine-tuning, and Anthropic Messages on Bedrock.

Mistral AI

Chat completions and embeddings.

Cohere

Chat, completions, embeddings, files, and batches.

Using a provider

Set provider to the provider’s identifier and pass its API key as Authorization. The request shape is identical across all providers.
from portkey_ai import Portkey

client = Portkey(
    provider="openai",
    Authorization="sk-***"
)

client.chat.completions.create(
    model="gpt-4o-mini",
    messages=[{"role": "user", "content": "Hello!"}]
)

All providers

The table below lists every provider available in the gateway. Feature support is derived from the handler files present in each provider’s source directory.
ProviderIDChatCompletionsEmbeddingsImagesAudioBatchesFine-tuning
OpenAIopenai
Azure OpenAIazure-openai
Anthropicanthropic
Google Geminigoogle
Google Vertex AIvertex-ai
AWS Bedrockbedrock
AWS SageMakersagemaker
Azure AI Inferenceazure-ai-inference
Mistral AImistral-ai
Coherecohere
Groqgroq
Together AItogether-ai
Fireworks AIfireworks-ai
Perplexity AIperplexity-ai
DeepSeekdeepseek
DeepInfradeepinfra
Ollamaollama
OpenRouteropenrouter
Anyscaleanyscale
Hugging Facehuggingface
Replicatereplicate
Stability AIstability-ai
Cloudflare Workers AIworkers-ai
Novita AInovita-ai
Cerebrascerebras
X AI (Grok)x-ai
Hyperbolichyperbolic
SambaNovasambanova
Nebiusnebius
Lambdalambda
Modalmodal
Predibasepredibase
Lepton AIlepton
Moonshotmoonshot
Voyage AIvoyage
Nomicnomic
Jinajina
AI21ai21
Reka AIreka-ai
Upstageupstage
OVHcloudovhcloud
Oracle Cloudoracle
Kluster AIkluster-ai
nScalenscale
nCompassncompass
Featherless AIfeatherless-ai
Inference.netinference-net
IO Intelligenceiointelligence
Segmindsegmind
Recraft AIrecraft-ai
Meshymeshy
Tripo3Dtripo3d
SiliconFlowsiliconflow
Dashscope (Alibaba)dashscope
Zhipu AIzhipu
Lingyi (01.AI)lingyi
Krutrimkrutrim
Monsterapimonsterapi
Tritontriton
Cortexcortex
Palmpalm
Nextbitnextbit
Bytezbytez
CometAPIcometapi
Deepbricksdeepbricks
LemonFox AIlemonfox-ai
Matteraimatterai
Hyperbolichyperbolic
Qdrantqdrant
Milvusmilvus
AI Badgeraibadgr
302.AI302ai
Z AIz-ai
The gateway also proxies any OpenAI-compatible endpoint via the openai-base and anthropic-base providers. Point custom_host at any self-hosted or third-party API that speaks the OpenAI or Anthropic wire format.

Provider categories

These providers expose an OpenAI-compatible API. The gateway routes to them with no request transformation.
  • OpenAIopenai
  • Azure OpenAIazure-openai
  • Groqgroq
  • OpenRouteropenrouter
  • DeepSeekdeepseek
  • Cerebrascerebras
  • X AI (Grok)x-ai
  • SambaNovasambanova
  • Nebiusnebius
  • Lambdalambda
  • Hyperbolichyperbolic
  • Together AItogether-ai
  • Fireworks AIfireworks-ai
  • Anyscaleanyscale
  • Perplexity AIperplexity-ai

Build docs developers (and LLMs) love