LLM Providers

The library supports 15+ LLM providers with unified authentication and configuration. All providers work through the same API surface.

Supported Providers

API Key Providers

These providers use static API keys for authentication:

OpenAI

Models: GPT-4o, GPT-4o-mini, o1-preview, o1-mini, o3-miniAPI: openai-responses (Responses API)Features: Vision, reasoning, tool callingSetup:

export OPENAI_API_KEY="sk-..."

import { getModel, complete } from '@mariozechner/pi-ai';

const model = getModel('openai', 'gpt-4o-mini');
const response = await complete(model, context);

Anthropic

Models: Claude 3.5 Sonnet, Claude 3.5 Haiku, Claude Sonnet 4API: anthropic-messages (Messages API)Features: Vision, thinking (Sonnet 4), tool calling, prompt cachingSetup:

export ANTHROPIC_API_KEY="sk-ant-..."

For Claude Pro/Max subscribers, use OAuth (see OAuth section).

Google

Models: Gemini 1.5 Pro, Gemini 1.5 Flash, Gemini 2.0 Flash, Gemini 2.5 FlashAPI: google-generative-ai (Generative AI API)Features: Vision, thinking, tool callingSetup:

export GEMINI_API_KEY="..."

Note: Google provider does not support function call streaming. You receive a single toolcall_delta event with complete arguments.

xAI (Grok)

Models: Grok Beta, Grok Code Fast, Grok Vision BetaAPI: openai-completions (OpenAI-compatible)Features: Vision, reasoning, tool callingSetup:

export XAI_API_KEY="xai-..."

Groq

Models: Llama 3.3 70B, Llama 3.1 70B, Mixtral 8x7B, GPT-OSS modelsAPI: openai-completions (OpenAI-compatible)Features: Extremely fast inference, tool callingSetup:

export GROQ_API_KEY="gsk_..."

Cerebras

Models: Llama 3.3 70B, Llama 3.1 70B, GPT-OSS modelsAPI: openai-completions (OpenAI-compatible)Features: Fast inference, tool callingSetup:

export CEREBRAS_API_KEY="csk-..."

Mistral

Models: Mistral Large, Mistral Small, Codestral, PixtralAPI: openai-completions (OpenAI-compatible)Features: Vision (Pixtral), tool callingSetup:

export MISTRAL_API_KEY="..."

Note: Mistral requires tool call IDs to be exactly 9 alphanumeric characters. The library handles this automatically.

OpenRouter

Models: Access to 100+ models from multiple providersAPI: openai-completions (OpenAI-compatible)Features: Multi-provider routing, fallback supportSetup:

export OPENROUTER_API_KEY="sk-or-..."

const model = getModel('openrouter', 'anthropic/claude-3.5-sonnet');

// Control provider routing
const response = await complete(model, context, {
  compat: {
    openRouterRouting: {
      only: ['anthropic', 'openai'],
      order: ['anthropic', 'openai']
    }
  }
});

Amazon Bedrock

Models: Claude models via AWS BedrockAPI: bedrock-converse-stream (Bedrock Converse API)Features: Vision, tool calling, AWS authenticationSetup: Uses AWS SDK credentials (IAM roles, profiles, or environment variables)

import { getModel, complete } from '@mariozechner/pi-ai';

const model = getModel('amazon-bedrock', 'anthropic.claude-3-5-sonnet-20241022-v2:0');
const response = await complete(model, context);

MiniMax

Models: MiniMax models (Chinese provider)API: openai-completions (OpenAI-compatible)Setup:

export MINIMAX_API_KEY="..."

Kimi For Coding

Models: Moonshot AI models with Anthropic-compatible APIAPI: openai-completions (OpenAI-compatible)Setup:

export KIMI_API_KEY="..."

Vercel AI Gateway

Models: Multi-provider gateway with caching and analyticsAPI: openai-completions (OpenAI-compatible)Setup:

export AI_GATEWAY_API_KEY="..."

Azure OpenAI

Models: GPT-4o, GPT-4o-mini via AzureAPI: azure-openai-responses (Responses API)Setup:

export AZURE_OPENAI_API_KEY="..."
export AZURE_OPENAI_BASE_URL="https://your-resource.openai.azure.com"
# Or use resource name (constructs URL automatically)
export AZURE_OPENAI_RESOURCE_NAME="your-resource"

# Optional: Override deployment names
export AZURE_OPENAI_DEPLOYMENT_NAME_MAP="gpt-4o-mini=my-deployment,gpt-4o=prod"
# Optional: API version (defaults to v1)
export AZURE_OPENAI_API_VERSION="v1"

const model = getModel('azure-openai-responses', 'gpt-4o-mini');
// Or override deployment name in options
const response = await complete(model, context, {
  azureDeploymentName: 'my-custom-deployment'
});

OAuth Providers

These providers require OAuth authentication:

OpenAI Codex

Models: GPT-5 Mini, GPT-5 Nano, GPT-5.1 Omni (ChatGPT Plus/Pro subscription)API: openai-codex-responses (Codex Responses API)Features: Extended reasoning, session-based caching, WebSocket supportSetup: See OAuth Authentication

GitHub Copilot

Models: GPT-4o, Claude 3.5 Sonnet (Copilot subscription)API: openai-completions (OpenAI-compatible)Setup: See OAuth AuthenticationNote: If you get “model not supported” error, enable the model in VS Code Copilot Chat settings first.

Google Gemini CLI

Models: Gemini 2.0 Flash, Gemini 2.5 Flash (via Cloud Code Assist)API: google-gemini-cli (Cloud Code Assist API)Features: Free tier or paid subscription, thinking supportSetup: See OAuth Authentication

Antigravity

Models: Free Gemini 3, Claude, GPT-OSS models (via Google Cloud)API: openai-completions (OpenAI-compatible)Setup: See OAuth Authentication

Vertex AI

Models: Gemini models via Google Cloud Vertex AIAPI: google-vertex (Vertex AI API)Setup: Uses Application Default Credentials (ADC)

# Local development
gcloud auth application-default login
export GOOGLE_CLOUD_PROJECT="my-project"
export GOOGLE_CLOUD_LOCATION="us-central1"

# Production (service account)
export GOOGLE_APPLICATION_CREDENTIALS="/path/to/service-account.json"

See OAuth Authentication for details.

Environment Variables

In Node.js, API keys can be set via environment variables:

Provider	Environment Variable
OpenAI	`OPENAI_API_KEY`
Azure OpenAI	`AZURE_OPENAI_API_KEY`, `AZURE_OPENAI_BASE_URL` or `AZURE_OPENAI_RESOURCE_NAME`
Anthropic	`ANTHROPIC_API_KEY` or `ANTHROPIC_OAUTH_TOKEN`
Google	`GEMINI_API_KEY`
Vertex AI	`GOOGLE_CLOUD_PROJECT`, `GOOGLE_CLOUD_LOCATION` (+ ADC)
Mistral	`MISTRAL_API_KEY`
Groq	`GROQ_API_KEY`
Cerebras	`CEREBRAS_API_KEY`
xAI	`XAI_API_KEY`
OpenRouter	`OPENROUTER_API_KEY`
Vercel AI Gateway	`AI_GATEWAY_API_KEY`
MiniMax	`MINIMAX_API_KEY`
Kimi	`KIMI_API_KEY`
GitHub Copilot	`COPILOT_GITHUB_TOKEN`, `GH_TOKEN`, or `GITHUB_TOKEN`

Checking API Keys

import { getEnvApiKey } from '@mariozechner/pi-ai';

const key = getEnvApiKey('openai');
if (key) {
  console.log('OpenAI API key is set');
}

Browser Usage

In browsers, you must pass API keys explicitly:

const response = await complete(model, context, {
  apiKey: 'your-api-key'
});

Exposing API keys in frontend code is dangerous. Use a backend proxy for production applications.

Custom Models

Create custom models for local inference servers:

import { Model, stream } from '@mariozechner/pi-ai';

// Ollama example
const ollamaModel: Model<'openai-completions'> = {
  id: 'llama-3.1-8b',
  name: 'Llama 3.1 8B (Ollama)',
  api: 'openai-completions',
  provider: 'ollama',
  baseUrl: 'http://localhost:11434/v1',
  reasoning: false,
  input: ['text'],
  cost: { input: 0, output: 0, cacheRead: 0, cacheWrite: 0 },
  contextWindow: 128000,
  maxTokens: 32000
};

const response = await stream(ollamaModel, context, {
  apiKey: 'dummy' // Ollama doesn't need a real key
});

OpenAI Compatibility Settings

For OpenAI-compatible providers, you can override compatibility settings:

const customModel: Model<'openai-completions'> = {
  id: 'custom-model',
  name: 'Custom Model',
  api: 'openai-completions',
  provider: 'custom',
  baseUrl: 'http://localhost:8000/v1',
  reasoning: false,
  input: ['text'],
  cost: { input: 0, output: 0, cacheRead: 0, cacheWrite: 0 },
  contextWindow: 8192,
  maxTokens: 4096,
  compat: {
    supportsStore: false,
    supportsDeveloperRole: false,
    maxTokensField: 'max_tokens',
    supportsUsageInStreaming: false
  }
};

Available compatibility flags:

supportsStore - Whether provider supports the store field
supportsDeveloperRole - Whether provider supports developer role vs system
supportsReasoningEffort - Whether provider supports reasoning_effort
supportsUsageInStreaming - Whether provider supports token usage in streaming
maxTokensField - Use max_completion_tokens or max_tokens
requiresToolResultName - Whether tool results need the name field
requiresAssistantAfterToolResult - Whether assistant message is required after tool results
requiresThinkingAsText - Whether thinking blocks must be text with <thinking> tags
requiresMistralToolIds - Whether tool IDs must be Mistral format (9 alphanumeric chars)
thinkingFormat - Reasoning param format: openai, zai, or qwen
supportsStrictMode - Whether provider supports strict in tool definitions

Provider APIs

The library uses different APIs for different providers:

openai-completions - OpenAI Chat Completions API (xAI, Groq, Cerebras, Mistral, OpenRouter, etc.)
openai-responses - OpenAI Responses API (OpenAI)
openai-codex-responses - OpenAI Codex Responses API (OpenAI Codex)
azure-openai-responses - Azure OpenAI Responses API (Azure)
anthropic-messages - Anthropic Messages API (Anthropic)
google-generative-ai - Google Generative AI API (Google)
google-gemini-cli - Google Cloud Code Assist API (Google Gemini CLI)
google-vertex - Google Vertex AI API (Vertex AI)
bedrock-converse-stream - Amazon Bedrock Converse API (Bedrock)

Get Started

Core Concepts

Coding Agent

LLM API

Agent Core

UI Libraries

Additional Tools

Guides

LLM Providers

LLM Providers

Supported Providers

API Key Providers

OAuth Providers

Environment Variables

Checking API Keys

Browser Usage

Custom Models

OpenAI Compatibility Settings

Provider APIs

Next Steps

OAuth Setup

Streaming

Build docs developers (and LLMs) love

Get Started

Core Concepts

Coding Agent

LLM API

Agent Core

UI Libraries

Additional Tools

Guides

Documentation Index

​LLM Providers

​Supported Providers

​API Key Providers

​OAuth Providers

​Environment Variables

​Checking API Keys

​Browser Usage

​Custom Models

​OpenAI Compatibility Settings

​Provider APIs

​Next Steps

OAuth Setup

Streaming

Build docs developers (and LLMs) love

LLM Providers

Supported Providers

API Key Providers

OAuth Providers

Environment Variables

Checking API Keys

Browser Usage

Custom Models

OpenAI Compatibility Settings

Provider APIs

Next Steps