The LLM Proxy is Archestra’s security layer that sits between your AI agents, chatbots, and applications and the LLM providers they rely on — OpenAI, Anthropic, Google Gemini, AWS Bedrock, and more. Every request passes through the proxy, where Archestra intercepts and analyzes it, applies security policies, and then forwards it to the upstream provider. Responses travel the same path in reverse, giving you a single control plane for visibility, compliance, and cost management across all LLM traffic.Documentation Index
Fetch the complete documentation index at: https://mintlify.com/archestra-ai/archestra/llms.txt
Use this file to discover all available pages before exploring further.
How It Works
Security Policies
Every request is evaluated against your configured security and compliance policies before reaching the provider.
Unified Endpoint
One proxy URL per provider route replaces direct provider endpoints in your client code, requiring minimal changes.
Full Observability
All interactions are logged with token counts, costs, latency, and trace attributes for complete audit trails.
Getting Started
Create an LLM Proxy
Go to LLM Proxies in the Archestra dashboard and create a new proxy. Give it a descriptive name to identify the project or team it serves.
Copy the Provider URL
Click the Connect icon on the proxy, choose the LLM provider your application uses, and copy the provided proxy URL.
OpenAI-Compatible Model Router
The Model Router exposes a single OpenAI-compatible endpoint that can reach models from multiple configured providers. Use it when your application supports the OpenAI API style but you want to route to Anthropic, Gemini, Groq, Bedrock, or other providers without changing client code. The router accepts OpenAI Responses and Chat Completions requests, resolves provider-qualified model IDs such asopenai:gpt-5.4 or anthropic:claude-opus-4-6-20250918 to the backing provider, runs the full LLM Proxy security pipeline, and returns a matching OpenAI-format response.
Generic OpenAI-compatible clients should use a virtual key mapped to the providers they need. Backend services and bots should use an LLM OAuth client access token. See Authentication for setup details.
Custom Headers
Archestra supports optional custom headers that enrich logs, metrics, and traces without altering the underlying LLM request. All headers are optional and can be combined freely.| Header | Description | Example Value |
|---|---|---|
X-Archestra-Agent-Id | Client-provided identifier for the calling agent or application. Stored with each interaction and included in Prometheus metrics as the external_agent_id label. Useful when multiple apps share the same proxy. | my-chatbot-prod |
X-Archestra-User-Id | Associates the request with a specific Archestra user. Automatically included when using built-in Archestra Chat. | 123e4567-e89b-12d3-a456-426614174000 |
X-Archestra-Session-Id | Groups related LLM requests into a session — included in trace attributes as gen_ai.conversation.id. | session-abc-123 |
X-Archestra-Execution-Id | Associates the request with a specific execution run. Used for the agent_executions_total Prometheus metric. | exec-run-456 |
X-Archestra-Meta | Composite header combining agent ID, execution ID, and session ID in one value. Format: {agent-id}/{execution-id}/{session-id}. Any segment can be empty. Individual headers take precedence. Values must not contain /. | my-agent/exec-123/session-456 |