AnythingLLM ships with connectors for more than 30 language model providers — from fully local runtimes such as Ollama and LM Studio to managed cloud APIs like OpenAI, Anthropic, and AWS Bedrock. You can switch providers at any time without re-importing documents; only new chat completions will use the updated provider. The active provider is controlled by theDocumentation Index
Fetch the complete documentation index at: https://mintlify.com/Mintplex-Labs/anything-llm/llms.txt
Use this file to discover all available pages before exploring further.
LLM_PROVIDER environment variable (or selected in the setup wizard), and each provider then reads its own set of keys and preferences from the environment.
Two Ways to Configure
- Setup Wizard (UI)
- Environment Variables
When you first launch AnythingLLM, the setup wizard walks you through selecting a provider and entering credentials directly in the browser. You can return to these settings at any time via Settings → LLM Preference in the sidebar.The UI writes your choices to the database, so no
.env edits are required for basic provider selection when using the wizard.Local / Self-Hosted Providers
These providers run entirely on your own hardware — no data is sent to external services.Ollama
Ollama
Ollama is the most popular self-hosted LLM runtime and the recommended starting point for local deployments.When running AnythingLLM inside Docker, use
host.docker.internal to reach Ollama on the host machine. On Linux you may need --add-host=host.docker.internal:host-gateway in your Docker run command.LM Studio
LM Studio
LM Studio exposes an OpenAI-compatible local API server.
LocalAI
LocalAI
LocalAI is an open-source, self-hosted inference server compatible with the OpenAI API.
KoboldCPP
KoboldCPP
KoboldCPP is a single-file LLM runtime popular in the creative writing community.
Text Generation Web UI (llama.cpp / oobabooga)
Text Generation Web UI (llama.cpp / oobabooga)
Supports any model loaded in the popular oobabooga text-generation-webui.
NVIDIA NIM
NVIDIA NIM
NVIDIA NIM exposes locally deployed NVIDIA-accelerated models through an OpenAI-compatible endpoint.
Docker Model Runner
Docker Model Runner
Docker Model Runner runs models directly inside the Docker engine on supported hardware.
Lemonade
Lemonade
Lemonade is an AMD-optimized local inference server (part of the AMD ROCm ecosystem).
Foundry (Microsoft Foundry Local)
Foundry (Microsoft Foundry Local)
Microsoft Foundry Local is a local inference runtime for Windows with hardware acceleration.
PrivateMode
PrivateMode
PrivateMode is a privacy-focused local inference server.
Cloud / Commercial Providers
- OpenAI
- Anthropic
- Google Gemini
OPEN_MODEL_PREF can be any model ID returned by the OpenAI models API. Popular choices include gpt-4o, gpt-4o-mini, o1, and o3-mini.Azure OpenAI
Azure OpenAI
AZURE_OPENAI_MODEL_PREF is the deployment name you created in Azure OpenAI Studio, not the base model name.AWS Bedrock
AWS Bedrock
Mistral
Mistral
Groq
Groq
Cohere
Cohere
Perplexity
Perplexity
Together AI
Together AI
Fireworks AI
Fireworks AI
OpenRouter
OpenRouter
DeepSeek
DeepSeek
xAI (Grok)
xAI (Grok)
Novita
Novita
SambaNova
SambaNova
Cerebras
Cerebras
MiniMax
MiniMax
Moonshot AI
Moonshot AI
Gitee AI
Gitee AI
ZAI (Zhipu AI)
ZAI (Zhipu AI)
PPIO
PPIO
APIPie
APIPie
CometAPI
CometAPI
Generic / Compatible Providers
Use these when you have any OpenAI-compatible endpoint that doesn’t have its own named connector, or when routing through an aggregation layer.Generic OpenAI (any OpenAI-compatible endpoint)
Generic OpenAI (any OpenAI-compatible endpoint)
LiteLLM
LiteLLM
LiteLLM acts as a universal proxy, translating dozens of provider APIs into a single OpenAI-compatible interface.
Provider Quick-Reference
| Provider key | Category | Auth variable |
|---|---|---|
openai | Cloud | OPEN_AI_KEY |
anthropic | Cloud | ANTHROPIC_API_KEY |
gemini | Cloud | GEMINI_API_KEY |
azure | Cloud | AZURE_OPENAI_KEY |
bedrock | Cloud | AWS_BEDROCK_API_KEY |
mistral | Cloud | MISTRAL_API_KEY |
groq | Cloud | GROQ_API_KEY |
cohere | Cloud | COHERE_API_KEY |
perplexity | Cloud | PERPLEXITY_API_KEY |
togetherai | Cloud | TOGETHER_AI_API_KEY |
fireworksai | Cloud | FIREWORKS_AI_LLM_API_KEY |
openrouter | Cloud | OPENROUTER_API_KEY |
deepseek | Cloud | DEEPSEEK_API_KEY |
xai | Cloud | XAI_LLM_API_KEY |
novita | Cloud | NOVITA_LLM_API_KEY |
sambanova | Cloud | SAMBANOVA_LLM_API_KEY |
cerebras | Cloud | CEREBRAS_API_KEY |
minimax | Cloud | MINIMAX_API_KEY |
moonshotai | Cloud | MOONSHOT_AI_API_KEY |
zai | Cloud | ZAI_API_KEY |
ppio | Cloud | PPIO_API_KEY |
apipie | Cloud | APIPIE_LLM_API_KEY |
cometapi | Cloud | COMETAPI_LLM_API_KEY |
giteeai | Cloud | GITEE_AI_API_KEY |
ollama | Local | (none required) |
lmstudio | Local | (none required) |
localai | Local | LOCAL_AI_API_KEY |
koboldcpp | Local | (none required) |
textgenwebui | Local | TEXT_GEN_WEB_UI_API_KEY |
nvidia-nim | Local | (none required) |
docker-model-runner | Local | (none required) |
lemonade | Local | (none required — LEMONADE_LLM_API_KEY optional) |
foundry | Local | (none required) |
privatemode | Local | (none required) |
generic-openai | Generic | GENERIC_OPEN_AI_API_KEY |
litellm | Generic | LITE_LLM_API_KEY |
The
LLM_PROVIDER value set via environment variables takes precedence over any selection stored in the database. If you want the UI to control the provider, remove LLM_PROVIDER from your .env file.