AnythingLLM is the all-in-one AI application that lets you build a private, fully-featured AI assistant without compromises. Connect your favorite local or cloud LLM, ingest your documents, and start chatting in minutes. Out of the box you get built-in AI agents, multi-user support, a choice of vector databases, and document pipelines, with no extra configuration required. Whether you’re a solo developer running everything locally or a team deploying to the cloud, AnythingLLM adapts to your workflow and infrastructure.
Documentation Index
Fetch the complete documentation index at: https://mintlify.com/Mintplex-Labs/anything-llm/llms.txt
Use this file to discover all available pages before exploring further.
Quickstart
Get AnythingLLM running in under 5 minutes with Docker and send your first chat.
Docker Installation
Deploy with a single `docker run` command or Docker Compose for full control.
Desktop App
Download the native desktop app for Mac, Windows, or Linux — no Docker needed.
Bare Metal
Run directly on Linux or macOS using Node.js and Yarn without any containers.
What Problem Does AnythingLLM Solve?
Most AI chat tools either lock you into a single cloud provider, expose your data to third-party servers, or require complex infrastructure to self-host. AnythingLLM removes every one of those constraints. You choose the LLM, local or remote. You own the data. You control who has access. The platform handles document ingestion, vector storage, agent capabilities, and multi-user permissioning so you don’t have to wire anything together yourself.
Key Features
Retrieval-Augmented Generation (RAG) Chat
Upload PDFs, Word documents, text files, and more directly into workspaces. AnythingLLM chunks, embeds, and indexes them so the LLM can answer questions grounded in your actual content, with source citations shown in every response.
AI Agents
Agents can browse the web, run code, read files, and execute multi-step workflows, all from within a chat interface. AnythingLLM includes a no-code agent flow builder and supports MCP (Model Context Protocol) for connecting external tools.
Multi-User Support with Permissions
The Docker version supports multiple users with per-user access controls. Admins can define who can create workspaces, upload documents, manage API keys, and more, without compromising security or privacy.
30+ LLM Providers
AnythingLLM works with every major language model provider out of the box, including OpenAI, Anthropic, Google Gemini, Ollama, LM Studio, Mistral, Groq, AWS Bedrock, Azure OpenAI, and many more. Switching providers requires no code changes: just update your environment variables or use the settings UI.
10 Vector Databases
The default vector store is LanceDB, which runs embedded with zero setup. You can swap to Pinecone, Chroma, ChromaCloud, Weaviate, Qdrant, Milvus, Zilliz, Astra DB, or PGVector at any time.
Embeddable Chat Widget
Embed a white-labeled chat widget on any website with a single script tag. The widget connects to your AnythingLLM instance and workspace (Docker version only).
Desktop App
A standalone desktop application for Mac, Windows, and Linux bundles everything together, including local model support, with no Docker or command-line setup required. Ideal for individual users who want a private AI assistant without touching infrastructure.
Additional Highlights
- Dynamic model routing: auto-route chats to the best model based on your rules
- Scheduled tasks: run recurring prompts or agent workflows on a cron schedule
- Automatic and user-managed memories: let the LLM remember context across sessions
- Multi-modal support for both open-source and closed LLMs
- Full Developer API for custom integrations
- Telemetry is anonymous and opt-out
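
The chunk, embed, and index flow described under Retrieval-Augmented Generation (RAG) Chat can be illustrated with a small, self-contained sketch. This is not AnythingLLM's implementation: the bag-of-words `embed` function is a toy stand-in for a real embedding model, the in-memory list is a stand-in for a vector database, and every function name, file name, and parameter here is hypothetical. It only shows the general shape of the technique: split documents into overlapping chunks, vectorize them, rank chunks by similarity to the question, and pass the top hits to the LLM along with their sources.

```python
import math
import re
from collections import Counter


def chunk_text(text: str, chunk_size: int = 12, overlap: int = 4) -> list[str]:
    """Split text into overlapping chunks of at most `chunk_size` words."""
    words = text.split()
    step = chunk_size - overlap
    chunks = []
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + chunk_size]))
        if start + chunk_size >= len(words):
            break  # the last chunk already reaches the end of the text
    return chunks


def embed(text: str) -> Counter:
    """Toy embedding (assumption): a bag-of-words count vector."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def build_index(documents: dict[str, str]) -> list[dict]:
    """Chunk and embed every document, remembering each chunk's source."""
    index = []
    for source, text in documents.items():
        for chunk in chunk_text(text):
            index.append({"source": source, "text": chunk, "vector": embed(chunk)})
    return index


def retrieve(index: list[dict], question: str, top_k: int = 2) -> list[dict]:
    """Return the `top_k` chunks most similar to the question."""
    query = embed(question)
    ranked = sorted(index, key=lambda e: cosine(query, e["vector"]), reverse=True)
    return ranked[:top_k]


def build_prompt(question: str, hits: list[dict]) -> str:
    """Assemble a grounded prompt with numbered source citations."""
    context = "\n".join(
        f"[{i + 1}] ({hit['source']}) {hit['text']}" for i, hit in enumerate(hits)
    )
    return (
        "Answer using only the context below and cite sources by number.\n"
        f"{context}\n\nQuestion: {question}"
    )


# Hypothetical sample documents standing in for uploaded files.
documents = {
    "handbook.txt": "Employees accrue vacation days monthly. "
                    "Unused vacation days roll over to the next year.",
    "security.txt": "Passwords must be rotated every ninety days. "
                    "Laptops must use full disk encryption.",
}

index = build_index(documents)
question = "How often must passwords be rotated?"
hits = retrieve(index, question)
prompt = build_prompt(question, hits)
print(prompt)
```

In a real deployment the embedding step would call one of the supported embedding models and the index would live in a vector database such as the default LanceDB; the retrieval-then-prompt structure stays the same.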
Architecture Overview
AnythingLLM is a monorepo with three runtime services and three supporting modules:

| Service | Technology | Role |
|---|---|---|
| `frontend` | Vite + React | Browser UI for chat, workspaces, document management, and settings |
| `server` | Node.js + Express | API server handling LLM calls, vector DB management, auth, and business logic |
| `collector` | Node.js + Express | Separate process that parses and processes uploaded documents |

- `docker`: Dockerfile, `docker-compose.yml`, and `.env.example` for containerized deployment
- `embed`: Submodule for the embeddable website chat widget
- `browser-extension`: Submodule for the Chrome browser extension
In production, the frontend is built into `server/public`, so only two processes run: `server` and `collector`.
Supported Providers at a Glance
LLM Providers
OpenAI, Anthropic, Google Gemini, Ollama, LM Studio, Mistral, Groq, AWS Bedrock, Azure OpenAI, DeepSeek, Cohere, Together AI, Fireworks AI, Perplexity, OpenRouter, xAI, and more — 30+ total.
Vector Databases
LanceDB (default, embedded), Pinecone, Chroma, ChromaCloud, Weaviate, Qdrant, Milvus, Zilliz, Astra DB, PGVector — 10 options.
Embedding Models
Native built-in embedder (default), OpenAI, Azure OpenAI, Ollama, LM Studio, Cohere, Voyage AI, Mistral, Gemini, LiteLLM, and generic OpenAI-compatible APIs.
Audio & TTS
Built-in Whisper transcription, OpenAI Whisper, native browser TTS, OpenAI TTS, ElevenLabs, Kokoro, PiperTTS, and OpenAI-compatible TTS services.