The plugins below extend LLM with additional models, tools, embedding backends, CLI commands, and content loaders. Install any of them withDocumentation Index
Fetch the complete documentation index at: https://mintlify.com/simonw/LLM/llms.txt
Use this file to discover all available pages before exploring further.
llm install <package-name> — see Installing Plugins for details. Plugins are grouped by the kind of capability they add.
Local models
Local models
These plugins let you run LLMs directly on your own machine without sending data to an external API.
Example — install and use Ollama models:Example — install and use a GGUF model:
| Plugin | Description |
|---|---|
| llm-ollama | Adds support for local models managed by Ollama. |
| llm-mlx | (Mac only) Uses Apple’s MLX framework for extremely fast access to a wide range of local models. |
| llm-gguf | Uses llama.cpp to run any model published in the GGUF format. |
| llm-llamafile | Connects to local models running via llamafile. |
| llm-gpt4all | Runs GPT4All optimised models locally, including Vicuna, Orca, Falcon, and MPT variants. |
| llm-mlc | Runs models released by the MLC project, with GPU acceleration on Apple Silicon. |
| llm-mpt30b | Adds support for the MPT-30B local model. |
Remote APIs
Remote APIs
These plugins connect LLM to externally hosted model APIs. You’ll need an API key for each service.
Example — install and use Anthropic’s Claude:
| Plugin | Provider |
|---|---|
| llm-anthropic | Anthropic — Claude 3 family, Claude 3.5 Sonnet, and beyond |
| llm-gemini | Google — Gemini language and embedding models |
| llm-mistral | Mistral AI — language and embedding models |
| llm-groq | Groq — ultra-fast inference |
| llm-grok | xAI — Grok models via the xAI API |
| llm-command-r | Cohere — Command R and Command R Plus |
| llm-cohere | Cohere — cohere-generate and cohere-summarize models |
| llm-deepseek | DeepSeek — DeepSeek-Chat and DeepSeek-Coder |
| llm-perplexity | Perplexity Labs — online search-augmented models |
| llm-reka | Reka AI — the full Reka model family |
| llm-replicate | Replicate — Llama 2 and other hosted models |
| llm-openrouter | OpenRouter — unified access to many providers |
| llm-fireworks | Fireworks AI — fast open-model inference |
| llm-together | Together AI — large catalogue of open licensed models |
| llm-bedrock | Amazon Bedrock — Nova models by Amazon |
| llm-bedrock-anthropic | Amazon Bedrock — Claude and Claude Instant via Bedrock |
| llm-bedrock-meta | Amazon Bedrock — Llama 2 and Llama 3 via Bedrock |
| llm-anyscale-endpoints | Anyscale Endpoints — Llama 2 70B and more |
| llm-lambda-labs | Lambda Labs — Nous Hermes 3 series |
| llm-venice | Venice AI — privacy-focused, uncensored models including Llama 3.1 405B |
If a provider exposes an OpenAI-compatible API, you can often configure LLM to use it directly without installing a dedicated plugin.
Tools
Tools
Tool plugins expose callable functions that a model can invoke during a conversation — for example, running code, querying a database, or searching the web.
Example — let a model run JavaScript:
| Plugin | What it provides |
|---|---|
| llm-tools-simpleeval | Safe expression evaluation for maths and simple logic |
| llm-tools-quickjs | Sandboxed QuickJS JavaScript interpreter with persistent state between calls |
| llm-tools-sqlite | Read-only SQL queries against local SQLite databases |
| llm-tools-datasette | SQL queries against a remote Datasette instance |
| llm-tools-exa | Web search and question-answering via exa.ai |
| llm-tools-rag | Semantic search over your LLM embedding collections for simple RAG |
Fragments and template loaders
Fragments and template loaders
Introduced in LLM 0.24, these plugins add custom
Template loaders (
Examples:
-f prefix:value fragment loaders and -t prefix:value template loaders, making it easy to pull external content directly into a prompt.Fragment loaders (-f prefix:value)| Plugin | What it loads |
|---|---|
| llm-fragments-github | Entire GitHub repositories or individual issue threads |
| llm-fragments-pdf | PDF files converted to Markdown via PyMuPDF4LLM |
| llm-fragments-pypi | PyPI package descriptions and metadata |
| llm-fragments-site-text | Websites converted to Markdown via Trafilatura |
| llm-fragments-reader | Any URL processed through the Jina Reader API |
| llm-hacker-news | Hacker News threads imported as conversation fragments |
| llm-video-frames | Video files converted to JPEG frames via ffmpeg for vision models |
-t prefix:value)| Plugin | What it loads |
|---|---|
| llm-templates-github | Prompt templates shared on GitHub |
| llm-templates-fabric | Prompts from the Fabric collection |
Embedding models
Embedding models
Embedding model plugins add backends for generating vector embeddings — used with
Example — embed and search text with sentence-transformers:Example — image search with CLIP:
llm embed and the llm embed-multi / llm similar commands.| Plugin | Models provided |
|---|---|
| llm-sentence-transformers | Any model from the sentence-transformers library |
| llm-clip | OpenAI’s CLIP model — embeds images and text in the same vector space |
| llm-embed-jina | Jina AI’s 8K text embedding models |
| llm-embed-onnx | Seven embedding models running via the ONNX runtime |
Extra commands
Extra commands
These plugins bolt new subcommands onto the
Example — generate and run a shell command:Example — cluster embeddings:
llm CLI.| Plugin | Command added | What it does |
|---|---|---|
| llm-cmd | llm cmd | Generates a shell command from a prompt, pre-fills your shell input, and lets you review before running |
| llm-cmd-comp | Shell keybinding | Launches an interactive chat to build a command; inserts the result into your shell command line |
| llm-python | llm python | Opens a Python REPL inside LLM’s virtual environment — handy for scripting and debugging |
| llm-cluster | llm cluster | Clusters a collection of embeddings and generates summary labels using an LLM |
| llm-jq | llm jq | Pipes JSON and a natural-language description into jq and executes the generated program |
Just for fun
Just for fun
Experimental or novelty plugins — also useful as learning resources for plugin authors.
| Plugin | Description |
|---|---|
| llm-markov | A simple model that generates text using a Markov chain. Used as the example in the Writing a plugin to support a new model tutorial. |