Plugin Directory: All LLM Plugins Listed by Category

The plugins below extend LLM with additional models, tools, embedding backends, CLI commands, and content loaders. Install any of them with llm install <package-name> — see Installing Plugins for details. Plugins are grouped by the kind of capability they add.

Local models

These plugins let you run LLMs directly on your own machine without sending data to an external API.

Plugin	Description
llm-ollama	Adds support for local models managed by Ollama.
llm-mlx	(Mac only) Uses Apple’s MLX framework for extremely fast access to a wide range of local models.
llm-gguf	Uses llama.cpp to run any model published in the GGUF format.
llm-llamafile	Connects to local models running via llamafile.
llm-gpt4all	Runs GPT4All optimised models locally, including Vicuna, Orca, Falcon, and MPT variants.
llm-mlc	Runs models released by the MLC project, with GPU acceleration on Apple Silicon.
llm-mpt30b	Adds support for the MPT-30B local model.

Example — install and use Ollama models:

llm install llm-ollama
llm models
llm -m ollama/llama3 'Explain how neural networks work'

Example — install and use a GGUF model:

llm install llm-gguf
llm gguf download-model https://example.com/model.gguf
llm -m orca-mini-3b-gguf2-q4_0 'What is the capital of France?'

Remote APIs

These plugins connect LLM to externally hosted model APIs. You’ll need an API key for each service.

Plugin	Provider
llm-anthropic	Anthropic — Claude 3 family, Claude 3.5 Sonnet, and beyond
llm-gemini	Google — Gemini language and embedding models
llm-mistral	Mistral AI — language and embedding models
llm-groq	Groq — ultra-fast inference
llm-grok	xAI — Grok models via the xAI API
llm-command-r	Cohere — Command R and Command R Plus
llm-cohere	Cohere — `cohere-generate` and `cohere-summarize` models
llm-deepseek	DeepSeek — DeepSeek-Chat and DeepSeek-Coder
llm-perplexity	Perplexity Labs — online search-augmented models
llm-reka	Reka AI — the full Reka model family
llm-replicate	Replicate — Llama 2 and other hosted models
llm-openrouter	OpenRouter — unified access to many providers
llm-fireworks	Fireworks AI — fast open-model inference
llm-together	Together AI — large catalogue of open licensed models
llm-bedrock	Amazon Bedrock — Nova models by Amazon
llm-bedrock-anthropic	Amazon Bedrock — Claude and Claude Instant via Bedrock
llm-bedrock-meta	Amazon Bedrock — Llama 2 and Llama 3 via Bedrock
llm-anyscale-endpoints	Anyscale Endpoints — Llama 2 70B and more
llm-lambda-labs	Lambda Labs — Nous Hermes 3 series
llm-venice	Venice AI — privacy-focused, uncensored models including Llama 3.1 405B

Example — install and use Anthropic’s Claude:

llm install llm-anthropic
llm keys set anthropic
llm -m claude-3-5-sonnet-latest 'Summarise the history of the internet'

If a provider exposes an OpenAI-compatible API, you can often configure LLM to use it directly without installing a dedicated plugin.

Tools

Tool plugins expose callable functions that a model can invoke during a conversation — for example, running code, querying a database, or searching the web.

Plugin	What it provides
llm-tools-simpleeval	Safe expression evaluation for maths and simple logic
llm-tools-quickjs	Sandboxed QuickJS JavaScript interpreter with persistent state between calls
llm-tools-sqlite	Read-only SQL queries against local SQLite databases
llm-tools-datasette	SQL queries against a remote Datasette instance
llm-tools-exa	Web search and question-answering via exa.ai
llm-tools-rag	Semantic search over your LLM embedding collections for simple RAG

Example — let a model run JavaScript:

llm install llm-tools-quickjs
llm -T llm_tools_quickjs 'Calculate the first 10 Fibonacci numbers'

Fragments and template loaders

Introduced in LLM 0.24, these plugins add custom -f prefix:value fragment loaders and -t prefix:value template loaders, making it easy to pull external content directly into a prompt.Fragment loaders (-f prefix:value)

Plugin	What it loads
llm-fragments-github	Entire GitHub repositories or individual issue threads
llm-fragments-pdf	PDF files converted to Markdown via PyMuPDF4LLM
llm-fragments-pypi	PyPI package descriptions and metadata
llm-fragments-site-text	Websites converted to Markdown via Trafilatura
llm-fragments-reader	Any URL processed through the Jina Reader API
llm-hacker-news	Hacker News threads imported as conversation fragments
llm-video-frames	Video files converted to JPEG frames via `ffmpeg` for vision models

Template loaders (-t prefix:value)

Plugin	What it loads
llm-templates-github	Prompt templates shared on GitHub
llm-templates-fabric	Prompts from the Fabric collection

Examples:

# Summarise an entire GitHub repo
llm install llm-fragments-github
llm -f github:simonw/files-to-prompt 'explain this codebase'

# Load a GitHub issue thread
llm -f issue:https://github.com/simonw/llm/issues/1 'summarise this discussion'

# Ask a question about a PDF
llm install llm-fragments-pdf
llm -f pdf:report.pdf "what are the key findings?"

# Use a Fabric prompt template
llm install llm-templates-fabric
cat setup.py | llm -t fabric:explain_code

# Extract frames from a video and describe it
llm install llm-video-frames
llm -f video-frames:demo.mp4 'describe the key scenes in this video'

Embedding models

Embedding model plugins add backends for generating vector embeddings — used with llm embed and the llm embed-multi / llm similar commands.

Plugin	Models provided
llm-sentence-transformers	Any model from the sentence-transformers library
llm-clip	OpenAI’s CLIP model — embeds images and text in the same vector space
llm-embed-jina	Jina AI’s 8K text embedding models
llm-embed-onnx	Seven embedding models running via the ONNX runtime

Example — embed and search text with sentence-transformers:

llm install llm-sentence-transformers
llm embed-models          # list available embedding models
llm embed -m sentence-transformers/all-MiniLM-L6-v2 -c "Hello world"

Example — image search with CLIP:

llm install llm-clip
llm embed -m clip -c "a photo of a cat"

Extra commands

These plugins bolt new subcommands onto the llm CLI.

Plugin	Command added	What it does
llm-cmd	`llm cmd`	Generates a shell command from a prompt, pre-fills your shell input, and lets you review before running
llm-cmd-comp	Shell keybinding	Launches an interactive chat to build a command; inserts the result into your shell command line
llm-python	`llm python`	Opens a Python REPL inside LLM’s virtual environment — handy for scripting and debugging
llm-cluster	`llm cluster`	Clusters a collection of embeddings and generates summary labels using an LLM
llm-jq	`llm jq`	Pipes JSON and a natural-language description into `jq` and executes the generated program

Example — generate and run a shell command:

llm install llm-cmd
llm cmd 'find all PDF files modified in the last 7 days'
# Review the generated command, then press Enter to run or Ctrl+C to cancel

Example — cluster embeddings:

llm install llm-cluster
llm cluster my-collection 10

Just for fun

Experimental or novelty plugins — also useful as learning resources for plugin authors.

Plugin	Description
llm-markov	A simple model that generates text using a Markov chain. Used as the example in the Writing a plugin to support a new model tutorial.

llm install llm-markov
llm -m markov 'hello world'

Get Started

Using LLM

Advanced Features

Embeddings

Plugins

Plugin Directory: All LLM Plugins Listed by Category

Build docs developers (and LLMs) love

Get Started

Using LLM

Advanced Features

Embeddings

Plugins

Documentation Index

Build docs developers (and LLMs) love