LLM ships with a built-in plugin for OpenAI’s API covering chat models, reasoning models, completion models, and embedding models. gpt-4o-mini (aliased to 4o-mini) is the default model used when you do not specify one.

Configuration

All OpenAI models require an API key. Obtain one from the OpenAI API keys page, then configure LLM:
llm keys set openai
Paste your key at the prompt. LLM saves it in its keys.json file (run llm keys path to see the location) and passes it to the API automatically. Alternatively, set the OPENAI_API_KEY environment variable.

OpenAI Language Models

Run llm models for the full list. The OpenAI models built into LLM are:

GPT-4o Series

| Model ID | Aliases | Notes |
|---|---|---|
| gpt-4o | 4o | Flagship multimodal model; images + PDFs |
| chatgpt-4o-latest | chatgpt-4o | Always points to the latest ChatGPT-4o |
| gpt-4o-mini | 4o-mini | Default model; least expensive |
| gpt-4o-audio-preview | | Accepts audio (MP3, WAV) |
| gpt-4o-audio-preview-2024-12-17 | | Dated snapshot |
| gpt-4o-audio-preview-2024-10-01 | | Dated snapshot |
| gpt-4o-mini-audio-preview | | Mini audio variant |
| gpt-4o-mini-audio-preview-2024-12-17 | | Dated snapshot |

GPT-4.1 Series

| Model ID | Aliases |
|---|---|
| gpt-4.1 | 4.1 |
| gpt-4.1-mini | 4.1-mini |
| gpt-4.1-nano | 4.1-nano |

GPT-4 Series (Legacy)

| Model ID | Aliases |
|---|---|
| gpt-4 | 4, gpt4 |
| gpt-4-32k | 4-32k |
| gpt-4-turbo | gpt-4-turbo-preview, 4-turbo, 4t |
| gpt-4-turbo-2024-04-09 | |
| gpt-4-1106-preview | |
| gpt-4-0125-preview | |
| gpt-4.5-preview | gpt-4.5 |
| gpt-4.5-preview-2025-02-27 | |

GPT-3.5 Series (Legacy)

| Model ID | Aliases |
|---|---|
| gpt-3.5-turbo | 3.5, chatgpt |
| gpt-3.5-turbo-16k | chatgpt-16k, 3.5-16k |

o-Series Reasoning Models

These reasoning models are called through the Chat Completions API, like the other chat models in the built-in plugin, and support a reasoning_effort option (low, medium, high):
| Model ID | Aliases | Streaming |
|---|---|---|
| o1 | | |
| o1-2024-12-17 | | |
| o1-preview | | |
| o1-mini | | |
| o3-mini | | |
| o3 | | |
| o4-mini | | |

GPT-5 Series

| Model ID | Notes |
|---|---|
| gpt-5 | Flagship |
| gpt-5-mini | |
| gpt-5-nano | |
| gpt-5-2025-08-07 | Dated snapshot |
| gpt-5-mini-2025-08-07 | Dated snapshot |
| gpt-5-nano-2025-08-07 | Dated snapshot |
| gpt-5.1, gpt-5.1-chat-latest | |
| gpt-5.2, gpt-5.2-chat-latest | |
| gpt-5.4, gpt-5.4-mini, gpt-5.4-nano | |
| gpt-5.5 | |

Model Features

FeatureChat modelso-series (Responses)Completion
StreamingVaries
Async
Image attachmentsSelected modelsSelected models
PDF attachmentsSelected modelsSelected models
SchemasSelected models
ToolsSelected models
Run llm models --options to see the exact set of features and accepted attachment types for each model.
  • System prompts: OpenAI models are particularly responsive to system prompts, treating them as high-weight instructions.
  • Attachments: Models such as gpt-4o accept images and PDFs via -a. Use llm models --options to see which types a specific model accepts.
  • Schemas: Models with schema support return JSON matching a provided JSON Schema. See the schemas documentation.
  • Options: Common options include temperature, max_tokens, top_p, frequency_penalty, presence_penalty, seed, and json_object. Reasoning models also expose reasoning_effort.
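As a rough illustration of how a model, system prompt, attachment, and options combine on the command line, the following Python sketch assembles an llm invocation as an argument list. The helper build_llm_command is hypothetical and not part of LLM, and photo.jpg is a placeholder file name. The -m, -a, and -o flags are the ones used on this page, and -s is LLM's system prompt flag.

```python
import shlex


def build_llm_command(prompt, model=None, system=None, attachments=(), options=None):
    """Assemble an `llm` CLI invocation as an argv list (hypothetical helper)."""
    argv = ["llm"]
    if model:
        argv += ["-m", model]  # model ID or alias, e.g. 4o-mini
    if system:
        argv += ["-s", system]  # system prompt
    for path in attachments:
        argv += ["-a", path]  # image or PDF attachment
    for name, value in (options or {}).items():
        argv += ["-o", name, str(value)]  # model option, e.g. temperature
    argv.append(prompt)
    return argv


command = build_llm_command(
    "Describe this image",
    model="gpt-4o",
    system="Answer in one sentence.",
    attachments=["photo.jpg"],  # placeholder file name
    options={"temperature": 0.2, "max_tokens": 100},
)

# Print a shell-safe version of the command for copying into a terminal.
print(shlex.join(command))
```

Passing the list to subprocess.run(command) would execute it, assuming llm is installed and an API key is configured. Building a list rather than a string avoids shell quoting problems in prompts.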

OpenAI Embedding Models

Run llm embed-models to list embedding models. LLM includes the following OpenAI embedding models:
| Model | Alias(es) | Vector size |
|---|---|---|
| text-embedding-ada-002 | ada, ada-002 | 1536 |
| text-embedding-3-small | 3-small | 1536 |
| text-embedding-3-large | 3-large | 3072 |
| text-embedding-3-small-512 | 3-small-512 | 512 |
| text-embedding-3-large-256 | 3-large-256 | 256 |
| text-embedding-3-large-1024 | 3-large-1024 | 1024 |
The 3-small model is the most cost-effective. The -512, -256, and -1024 variants are pre-truncated versions of the 3-small and 3-large models — smaller vectors reduce storage and query cost with a modest accuracy trade-off.
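To make the storage trade-off concrete, here is a small Python sketch. It assumes each dimension is stored as a 32-bit float, and that shortening a vector means keeping its leading dimensions and re-normalizing to unit length, which is how OpenAI describes shortening text-embedding-3 vectors. The helper names are illustrative and are not part of LLM.

```python
import math

BYTES_PER_FLOAT32 = 4  # assumption: each dimension is stored as a 32-bit float


def storage_bytes(dimensions, count=1):
    """Approximate raw storage for `count` vectors of the given size."""
    return dimensions * BYTES_PER_FLOAT32 * count


def shorten(vector, dimensions):
    """Keep the first `dimensions` values and re-normalize to unit length."""
    head = vector[:dimensions]
    norm = math.sqrt(sum(x * x for x in head))
    return [x / norm for x in head] if norm else head


# Vector sizes taken from the embedding model table.
sizes = [("3-small", 1536), ("3-small-512", 512), ("3-large", 3072), ("3-large-256", 256)]
for alias, size in sizes:
    megabytes = storage_bytes(size, count=1_000_000) / 1e6
    print(f"{alias}: {megabytes:.0f} MB per million vectors")

# Toy vector, not a real embedding: keep two of three dimensions.
short = shorten([3.0, 4.0, 12.0], 2)
print(short)
```

Under these assumptions a 512-dimension vector takes one third of the space of a 1536-dimension one. How much accuracy is lost depends on the task, so it is worth measuring on your own data.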

OpenAI Completion Models (Legacy)

gpt-3.5-turbo-instruct (aliases: 3.5-instruct, chatgpt-instruct) is a completion model designed to continue text rather than respond to chat-style messages.
llm -m gpt-3.5-turbo-instruct 'Reasons to tame a wild beaver:'
It supports the -o logprobs N option (unavailable on chat models), which stores the N most likely token log-probabilities in the database for each returned token.
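Log-probabilities are natural logarithms, so exponentiating them recovers ordinary probabilities. The Python sketch below assumes you have already extracted the top candidates for one token position into a dictionary. The token strings and values are made up for illustration and do not come from a real response, and to_probabilities is a hypothetical helper.

```python
import math


def to_probabilities(top_logprobs):
    """Convert {token: logprob} into {token: probability}, highest first."""
    probs = {token: math.exp(lp) for token, lp in top_logprobs.items()}
    return dict(sorted(probs.items(), key=lambda item: item[1], reverse=True))


# Made-up values for one token position, as if requested with -o logprobs 3.
example = {" Be": -2.1, " They": -0.22, " 1": -2.9}
for token, probability in to_probabilities(example).items():
    print(f"{token!r}: {probability:.1%}")
```

The probabilities for the top N candidates will usually sum to less than 1, because the remaining mass belongs to tokens outside the top N.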

Adding More OpenAI Models

OpenAI releases new model names regularly. While LLM ships updates to cover them, you can also add models immediately via a configuration file.
1. Find the config directory

dirname "$(llm logs path)"
On macOS this is typically ~/Library/Application Support/io.datasette.llm.
2. Create extra-openai-models.yaml

Create a file called extra-openai-models.yaml in that directory.
3. Add a model entry

- model_id: gpt-3.5-turbo-0613
  model_name: gpt-3.5-turbo-0613
  aliases: ["0613"]
  • model_id — the identifier stored in LLM’s logs and used on the command line
  • model_name — the exact string passed to the OpenAI API
  • aliases — optional short names
Additional flags:
| Key | Description |
|---|---|
| completion: true | Use the completions endpoint instead of chat |
| supports_schema: true | Model supports JSON schema output |
| reasoning: true | Reasoning model (like o1, o3-mini) |
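As a sketch of how the keys and flags above fit together, the Python snippet below renders one entry for extra-openai-models.yaml using only the standard library. The model ID my-new-model and its alias are placeholders rather than real OpenAI model names, and render_entry is a hypothetical helper, not part of LLM.

```python
import json


def render_entry(model_id, model_name=None, aliases=(), **flags):
    """Render one YAML list item for extra-openai-models.yaml."""
    lines = [
        f"- model_id: {model_id}",
        # model_name defaults to model_id when the two are the same.
        f"  model_name: {model_name or model_id}",
    ]
    if aliases:
        # A JSON array is also a valid YAML flow sequence.
        lines.append(f"  aliases: {json.dumps(list(aliases))}")
    for key, enabled in flags.items():
        if enabled:
            lines.append(f"  {key}: true")
    return "\n".join(lines) + "\n"


entry = render_entry(
    "my-new-model",  # placeholder, not a real model name
    aliases=["new"],
    supports_schema=True,
    reasoning=True,
)
print(entry, end="")
```

Appending the printed text to extra-openai-models.yaml would register the placeholder model. Only set a flag when it actually applies to the model being added.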
4. Verify the model is available

llm models
llm -m 0613 'What is the capital of France?'
o1-pro is not accessible through the Chat Completions API used by LLM’s default plugin. Install llm-openai-plugin to access it.