Containerizing agents with Docker for production

Docker solves the most persistent problem in AI agent deployment: the gap between “works on my machine” and “works in production.” By packaging your agent, its Python version, and every dependency into a single image, you guarantee identical behavior across your laptop, a CI runner, and a cloud server. This page walks you through the full workflow — from writing your first Dockerfile to running a dynamic, environment-variable-driven agent container.

Reproducibility

One image runs identically on any host, eliminating version-conflict surprises.

Portability

Ship the same container to AWS, GCP, Kubernetes, or a colleague’s machine.

Separation of concerns

Keep credentials out of code by injecting them as environment variables at runtime.

Prerequisites

Docker Desktop installed and running
A Docker Hub account (for pushing and pulling images)
Verify Docker is running:

docker --version
# Docker version 28.0.4, build b8034c0

Core Dockerfile instructions

A Dockerfile is the image’s recipe. Each instruction either creates a new layer or adds metadata:

Instruction	Purpose
`FROM`	Base image (OS + runtime)
`RUN`	Execute shell commands during build
`COPY`	Copy files from host into image
`ENV`	Set default environment variables
`ARG`	Build-time parameters
`WORKDIR`	Set working directory inside image
`EXPOSE`	Document the port your service listens on
`CMD`	Default command when the container starts

FROM, RUN, and COPY create cacheable layers. LABEL, ENV, ARG, and CMD add metadata only.

Building and running your first image

Write a minimal Dockerfile

Start from the official Python 3.10 base image and install a system dependency:

examples/ex1/Dockerfile

FROM python:3.10

LABEL example=1

ENV PYTHON_VER=3.10

RUN apt-get update && apt-get install -y --no-install-recommends curl

Build the image

Run this from the repository root. The -f flag points to the Dockerfile; -t sets the name and tag:

docker build . -f ./examples/ex1/Dockerfile -t rkrispin/vscode-python:ex1

Validate the build succeeded:

docker images
# REPOSITORY               TAG   IMAGE ID       CREATED        SIZE
# rkrispin/vscode-python   ex1   a8e4c6d06c97   43 hours ago   1.02GB

Run the container interactively

docker run --interactive --tty python:3.10

You land inside a Python REPL running inside the container.

Layer caching

Docker caches every layer. When you rebuild, only layers below the first changed instruction are rebuilt. Order your Dockerfile from least-changed (base image, system packages) to most-changed (application code) to maximize cache reuse.

FROM python:3.10

LABEL example=2

ENV PYTHON_VER=3.10

RUN apt-get update && apt-get install -y --no-install-recommends curl

RUN apt-get update && apt-get install -y --no-install-recommends vim

Put COPY requirements.txt and RUN pip install before COPY . . so dependency installation is cached across code-only changes.

Volume mounts

Containers are ephemeral by default — any file written inside them disappears when the container stops. Mount a local directory with -v to persist data or inject code at runtime:

docker run -v ./examples/ex3:/ex3 --interactive --tty rkrispin/vscode-python:ex3

Inside the container:

root@8c94db531eb4:/# cd ex3
root@8c94db531eb4:/ex3# ls
Dockerfile  hello-world.py
root@8c94db531eb4:/ex3# python hello-world.py
Hello World!

This pattern is useful for mounting local model weights into a container without baking large files into the image.

Dockerizing an AI agent

The following example containers a minimal OpenAI agent. The agent script contains no credentials — the API key arrives at runtime through --env.

Agent script
Dockerfile

examples/ex4/simple_agent.py

from openai import OpenAI

client = OpenAI()

response = client.chat.completions.create(
    model="gpt-4o-mini",
    messages=[{"role": "user", "content": "What is the capital of France?"}],
)

print(response.choices[0].message.content)

examples/ex4/Dockerfile

FROM python:3.10-slim

RUN pip install --no-cache-dir openai

COPY ./examples/ex4/simple_agent.py /app/simple_agent.py

CMD ["python", "/app/simple_agent.py"]

Build and run:

docker build . -f ./examples/ex4/Dockerfile -t rkrispin/simple_agent:ex4

docker run --env OPENAI_API_KEY=$OPENAI_API_KEY rkrispin/simple_agent:ex4
# The capital of France is Paris.

Never bake secrets into an image with ENV OPENAI_API_KEY=sk-.... Anyone who pulls the image gains access to the key. Always pass secrets at runtime with --env or a secrets manager.

Dynamic agents via environment variables

Make the agent reusable by reading the question from an environment variable:

import os
from openai import OpenAI

client = OpenAI()

question = os.getenv("QUESTION", "What is the capital of France?")

response = client.chat.completions.create(
    model="gpt-4o-mini",
    messages=[{"role": "user", "content": question}],
)

print(f"Question: {question}")
print(f"Answer: {response.choices[0].message.content}")

Run with a custom question:

docker build . -f ./examples/ex5/Dockerfile -t rkrispin/dynamic_agent:ex5

docker run \
  --env OPENAI_API_KEY=$OPENAI_API_KEY \
  --env QUESTION="What is the capital of United Kingdom" \
  rkrispin/dynamic_agent:ex5
# Question: What is the capital of United Kingdom
# Answer: The capital of the United Kingdom is London.

Next steps

Deploy on RunPod GPU

Push your Docker image to RunPod’s serverless GPU infrastructure for scalable inference.

Run LLMs locally with Ollama

Mount open-weight models into a container for fully offline, on-premises deployments.

Get Started

Agent Frameworks

Memory & Knowledge

Tool Integration & Data

Deployment

Observability & Quality

Containerizing agents with Docker for production

Reproducibility

Portability

Separation of concerns

Prerequisites

Core Dockerfile instructions

Building and running your first image

Layer caching

Volume mounts

Dockerizing an AI agent

Dynamic agents via environment variables

Next steps

Deploy on RunPod GPU

Run LLMs locally with Ollama

Build docs developers (and LLMs) love

Get Started

Agent Frameworks

Memory & Knowledge

Tool Integration & Data

Deployment

Observability & Quality

Documentation Index

Reproducibility

Portability

Separation of concerns

​Prerequisites

​Core Dockerfile instructions

​Building and running your first image

​Layer caching

​Volume mounts

​Dockerizing an AI agent

​Dynamic agents via environment variables

​Next steps

Deploy on RunPod GPU

Run LLMs locally with Ollama

Build docs developers (and LLMs) love

Prerequisites

Core Dockerfile instructions

Building and running your first image

Layer caching

Volume mounts

Dockerizing an AI agent

Dynamic agents via environment variables

Next steps