Skip to main content

Documentation Index

Fetch the complete documentation index at: https://mintlify.com/DenisSergeevitch/agents-best-practices/llms.txt

Use this file to discover all available pages before exploring further.

Not every agent needs to act autonomously. Granting more autonomy than your evals justify is the fastest way to introduce hard-to-audit errors into production systems. The four autonomy levels below form a progression: start at the lowest level that creates value for your use case, measure whether it is sufficient, and only move up when evidence shows you need to.
Do not start at autonomous. Always begin at the lowest autonomy level that creates demonstrable value for your use case. Move up only when evals show the simpler level is insufficient.

Answer-only

The agent reads, interprets, and responds. It takes no actions and produces no side effects. All output is text the user reads and acts on themselves.What it means: The model receives context (documents, data, conversation history) and produces a natural-language response. No tools with side effects are available. No state is written. The harness is minimal: context builder, model adapter, and response formatter.When to use it:
  • Q&A over provided documents or knowledge bases
  • Summarization and classification
  • Short drafting tasks where the user pastes or applies the output manually
  • Any case where the cost of an incorrect automated action exceeds the cost of the user reviewing and applying the answer themselves
Permission model: No permission engine required for actions. If retrieval tools are available, they should be read-only with no write scope. No approval manager is needed.Example domain: A support knowledge base assistant that answers agent questions from a policy corpus. The support agent reads the answer and decides what to tell the customer. The model takes no action in any ticketing system.

Choosing and upgrading autonomy levels

Use this decision path when starting a new agent or evaluating whether to increase autonomy:
1

Identify the primary job-to-be-done

What is the one task this agent must accomplish? Define a measurable done condition before choosing a level.
2

Start at the lowest level that creates value

Answer-only and draft-only are almost always sufficient for a first version. They are faster to build, easier to evaluate, and safer to deploy.
3

Run evals at the current level

Measure task success rate, error rate, and user satisfaction. Document specific failure cases where the current level is insufficient.
4

Move up only when evals justify it

If evals show the current level cannot meet the use case requirements, move up exactly one level. Re-run evals at the new level before promoting to production.
5

Narrow scope when increasing autonomy

Each increase in autonomy level should come with a corresponding decrease in tool scope. More autonomy requires narrower, more tightly permissioned tools.

Build docs developers (and LLMs) love