Reflect API — reason over agent memories

The reflect endpoint generates a synthesized, grounded response to a query by running an agentic reasoning loop over a memory bank. Unlike recall — which returns raw facts — reflect retrieves memories autonomously using multiple tools, applies the bank’s disposition traits and directives to shape the reasoning style, and produces a final answer written by the LLM. Mental models are checked first during retrieval, followed by observations, then raw facts.

To learn about disposition-driven reasoning and the agentic loop in depth, see the Reflect Architecture guide.

Endpoint

POST /v1/{tenant}/banks/{bank_id}/reflect

Request parameters

path.tenant

string

required

Your tenant identifier. Use default for single-tenant deployments.

path.bank_id

string

required

The memory bank to reason over.

query

string

required

The question or prompt to reflect on. If you have situational context that should influence the answer, include it directly in the query rather than as a separate field.

budget

string

default:"low"

Controls how thoroughly the agent explores the memory bank before answering. Accepted values: low (shallow search, optimized for speed), mid (checks multiple sources when the question warrants it), high (deep exploration across all knowledge levels, uses multiple query variations to find indirect connections). Use high for complex questions that require synthesizing information from many sources.

max_tokens

integer

default:"4096"

Limits the length of the final generated response. Does not affect how much the agent can retrieve during the agentic loop — only the final answer length.

response_schema

object

An optional JSON Schema object. When provided, the LLM generates a response conforming to the schema and the response includes a structured_output field with the result parsed accordingly. The text field will be empty. Use this when you need to process the response programmatically rather than display it as prose.

Response fields

text

string

required

The synthesized answer as a well-formatted markdown string. This is the primary output of reflect. Empty when response_schema is provided — use structured_output instead.

structured_output

object

The LLM’s response parsed according to the response_schema provided in the request. Only present when response_schema was set. null otherwise.

based_on

object

The sources the agent used to construct the answer. Only present when include.facts was enabled.

Show fields

memories

object[]

Memory facts retrieved and cited. Each item has id, text, type, context, occurred_start, and occurred_end.

mental_models

object[]

Mental models used. Each item has id, text, and context.

directives

object[]

Directives enforced during reasoning. Each item has id, name, and content.

usage

object

Token usage for all LLM calls made during the agentic loop.

Show fields

input_tokens

integer

Total tokens sent to the LLM across all agent loop iterations.

output_tokens

integer

Total tokens received from the LLM.

total_tokens

integer

Combined total.

trace

object

Full execution log of the agentic loop. Only present when include.tool_calls was enabled.

Show fields

tool_calls

object[]

Each tool invocation with tool name (lookup, recall, learn, expand), input, output (if output: true), duration_ms, and iteration number.

llm_calls

object[]

Each LLM call with scope (e.g. "agent_1", "final") and duration_ms.

Examples

Basic reflect

curl -X POST "https://your-hindsight-host/v1/default/banks/my-bank/reflect" \
  -H "Authorization: Bearer $HINDSIGHT_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "query": "What should I know about Alice before our meeting today?"
  }'

Reflect with budget and source attribution

curl -X POST "https://your-hindsight-host/v1/default/banks/my-bank/reflect" \
  -H "Authorization: Bearer $HINDSIGHT_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "query": "Summarize all architectural decisions made in Q1 2024 and their rationale.",
    "budget": "high",
    "include": {"facts": {}}
  }'

Structured output

curl -X POST "https://your-hindsight-host/v1/default/banks/my-bank/reflect" \
  -H "Authorization: Bearer $HINDSIGHT_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "query": "What are Alice'\''s top three priorities this week?",
    "response_schema": {
      "type": "object",
      "properties": {
        "priorities": {
          "type": "array",
          "items": {"type": "string"},
          "maxItems": 3
        }
      },
      "required": ["priorities"]
    }
  }'

Tag-scoped reflect

response = client.reflect(
    bank_id="my-bank",
    query="What does this user prefer?",
    tags=["user:alice"],
    tags_match="all_strict",
)
print(response.text)

Error codes

Status	Code	Description
`400`	`invalid_request`	Malformed request body or missing required fields.
`401`	`unauthorized`	Missing or invalid API key.
`404`	`bank_not_found`	The specified bank does not exist.
`422`	`validation_error`	One or more parameters failed validation.
`429`	`rate_limited`	Too many requests. Retry with exponential backoff.
`500`	`internal_error`	Server error during the agentic loop.

Core Methods

Resources

Reflect API — reason over agent memories

Endpoint

Request parameters

Response fields

Examples

Basic reflect

Reflect with budget and source attribution

Structured output

Tag-scoped reflect

Error codes

Build docs developers (and LLMs) love

Core Methods

Resources

Documentation Index

​Endpoint

​Request parameters

​Response fields

​Examples

​Basic reflect

​Reflect with budget and source attribution

​Structured output

​Tag-scoped reflect

​Error codes

Build docs developers (and LLMs) love

Endpoint

Request parameters

Response fields

Examples

Basic reflect

Reflect with budget and source attribution

Structured output

Tag-scoped reflect

Error codes