Retain API — ingest content as agent memories

The retain endpoint ingests raw content into a memory bank and transforms it into structured, queryable memories. Hindsight runs LLM-based fact extraction on the content, resolves entities against the existing knowledge graph, and stores the resulting facts — not the original text. A single retain call can produce many memories depending on how much information the content contains.

To learn about fact extraction, entity resolution, and graph construction in depth, see the Retain Architecture guide.

Endpoint

POST /v1/{tenant}/banks/{bank_id}/retain
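As a rough illustration, the endpoint path can be assembled and wrapped in a request object with Python's standard library. This is a sketch, not an official client: `build_retain_request` is a hypothetical helper name, the host is the same placeholder used in the examples in this document, and the request is only built, never sent.

```python
import json
import urllib.parse
import urllib.request


def build_retain_request(base_url, tenant, bank_id, items, api_key):
    """Build (but do not send) a POST request for the retain endpoint.

    Hypothetical helper for illustration; not part of any Hindsight SDK.
    """
    # Percent-encode the path segments so unusual IDs cannot break the path.
    path = "/v1/{}/banks/{}/retain".format(
        urllib.parse.quote(tenant, safe=""),
        urllib.parse.quote(bank_id, safe=""),
    )
    body = json.dumps({"items": items}).encode("utf-8")
    return urllib.request.Request(
        base_url.rstrip("/") + path,
        data=body,
        method="POST",
        headers={
            "Authorization": "Bearer " + api_key,
            "Content-Type": "application/json",
        },
    )


req = build_retain_request(
    "https://your-hindsight-host",  # placeholder host
    "default",
    "my-bank",
    [{"content": "Alice prefers async communication.", "context": "slack"}],
    api_key="YOUR_API_KEY",  # placeholder credential
)
print(req.get_method(), req.full_url)
```

Passing the returned object to `urllib.request.urlopen` would send it; that step is left out here so the sketch runs without a server.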

Request parameters

path.tenant

string

required

Your tenant identifier. Use "default" for single-tenant deployments.

path.bank_id

string

required

The memory bank to store content into. The bank is created automatically if it does not exist.

items

object[]

required

One or more content items to retain. Each item is a piece of raw content — a conversation, document, or note — that Hindsight will analyze and decompose into structured memories.

Item fields

content

string

required

The raw text to extract facts from. Hindsight chunks the content, sends each chunk to the LLM, and stores the resulting facts — not the original text.

timestamp

string

When the event described in the content occurred. Accepts ISO 8601 strings (e.g. "2024-01-15T10:30:00Z"), null to default to the current ingestion time, or "unset" to store the content without any timestamp. Use "unset" for timeless reference material such as books or documentation.

context

string

A short label describing the source or situation — for example "team meeting", "slack", or "support ticket". Injected directly into the LLM extraction prompt, so it actively shapes how facts are interpreted. Providing context consistently is one of the highest-leverage things you can do to improve memory quality.

metadata

object

Arbitrary key-value string pairs that provide context about this item. For example: {"source": "slack", "channel": "engineering", "thread_id": "T123"}. Metadata is included in the extraction prompt and stored on each extracted memory, so it is returned with every recall result.

document_id

string

A caller-supplied string that groups one or more items under a logical document and makes retain idempotent. When provided, Hindsight upserts the document: if a document with that ID already exists, it and all its associated memories are deleted before the new content is processed. If omitted, Hindsight assigns a random UUID per request, so re-ingesting the same content creates duplicate memories.

update_mode

string

default: "replace"

Controls how Hindsight handles a retain call whose document_id matches an existing document. "replace" deletes the old document and all its memories, then processes the new content from scratch. "append" concatenates the new content onto the existing document and reprocesses the combined document, but only the new chunks trigger LLM extraction. Append mode requires a document_id.
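The item fields above can be combined as in the following Python sketch, which assembles item dictionaries and enforces the one rule the field descriptions state outright (append mode requires a document_id). `make_item` is a hypothetical helper, and the sketch assumes update_mode sits on the item next to document_id, as the field list suggests. It also assumes that leaving out timestamp behaves like null, which this page does not confirm.

```python
from datetime import datetime, timezone

_OMIT = object()  # sentinel meaning "leave the timestamp key out of the item"


def make_item(content, *, timestamp=_OMIT, context=None, metadata=None,
              document_id=None, update_mode=None):
    """Assemble one retain item (hypothetical helper, for illustration)."""
    if update_mode not in (None, "replace", "append"):
        raise ValueError("update_mode must be 'replace' or 'append'")
    if update_mode == "append" and not document_id:
        # Documented rule: append mode requires a document_id.
        raise ValueError("append mode requires a document_id")

    item = {"content": content}
    if timestamp is not _OMIT:
        if isinstance(timestamp, datetime):
            # Serialize timezone-aware datetimes as ISO 8601 in UTC.
            timestamp = timestamp.astimezone(timezone.utc).strftime(
                "%Y-%m-%dT%H:%M:%SZ")
        # May be an ISO 8601 string, None (JSON null), or "unset".
        item["timestamp"] = timestamp
    if context is not None:
        item["context"] = context
    if metadata is not None:
        # Metadata is described as string key-value pairs, so coerce both.
        item["metadata"] = {str(k): str(v) for k, v in metadata.items()}
    if document_id is not None:
        item["document_id"] = document_id
    if update_mode is not None:
        item["update_mode"] = update_mode
    return item


# Timeless reference material: stored without any timestamp.
reference = make_item("HTTP 429 means Too Many Requests.",
                      timestamp="unset", context="documentation")

# A dated event appended to an existing logical document.
meeting = make_item("Alice will own the migration.",
                    timestamp=datetime(2024, 1, 15, 10, 30, tzinfo=timezone.utc),
                    context="team meeting",
                    document_id="meeting-2024-01-15",
                    update_mode="append")
```

Reusing the same document_id on every re-ingestion is what keeps repeated calls from piling up duplicate memories.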

Response fields

success

boolean

required

Whether the operation completed without errors.

bank_id

string

required

The memory bank that received the content.

items_count

integer

required

Number of items processed in this request.

async

boolean

required

Whether the content is processed asynchronously rather than within the request.

operation_id

string

Present when async is true. Use this ID to poll GET /v1/{tenant}/banks/{bank_id}/operations/{operation_id} for completion status.

usage

object

Token usage for the LLM extraction calls. Only present for synchronous operations.

Usage fields

input_tokens

integer

Tokens sent to the LLM.

output_tokens

integer

Tokens received from the LLM.

total_tokens

integer

Total tokens consumed.
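A small Python sketch of how a caller might branch on the response fields above. `summarize_retain_response` is a hypothetical helper, and the two sample bodies contain made-up values chosen only to show the synchronous and asynchronous shapes described in this section.

```python
def summarize_retain_response(resp):
    """Describe a parsed retain response body in one line (illustrative)."""
    if not resp.get("success"):
        return "retain failed"
    if resp.get("async"):
        # operation_id is present when async is true.
        return "queued as operation {}".format(resp.get("operation_id"))
    usage = resp.get("usage")
    if usage is None:
        return "{} item(s) processed, usage not reported".format(
            resp["items_count"])
    return "{} item(s) processed, {} total tokens".format(
        resp["items_count"], usage["total_tokens"])


# Made-up sample bodies, for illustration only.
sync_resp = {
    "success": True, "bank_id": "my-bank", "items_count": 2, "async": False,
    "usage": {"input_tokens": 900, "output_tokens": 100, "total_tokens": 1000},
}
async_resp = {
    "success": True, "bank_id": "my-bank", "items_count": 1, "async": True,
    "operation_id": "op-example",
}

print(summarize_retain_response(sync_resp))
print(summarize_retain_response(async_resp))
```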

Examples

Basic retain

curl -X POST "https://your-hindsight-host/v1/default/banks/my-bank/retain" \
  -H "Authorization: Bearer $HINDSIGHT_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "items": [
      {
        "content": "Alice joined the platform in January 2024 and prefers async communication over meetings.",
        "context": "onboarding",
        "metadata": {"source": "crm", "account_id": "acc_123"}
      }
    ]
  }'

Batch retain

Submit multiple items in a single request to reduce network overhead. Hindsight processes the batch and can optimize extraction across related content.

curl -X POST "https://your-hindsight-host/v1/default/banks/my-bank/retain" \
  -H "Authorization: Bearer $HINDSIGHT_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "items": [
      {
        "content": "Alice prefers async communication.",
        "context": "slack",
        "tags": ["user:alice"]
      },
      {
        "content": "Bob joined the engineering team in Q1 2024.",
        "context": "hr-system",
        "tags": ["user:bob"]
      }
    ]
  }'

Async retain

For large batches, use async mode to avoid blocking your application.

curl -X POST "https://your-hindsight-host/v1/default/banks/my-bank/retain" \
  -H "Authorization: Bearer $HINDSIGHT_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "items": [{"content": "...large batch of content..."}],
    "async": true
  }'

To cut LLM costs on async retain by 50%, enable provider Batch API support by setting HINDSIGHT_API_RETAIN_BATCH_ENABLED=true. OpenAI and Groq both offer this discount in exchange for a processing window of up to 24 hours, so memories from a batched request may not be available until that window has passed.
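After an async retain, the returned operation_id can be polled at the operations endpoint listed under Response fields. The sketch below shows one way to structure that loop in Python. Because this page does not document the shape of the operation status body, both the fetch step and the completion check are passed in by the caller; `wait_for_operation` and the `"status"` key used in the demo are hypothetical.

```python
import time


def wait_for_operation(fetch_status, is_finished, interval=2.0, timeout=300.0,
                       sleep=time.sleep, clock=time.monotonic):
    """Poll until is_finished(status) is true or the timeout elapses.

    fetch_status: callable performing the GET on the operations endpoint
                  and returning the parsed body.
    is_finished:  callable deciding whether that body means "done"; the
                  body's schema is not documented here, so the caller
                  supplies this check.
    """
    deadline = clock() + timeout
    while True:
        status = fetch_status()
        if is_finished(status):
            return status
        if clock() >= deadline:
            raise TimeoutError("operation did not finish within the timeout")
        sleep(interval)


# Demo with a fake fetcher; the "status" key and its values are assumptions.
fake_bodies = iter([{"status": "pending"}, {"status": "completed"}])
final = wait_for_operation(
    fetch_status=lambda: next(fake_bodies),
    is_finished=lambda body: body["status"] == "completed",
    interval=0.0,
)
print(final)
```

With provider Batch API support enabled, processing can take up to 24 hours, so a timeout measured in minutes would be too short for that mode.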

Error codes

Status	Code	Description
`400`	`invalid_request`	Malformed request body or missing required fields.
`401`	`unauthorized`	Missing or invalid API key.
`404`	`bank_not_found`	The specified bank does not exist and could not be created.
`422`	`validation_error`	One or more item fields failed validation.
`429`	`rate_limited`	Too many requests. Retry with exponential backoff.
`500`	`internal_error`	Server error during processing.
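The 429 row recommends retrying with exponential backoff. A minimal Python sketch of that pattern follows; `retry_on_rate_limit` is a hypothetical helper, the delay values are arbitrary defaults rather than documented limits, and the transport is abstracted as a `send` callable that returns a `(status_code, body)` tuple.

```python
import random
import time


def retry_on_rate_limit(send, max_attempts=5, base_delay=1.0, max_delay=30.0,
                        sleep=time.sleep):
    """Call send() until it returns a non-429 status or attempts run out."""
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")
    for attempt in range(max_attempts):
        status, body = send()
        if status != 429:
            return status, body
        if attempt < max_attempts - 1:
            # Double the delay on each attempt, capped at max_delay.
            delay = min(max_delay, base_delay * 2 ** attempt)
            # Add up to 50% random jitter so clients do not retry in lockstep.
            sleep(delay + random.uniform(0, delay / 2))
    # Still rate limited after the final attempt; return the last response.
    return status, body


# Demo with a fake transport that is rate limited twice, then succeeds.
responses = iter([(429, "rate_limited"), (429, "rate_limited"), (200, "ok")])
waits = []
result = retry_on_rate_limit(lambda: next(responses), sleep=waits.append)
print(result, len(waits))
```

Only 429 is retried in this sketch. Whether a 500 is safe to retry depends on whether the request carries a document_id, since a retain call without one is not idempotent.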

See the Retain Architecture guide for a deep dive into how Hindsight extracts facts, resolves entities, and builds the knowledge graph.
