Hindsight — agent memory that learns over time

Hindsight is an open-source agent memory system built to make AI agents smarter over time. Unlike simple conversation history or vector search, Hindsight uses biomimetic memory structures — organizing information into world facts, experiences, observations, and mental models — so agents can reason about what they know, not just what they’ve stored.

Quick Start

Add persistent memory to your agent in under 60 seconds

Installation

Deploy with Docker, Helm, pip, or embedded in Python

API Reference

Explore retain, recall, and reflect endpoints

Integrations

Connect with LangChain, LlamaIndex, CrewAI, OpenAI Agents, and more

How Hindsight works

Hindsight exposes three simple operations that handle all memory interactions:

Retain — store information

Pass text, conversations, or documents to Hindsight. The LLM extracts facts, entities, and relationships and stores them in a structured knowledge graph — not raw text.

client.retain(bank_id="my-agent", content="Alice works at Google as a software engineer")

Recall — search memories

Query memories using four parallel strategies: semantic similarity, keyword (BM25), graph traversal, and temporal reasoning. Results are fused and reranked automatically.

results = client.recall(bank_id="my-agent", query="Where does Alice work?")

Reflect — reason with memory

Generate disposition-aware responses grounded in retrieved memories. Optionally configure a mission, directives, and personality traits per memory bank.

answer = client.reflect(bank_id="my-agent", query="Tell me about Alice")

Key capabilities

Biomimetic memory hierarchy

Facts, experiences, observations, and mental models mirror how human memory consolidates knowledge over time

TEMPR multi-strategy retrieval

Semantic, keyword, graph, and temporal search run in parallel — fused with RRF and reranked with a cross-encoder

Automatic consolidation

Related facts are deduplicated and synthesized into observations, with evidence tracking and freshness trends

Mission & disposition

Configure per-bank personality, directives, and reasoning style to shape how the agent responds

20+ LLM providers

Works with OpenAI, Anthropic, Gemini, Groq, Ollama, local models, and any OpenAI-compatible endpoint

Built-in MCP server

Every memory bank exposes an MCP endpoint — connect Claude, Cursor, or any MCP-compatible tool directly

Core concepts

Memory architecture

Understand the hierarchy of memory types and how they interact

Retain

How Hindsight extracts and structures facts from raw content

Recall

How TEMPR’s four-way retrieval finds the right memories

Memory banks

Configure mission, directives, and disposition per agent

SDKs & clients

Python SDK

pip install hindsight-client

Node.js SDK

npm install @vectorize-io/hindsight-client

Go client

go get github.com/vectorize-io/hindsight/hindsight-clients/go

CLI

curl -fsSL https://hindsight.vectorize.io/get-cli | bash

Auto-generate your docs

How Hindsight works
Key capabilities
Core concepts
SDKs & clients

Build docs developers (and LLMs) love

Get started for free Talk to us

Get Started

Core Concepts

SDKs & Clients

Integrations

Deployment & Operations

Hindsight — agent memory that learns over time

Quick Start

Installation

API Reference

Integrations

How Hindsight works

Key capabilities

Biomimetic memory hierarchy

TEMPR multi-strategy retrieval

Automatic consolidation

Mission & disposition

20+ LLM providers

Built-in MCP server

Core concepts

Memory architecture

Retain

Recall

Memory banks

SDKs & clients

Python SDK

Node.js SDK

Go client

CLI

Build docs developers (and LLMs) love

Get Started

Core Concepts

SDKs & Clients

Integrations

Deployment & Operations

Documentation Index

Quick Start

Installation

API Reference

Integrations

​How Hindsight works

​Key capabilities

Biomimetic memory hierarchy

TEMPR multi-strategy retrieval

Automatic consolidation

Mission & disposition

20+ LLM providers

Built-in MCP server

​Core concepts

Memory architecture

Retain

Recall

Memory banks

​SDKs & clients

Python SDK

Node.js SDK

Go client

CLI

Build docs developers (and LLMs) love

How Hindsight works

Key capabilities

Core concepts

SDKs & clients