Skip to main content

Documentation Index

Fetch the complete documentation index at: https://mintlify.com/vectorize-io/hindsight/llms.txt

Use this file to discover all available pages before exploring further.

Most AI agents forget everything the moment a session ends. Every new conversation starts from zero — no context about the user, no lessons learned, no accumulated knowledge. Hindsight is built to change that. It gives your agents a structured, persistent memory system that learns over time, not just one that remembers raw conversation history.
Hindsight has achieved state-of-the-art performance on the LongMemEval benchmark — independently reproduced by researchers at Virginia Tech and The Washington Post.

The problem with existing approaches

Simple vector search is not enough for real agent memory. Answering “What did Alice work on last spring?” requires temporal reasoning, not just semantic similarity. Knowing “Alice works at Google” and “Google is in Mountain View” should let an agent answer “Where does Alice work?” even if that exact fact was never stored. And an agent that receives conflicting information over time needs to consolidate that knowledge rather than pile up contradictions. Hindsight addresses these limitations with a memory architecture designed specifically for how agents need to learn and reason.

Biomimetic memory hierarchy

Hindsight organizes knowledge into a structured hierarchy inspired by how human memory works:
TypeWhat it storesExample
Mental ModelUser-curated summaries for common queries”Team communication best practices”
ObservationAutomatically consolidated knowledge from raw facts”User was a React enthusiast but has now switched to Vue”
World FactObjective facts about the world”Alice works at Google as a software engineer”
Experience FactThe agent’s own actions and interactions”I recommended Python to Bob last Tuesday”
Memories are stored in memory banks — isolated namespaces you can scope per user, per agent, or per use case. When new content is retained, Hindsight extracts entities, relationships, and temporal data, then represents them across multiple search indexes for accurate retrieval later.

TEMPR: multi-strategy retrieval

When you recall memories, Hindsight runs four search strategies in parallel and merges their results using reciprocal rank fusion and cross-encoder reranking:
StrategyBest for
SemanticConceptual similarity, paraphrasing
Keyword (BM25)Names, technical terms, exact matches
GraphRelated entities, indirect connections
Temporal”last spring”, “in June”, time ranges
This is the TEMPR retrieval system — Temporal, Entity, Meaning, Pattern, Rank. No single strategy dominates; all four run together and the best results surface to the top.

The three core operations

Every interaction with Hindsight uses one of three operations:
  • Retain — Store new information. Hindsight extracts facts, entities, and relationships from the content and indexes them for later recall.
  • Recall — Search memories. All four retrieval strategies run in parallel and results are reranked by relevance.
  • Reflect — Reason over memories. Hindsight uses the bank’s mission, directives, and disposition traits to generate a contextual, grounded response.
Reflect checks memory sources in priority order: Mental Models → Observations → Raw Facts. This means curated knowledge surfaces before raw facts.

Configurable bank identity

Memory banks can be configured with a mission (what the bank knows and cares about), directives (hard rules that must never be violated), and disposition traits (skepticism, literalism, empathy on a 1–5 scale). These settings shape how reflect reasons — not recall — so retrieval stays neutral while generation stays on-brand.

Where to go next

Quickstart

Get Hindsight running and store your first memory in under 60 seconds.

Deploy Hindsight

Docker, Helm/Kubernetes, pip, or embedded in a Python application.

Python SDK

Full API reference for the Python client, including async usage.

Integrations

Connect Hindsight to LangGraph, CrewAI, n8n, and more.

Build docs developers (and LLMs) love