Hindsight is an open-source agent memory system built to make AI agents smarter over time. Unlike simple conversation history or vector search, Hindsight uses biomimetic memory structures — organizing information into world facts, experiences, observations, and mental models — so agents can reason about what they know, not just what they’ve stored.Documentation Index
Fetch the complete documentation index at: https://mintlify.com/vectorize-io/hindsight/llms.txt
Use this file to discover all available pages before exploring further.
Quick Start
Add persistent memory to your agent in under 60 seconds
Installation
Deploy with Docker, Helm, pip, or embedded in Python
API Reference
Explore retain, recall, and reflect endpoints
Integrations
Connect with LangChain, LlamaIndex, CrewAI, OpenAI Agents, and more
How Hindsight works
Hindsight exposes three simple operations that handle all memory interactions:Retain — store information
Pass text, conversations, or documents to Hindsight. The LLM extracts facts, entities, and relationships and stores them in a structured knowledge graph — not raw text.
Recall — search memories
Query memories using four parallel strategies: semantic similarity, keyword (BM25), graph traversal, and temporal reasoning. Results are fused and reranked automatically.
Key capabilities
Biomimetic memory hierarchy
Facts, experiences, observations, and mental models mirror how human memory consolidates knowledge over time
TEMPR multi-strategy retrieval
Semantic, keyword, graph, and temporal search run in parallel — fused with RRF and reranked with a cross-encoder
Automatic consolidation
Related facts are deduplicated and synthesized into observations, with evidence tracking and freshness trends
Mission & disposition
Configure per-bank personality, directives, and reasoning style to shape how the agent responds
20+ LLM providers
Works with OpenAI, Anthropic, Gemini, Groq, Ollama, local models, and any OpenAI-compatible endpoint
Built-in MCP server
Every memory bank exposes an MCP endpoint — connect Claude, Cursor, or any MCP-compatible tool directly
Core concepts
Memory architecture
Understand the hierarchy of memory types and how they interact
Retain
How Hindsight extracts and structures facts from raw content
Recall
How TEMPR’s four-way retrieval finds the right memories
Memory banks
Configure mission, directives, and disposition per agent
SDKs & clients
Python SDK
pip install hindsight-clientNode.js SDK
npm install @vectorize-io/hindsight-clientGo client
go get github.com/vectorize-io/hindsight/hindsight-clients/goCLI
curl -fsSL https://hindsight.vectorize.io/get-cli | bash