Documentation Index
Fetch the complete documentation index at: https://mintlify.com/MemoriLabs/Memori/llms.txt
Use this file to discover all available pages before exploring further.
How Memory Works
Memori gives your AI application long-term memory. Instead of forgetting everything after each conversation, your AI can remember facts, preferences, and context across sessions and across different applications.Attribution
Every memory in Memori is tagged with three dimensions: who (entity), what (process), and which conversation (session).- Entity (
entity_id) — The person, place, or thing generating memories. Typically a user ID (e.g.,"user_alice","company_acme"). Maximum length: 100 characters. - Process (
process_id) — The agent, program, or workflow creating memories (e.g.,"support_bot","code_review_agent"). Maximum length: 100 characters. - Session (
session_id) — Groups related LLM interactions into a conversation thread. Auto-generated as a UUID by default.
entity_id + process_id + session_id creates a unique memory scope — different users have isolated memories, the same user can have different context in different applications, and each conversation is tracked separately.
Memory Types
When you have a conversation through a Memori-wrapped LLM client, Advanced Augmentation extracts structured memories in the background:| Type | What it captures | Example |
|---|---|---|
| Facts | Objective information with embeddings | ”User uses PostgreSQL for production databases” |
| Preferences | Choices, opinions, and tastes | ”Prefers concise answers” |
| Skills & Knowledge | Abilities and expertise levels | ”Experienced with React (5 years)“ |
| Attributes | Process-level information about the agent | ”Handles billing and subscription queries” |
How Recall Works
Recall brings stored memories back into your AI conversations. There are two modes.Automatic Recall (Default)
On every LLM call, Memori automatically:- Intercepts the outbound request
- Uses semantic search to find relevant facts for the current entity
- Injects the most relevant memories into the system prompt
- Forwards the enriched request to the LLM
Manual Recall
Usemem.recall() to retrieve memories explicitly — useful for building custom prompts, displaying memories in a UI, or debugging.
id, content, similarity (0–1 relevance score), rank_score, and date_created.
Recall Configuration
Memori uses semantic search (vector similarity) to find relevant facts. You can tune recall behavior with:| Option | Default | Description |
|---|---|---|
mem.config.recall_relevance_threshold | 0.1 | Minimum similarity score for a fact to be included |
mem.config.recall_embeddings_limit | 1000 | Maximum number of embeddings to compare against |
mem.config.recall_facts_limit | 5 | Default number of facts to return |
Memory Lifecycle
- Conversation — Your user talks to your AI through the wrapped LLM client
- Capture — Memori intercepts and stores the raw conversation
- Augmentation — Advanced Augmentation processes the conversation asynchronously, extracting structured memories
- Extraction — Facts, preferences, skills, and attributes are identified
- Storage — Extracted memories are stored with vector embeddings
- Recall — On the next LLM call, relevant memories are retrieved and injected into context
Session Management
Sessions group related conversations together. Each session has a timeout (default: 30 minutes) that determines when a new conversation starts.Sessions are automatically managed by Memori. You only need to explicitly manage sessions if you want to group or separate conversations in a specific way.