Gulin Brain is GuLiN’s long-term persistent memory system. While the sliding context window keeps active conversations lean, Gulin Brain stores knowledge permanently — your project structures, coding habits, credentials references, and recurring command patterns — and surfaces that knowledge automatically every time you chat. Think of it as the AI’s growing understanding of how you specifically work, built up over every interaction.Documentation Index
Fetch the complete documentation index at: https://mintlify.com/jorgeurtubiam-ship-it/Gulin_ia/llms.txt
Use this file to discover all available pages before exploring further.
Privacy first: All embeddings are computed locally using the
nomic-embed-text model via Ollama. Your memories never leave your machine. There are no cloud embedding calls, and the vector index lives entirely on your local filesystem.How It Works: Auto-RAG
Every time you send a message to the GuLiN AI, the system performs an automatic Retrieval-Augmented Generation (RAG) lookup before the request reaches the model. This is not a feature you toggle — it runs proactively on every message.- Your message is converted to a vector embedding using
nomic-embed-text. - The embedding is compared semantically against all stored memories.
- The most relevant memories are injected into the system prompt context automatically.
- The AI responds with full awareness of past knowledge — without you having to reference it explicitly.
Memory Storage Location
Memories are stored as plain Markdown files in a local directory:| Platform | Path |
|---|---|
| macOS / Linux | ~/.config/gulin/gulin/ |
| Windows | %APPDATA%\gulin\gulin\ |
Local Embeddings Setup
To enable Gulin Brain, you need Ollama installed with thenomic-embed-text model. This is a one-time setup.
Install Ollama
Download and install Ollama from ollama.com. It runs as a local server on
http://localhost:11434.Pull the embedding model
Pull the The model is approximately 274 MB and needs to be downloaded only once.
nomic-embed-text model, which GuLiN uses for all memory indexing:Start chatting with memory
Gulin Brain activates automatically once the embedding model is available. Ask GuLiN to remember something:GuLiN will call
brain_update to store this as a persistent memory. The next time you ask about the production database in any future session, the relevant memory will be retrieved and injected automatically.Memory Tools
The AI agent has three dedicated tools for interacting with Gulin Brain. You can also ask the AI to use them explicitly in natural language.brain_update
Saves a new piece of knowledge, habit, or context to persistent memory.
brain_list
Lists all memories currently stored in Gulin Brain. Useful for auditing what the AI knows about you.
brain_list and present an organized summary of stored memories.
brain_search
Performs a deep semantic vector search over all stored memories. This is the manual version of the Auto-RAG lookup — useful when you want to explicitly find what GuLiN knows about a specific topic.
Sliding Window and Memory Interaction
The active chat context is limited to the last 4 interactions (8 messages) to keep token usage low. This is intentional: Gulin Brain handles long-term retention so the sliding window can stay narrow without losing information. The relationship between the two systems:- Sliding window: What happened recently in this conversation.
- Gulin Brain: What you have taught GuLiN across all conversations and sessions.
Use Cases
Project Structure
Store the layout of a complex monorepo so the AI always knows where services, configs, and scripts live — without you explaining it every session.
Coding Preferences
Remember your style guide, preferred libraries, naming conventions, and architectural patterns so the AI generates code that fits your codebase from day one.
Infrastructure References
Store hostnames, ports, service names, and environment-specific notes so the AI can reference real infrastructure details without a manual lookup.
Recurring Commands
Capture complex multi-step command sequences and workflows so the AI can reproduce or adapt them in future sessions without starting from scratch.