Local Document Search with RAG and ChromaDB

The Local RAG feature lets you search your own documents alongside live web content. Instead of (or in addition to) querying the internet, Spy Search converts your files to Markdown, indexes them in a local ChromaDB vector store, retrieves the most relevant passages at query time, and passes them to the LLM as context.

How It Works

Document conversion — Every file in your configured directory is converted to Markdown using MarkItDown. MarkItDown handles PDF, Word, Excel, PowerPoint, plain text, and Markdown natively, preserving structure and formatting as plain text.
Chunking — The Markdown text is indexed using a 1500-character boundary. The _file_handler method iterates over every character and stores a snapshot of the accumulated text into ChromaDB at every 1500th character position. Each stored document includes metadata recording the source file path.
Embedding and indexing — ChromaDB uses an Ollama embedding function (nomic-embed-text:latest by default, served at http://localhost:11434) to convert each chunk into a vector and persist it under ./local_db.
Retrieval at query time — When the local-retrieval agent runs, it calls db.query(task, 2) to fetch the top-2 most relevant chunks via vector similarity search.
LLM synthesis — Each retrieved chunk is passed through a retrieval_prompt and sent to the LLM, which generates a structured summary. The summaries are appended to the shared data array and forwarded to the next agent in the pipeline (typically the Reporter).

The RAG agent calls db.reset() at initialisation on every run. This clears and re-creates the ChromaDB collection, ensuring the index always reflects the current contents of your files directory rather than a stale snapshot.

Enabling Local RAG

Add "local-retrieval" to the agents array in your config.json and set the db field to the directory containing your documents:

{
  "provider": "ollama",
  "model": "llama3.2",
  "agents": ["local-retrieval", "reporter"],
  "db": "./local_files/my_docs"
}

If the db key is absent, the agent defaults to ./local_files.

Managing Files via the API

Use the following endpoints to manage your document library without touching the filesystem directly.

Method	Endpoint	Description
`GET`	`/folder_list`	List all available folders
`POST`	`/create_folder`	Create a new folder
`GET`	`/select_folder?folder_name=<name>`	Set the active folder for indexing
`POST`	`/upload_file`	Upload a file (multipart form)
`POST`	`/delete_file`	Delete a file by path

For full request/response schemas, see the File Management API reference.

Upload a file

curl -X POST http://localhost:8000/upload_file \
  -F 'file=@report.pdf' \
  -F 'filepath=my_project'

Create a folder

curl -X POST http://localhost:8000/create_folder \
  -H 'Content-Type: application/json' \
  -d '{"filepath": "my_project"}'

Select a folder

curl "http://localhost:8000/select_folder?folder_name=my_project"

Supported File Types

MarkItDown handles any format it can meaningfully convert to Markdown text. Confirmed supported types include:

File type	Extension(s)
PDF	`.pdf`
Word document	`.docx`, `.doc`
Excel spreadsheet	`.xlsx`, `.xls`
PowerPoint presentation	`.pptx`, `.ppt`
Plain text	`.txt`
Markdown	`.md`, `.mdx`

Any file format that MarkItDown cannot parse will be skipped silently.

ChromaDB Configuration

Setting	Value
Database path	`./local_db`
Embedding model	`nomic-embed-text:latest`
Embedding server	`http://localhost:11434` (Ollama)
Collection name	`local_search`
Chunk boundary	Every 1500 characters
Results returned per query	2 chunks

The VectorSearch class wraps ChromaDB’s PersistentClient with allow_reset=True, so the full database can be wiped and rebuilt in a single client.reset() call.

Example: Local RAG + Report Pipeline

{
  "agents": ["local-retrieval", "reporter"],
  "db": "./local_files/research_papers"
}

curl -X POST "http://localhost:8000/report/summarise+Q3+financial+results" \
  -F 'messages=[{"role":"user","content":"summarise Q3 financial results"}]'

With this configuration, the RAG agent indexes all documents in ./local_files/research_papers, retrieves the most relevant passages, and the Reporter agent writes a structured report grounded in your own files.

Getting Started

Configuration

Core Features

Architecture

Contributing

Local Document Search with RAG and ChromaDB

How It Works

Enabling Local RAG

Managing Files via the API

Upload a file

Create a folder

Select a folder

Supported File Types

ChromaDB Configuration

Example: Local RAG + Report Pipeline

Build docs developers (and LLMs) love

Getting Started

Configuration

Core Features

Architecture

Contributing

Documentation Index

​How It Works

​Enabling Local RAG

​Managing Files via the API

​Upload a file

​Create a folder

​Select a folder

​Supported File Types

​ChromaDB Configuration

​Example: Local RAG + Report Pipeline

Build docs developers (and LLMs) love

How It Works

Enabling Local RAG

Managing Files via the API

Upload a file

Create a folder

Select a folder

Supported File Types

ChromaDB Configuration

Example: Local RAG + Report Pipeline