PageIndex is a vectorless system - it achieves state-of-the-art RAG performance without using vector databases, embeddings, or semantic similarity search. This architectural choice fundamentally changes how document retrieval works.

What Does “Vectorless” Mean?

Traditional RAG systems require several vector-based components:
  • Embedding model: Converts text chunks into high-dimensional vectors
  • Vector database: Stores and indexes embeddings for similarity search
  • Similarity search: Computes distance metrics between query and document vectors
  • Chunk management: Handles splitting, overlapping, and metadata for chunks
PageIndex eliminates all of these. Instead, it uses document structure and LLM reasoning.

How PageIndex Works Without Vectors

PageIndex replaces the entire vector-based pipeline with two core components:

1. Hierarchical Tree Index

Instead of embedding chunks, PageIndex creates a semantic tree structure:
{
  "title": "Financial Stability",
  "start_index": 21,
  "end_index": 31,
  "node_id": "0006",
  "nodes": [
    {
      "title": "Monitoring Financial Vulnerabilities",
      "start_index": 22,
      "end_index": 28,
      "node_id": "0007"
    }
  ]
}
This tree is stored as plain JSON - no vector database needed.
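Because the index is just JSON, inspecting it needs nothing beyond the standard library. A minimal sketch of walking the tree above, assuming only the node fields shown here (title, start_index, end_index, node_id, and an optional nodes list):

```python
import json

def walk(node, depth=0):
    """Recursively yield (depth, node) pairs from a PageIndex-style tree."""
    yield depth, node
    for child in node.get("nodes", []):
        yield from walk(child, depth + 1)

tree = json.loads("""
{
  "title": "Financial Stability",
  "start_index": 21,
  "end_index": 31,
  "node_id": "0006",
  "nodes": [
    {
      "title": "Monitoring Financial Vulnerabilities",
      "start_index": 22,
      "end_index": 28,
      "node_id": "0007"
    }
  ]
}
""")

for depth, node in walk(tree):
    print("  " * depth + f"{node['node_id']}: {node['title']} "
          f"(pages {node['start_index']}-{node['end_index']})")
```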

2. LLM Reasoning for Retrieval

Instead of similarity search, PageIndex uses the LLM to reason about which nodes are relevant:
Query: "What financial vulnerabilities were monitored?"

LLM Reasoning:
- Scan root nodes for relevant sections
- "Financial Stability" is directly related to the query
- Examine child nodes under "Financial Stability"
- "Monitoring Financial Vulnerabilities" matches the query intent
- Retrieve pages 22-28
The LLM reads the tree structure (titles, summaries, page ranges) and makes intelligent decisions about where to look - no vector math required.
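The search loop itself can be sketched in a few lines. Note that `select_relevant` below is a stand-in for the actual LLM call: in a real system you would prompt the model with the query plus each child's title, summary, and page range. Here it does a naive keyword overlap so the sketch runs on its own:

```python
def select_relevant(query, children):
    """Stand-in for the LLM call: pick children whose titles share a word
    with the query. A real system would ask an LLM to reason over the
    titles, summaries, and page ranges instead."""
    words = set(query.lower().split())
    return [c for c in children if words & set(c["title"].lower().split())]

def tree_search(query, node):
    """Descend the tree, following nodes judged relevant, and return
    (title, start page, end page) for the matching leaves."""
    children = node.get("nodes", [])
    if not children:
        return [(node["title"], node["start_index"], node["end_index"])]
    results = []
    for child in select_relevant(query, children):
        results.extend(tree_search(query, child))
    return results

root = {
    "title": "Financial Stability", "start_index": 21, "end_index": 31,
    "node_id": "0006",
    "nodes": [{"title": "Monitoring Financial Vulnerabilities",
               "start_index": 22, "end_index": 28, "node_id": "0007"}],
}
hits = tree_search("What financial vulnerabilities were monitored?", root)
# hits → [("Monitoring Financial Vulnerabilities", 22, 28)]
```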

Why Go Vectorless?

1. Better Accuracy

PageIndex achieves 98.7% accuracy on FinanceBench without vectors. Why?
  • Vectors lose information: Compressing text into fixed-dimensional embeddings loses nuance, structure, and context.
  • Reasoning preserves understanding: LLMs can fully comprehend section titles, hierarchical relationships, and semantic meaning.
Example: A section titled “Regulatory Developments” has clear semantic meaning that’s preserved in text but compressed away in a 1536-dimensional embedding.

2. True Explainability

Vector similarity is fundamentally opaque:
Vector RAG: "Retrieved chunk 47 (cosine similarity: 0.83)"
❌ Why? What made this relevant? Unknown.
PageIndex reasoning is transparent:
PageIndex: "Retrieved 'Monitoring Financial Vulnerabilities' (pages 22-28)"
✅ Why? The query asks about vulnerabilities, and this section's title 
   directly indicates it discusses monitoring them.

3. No Infrastructure Overhead

Vector RAG requires significant infrastructure:
  • Vector databases: Pinecone, Weaviate, Qdrant, Milvus
  • Embedding APIs: OpenAI, Cohere, or self-hosted models
  • Index management: Building, updating, and maintaining vector indices
  • Dimension tuning: Choosing embedding dimensions and distance metrics
PageIndex requires:
  • JSON storage: Any file system or database
  • LLM API: For generation and reasoning (already needed for RAG)
Simplifying your stack reduces maintenance burden, costs, and potential failure points.

4. No Chunking Problems

Vector RAG must split documents into chunks for embedding:
# Traditional RAG: arbitrary chunking
# (split_text stands in for any fixed-size chunking helper)
chunks = split_text(
    text,
    chunk_size=1000,  # Arbitrary token limit
    overlap=200,      # Overlap to preserve context
)
Problems with chunking:
  • Lost boundaries: Important sections split mid-thought
  • Lost hierarchy: Parent-child relationships destroyed
  • Lost context: Chunks don’t know their position in the document
  • Parameter sensitivity: Results vary wildly with chunk size
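The parameter sensitivity is easy to see with a toy splitter: the same passage yields different chunk counts and boundaries under two equally plausible settings. (Character-based here for brevity; token-based splitters have the same boundary problem.)

```python
def split_text(text, chunk_size, overlap):
    """Toy fixed-size splitter with overlap."""
    step = chunk_size - overlap
    return [text[i:i + chunk_size] for i in range(0, len(text), step)]

doc = "The committee monitored financial vulnerabilities. " * 10

a = split_text(doc, chunk_size=100, overlap=20)
b = split_text(doc, chunk_size=150, overlap=30)
print(len(a), len(b))  # → 7 5: different settings, different boundaries
```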
PageIndex uses natural document sections:
{
  "title": "Entertainment Segment Results",
  "start_index": 1,
  "end_index": 2
}
Benefits:
  • Natural boundaries: Follows document structure
  • Preserved hierarchy: Parent-child relationships maintained
  • Full context: Exact page ranges and position
  • No parameters: Structure is inherent to the document
Chunking is one of the most challenging aspects of traditional RAG. Different chunk sizes work better for different queries, making it nearly impossible to optimize.

5. Dynamic Context Windows

Vector RAG is limited by fixed chunk sizes:
Chunk size: 1000 tokens

Always retrieve ~1000 tokens per chunk
even if you need more or less context
PageIndex adapts to natural section sizes:
[
  {"title": "Overview", "start_index": 7, "end_index": 8},
  {"title": "Supervisory Developments", "start_index": 35, "end_index": 54}
]
The first section is 1-2 pages, the second is 19 pages. PageIndex retrieves the right amount of context based on document structure, not arbitrary limits.
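You can read those spans straight off the nodes. A small illustration, assuming end_index is exclusive (which matches the 19-page figure above; the exact convention is a detail of the tree format):

```python
sections = [
    {"title": "Overview", "start_index": 7, "end_index": 8},
    {"title": "Supervisory Developments", "start_index": 35, "end_index": 54},
]

# Context size falls out of the document structure; there is
# no chunk_size parameter anywhere.
for s in sections:
    span = s["end_index"] - s["start_index"]  # pages retrieved for this node
    print(f"{s['title']}: {span} page(s)")
```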

Technical Architecture Comparison

Traditional Vector RAG Pipeline

Document → Chunking → Embedding model → Vector database → Similarity search → Generation

PageIndex Vectorless Pipeline

Document → Tree index (JSON) → LLM tree search → Page retrieval → Generation

Notice how PageIndex eliminates the embedding and vector search steps entirely, simplifying the architecture while improving performance.

What About Scalability?

Common question: “Don’t vector databases scale better than reading JSON trees?”

Answer: PageIndex scales differently, but effectively:

Vector DB Scaling

  • Millions of documents: Vector DBs excel at searching millions of embeddings
  • Per-document cost: Sub-linear lookup (O(log n) or better with approximate nearest neighbor indexing)
  • Use case: Finding similar documents across a large corpus

PageIndex Scaling

  • Single document depth: PageIndex excels at deep understanding of individual documents
  • Per-document cost: Proportional to tree depth (typically O(log n) nodes)
  • Use case: Extracting precise information from long, complex documents
Real-world reality: Most enterprise RAG applications deal with 10-10,000 documents, not millions. At this scale, PageIndex’s JSON-based approach is more than sufficient and far simpler to maintain.
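The depth claim can be made concrete with a back-of-envelope sketch. With branching factor b, reaching any of n sections takes about log_b(n) levels, and each level exposes at most b candidate children to the LLM. (b = 10 here is an assumed, illustrative branching factor, not a PageIndex constant.)

```python
import math

def nodes_examined(n_sections, branching=10):
    """Approximate nodes an LLM inspects in one tree search:
    depth levels, each exposing up to `branching` children."""
    depth = max(1, math.ceil(math.log(n_sections, branching)))
    return depth * branching

for n in (100, 1_000, 10_000):
    print(n, "sections →", nodes_examined(n), "nodes examined")
```

Even at 10,000 sections, the search touches a few dozen nodes, not the whole corpus.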

When Vectors Are Still Useful

The vectorless approach is optimal for:
  ✅ Long, structured documents (reports, manuals, textbooks)
  ✅ Professional/technical documents requiring precision
  ✅ Scenarios requiring explainability
  ✅ Complex, multi-step queries
Vectors may still be better for:
  • Semantic search over large, unstructured text corpora
  • Finding similar documents across millions of items
  • Fuzzy matching where approximate results are acceptable
  • Simple Q&A over homogeneous content
You can even combine approaches: Use PageIndex for structured documents in your corpus and vector search for unstructured notes or comments.

The AlphaGo Inspiration

PageIndex draws inspiration from AlphaGo’s success in mastering Go:
  • AlphaGo: Used tree search (MCTS) + neural networks to evaluate board positions
  • PageIndex: Uses tree search + LLM reasoning to evaluate document sections
Both demonstrate that search + reasoning can outperform pure similarity-based approaches.

Cost Implications

Removing vectors affects costs in several ways:

Eliminated Costs

  • ❌ Vector database hosting/licensing
  • ❌ Embedding API calls (millions of chunks)
  • ❌ Index maintenance and updates

New Costs

  • ✅ Tree structure generation (one-time per document)
  • ✅ LLM reasoning during retrieval (typically 2-5 tree search steps)
For most use cases, especially with long documents that would generate thousands of chunks, the vectorless approach is significantly more cost-effective.

Getting Started Without Vectors

Try PageIndex’s vectorless RAG approach:
import json

from pageindex import page_index

# Generate tree structure (no vectors needed)
result = page_index(
    doc="financial_report.pdf",
    model="gpt-4o",
    if_add_node_summary="yes"
)

# Tree is stored as JSON
with open("tree.json", "w") as f:
    json.dump(result, f)

Next Steps

Tree Structure

Deep dive into PageIndex’s hierarchical tree format

Vectorless RAG Tutorial

Build a complete RAG system without vectors

API Quickstart

Start generating tree structures via API

Vision RAG

OCR-free, vectorless RAG with page images
