
Introduction

Similarity-based RAG built on vector databases has shown significant limitations in recent AI applications, and reasoning-based, or agentic, retrieval has become an important direction. But unlike the classic RAG pipeline of embedding the query, returning the top-K chunks, and re-ranking, what should an agentic-native retrieval API look like? An agentic-native retrieval system should let you prompt for retrieval just as naturally as you chat with ChatGPT. Below, we show how the PageIndex Chat API enables this style of prompt-driven retrieval.

PageIndex Chat API

PageIndex Chat is an AI assistant that lets you chat with multiple super-long documents without worrying about context limits or context rot. It is built on PageIndex, a vectorless, reasoning-based RAG framework that delivers more transparent and reliable results, much like a human expert. You can now access PageIndex Chat through its API or SDK.

What You’ll Learn

This cookbook demonstrates a simple, minimal example of agentic retrieval with PageIndex. You will learn:
  • How to use PageIndex Chat API
  • How to prompt PageIndex Chat so it acts as a retrieval system

Setup

1. Install the PageIndex SDK

pip install --upgrade pageindex
2. Set up the PageIndex Client

from pageindex import PageIndexClient

# Get your PageIndex API key from https://dash.pageindex.ai/api-keys
PAGEINDEX_API_KEY = "YOUR_PAGEINDEX_API_KEY"
pi_client = PageIndexClient(api_key=PAGEINDEX_API_KEY)
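Hard-coding API keys in notebooks is easy to leak. A common alternative is to read the key from an environment variable; this is a general Python pattern, not a PageIndex requirement, and the variable name `PAGEINDEX_API_KEY` is just a suggestion:

```python
import os

# Read the key from the environment, falling back to a placeholder.
# Set it beforehand with: export PAGEINDEX_API_KEY="..."
api_key = os.environ.get("PAGEINDEX_API_KEY", "YOUR_PAGEINDEX_API_KEY")
```

You can then pass `api_key` to `PageIndexClient` instead of the literal string.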

Upload a Document

Download and submit a document to PageIndex:
import os, requests

pdf_url = "https://arxiv.org/pdf/2507.13334.pdf"
pdf_path = os.path.join("../data", pdf_url.split('/')[-1])
os.makedirs(os.path.dirname(pdf_path), exist_ok=True)

response = requests.get(pdf_url)
with open(pdf_path, "wb") as f:
    f.write(response.content)
print(f"Downloaded {pdf_url}")

doc_id = pi_client.submit_document(pdf_path)["doc_id"]
print('Document Submitted:', doc_id)
Output:
Downloaded https://arxiv.org/pdf/2507.13334.pdf
Document Submitted: pi-cmi34m6jy01sg0bqzofch62n8

Check Processing Status

Verify that the document has been processed:
from pprint import pprint

doc_info = pi_client.get_document(doc_id)
pprint(doc_info)

if doc_info['status'] == 'completed':
    print(f"\nDocument ready! ({doc_info['pageNum']} pages)")
elif doc_info['status'] == 'processing':
    print("\nDocument is still processing. Please wait and check again.")
Output:
{'createdAt': '2025-11-16T08:36:41.177000',
 'description': 'This survey provides a comprehensive overview and taxonomy of '
                'Context Engineering for Large Language Models, covering '
                'foundational components, system implementations, evaluation '
                'methods, and future research directions.',
 'id': 'pi-cmi1gp1hg01t20do2l3bgzwz1',
 'name': '2507.13334_19.pdf',
 'pageNum': 166,
 'status': 'completed'}

Document ready! (166 pages)
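In a script you may prefer to poll until processing finishes rather than re-running the cell by hand. Here is a minimal sketch; `wait_until_ready` is our own helper (not part of the SDK), and it accepts any callable returning a dict with a `status` key, such as `pi_client.get_document`:

```python
import time

def wait_until_ready(get_document, doc_id, poll_interval=5, timeout=300):
    """Poll document status until it completes (hypothetical helper).

    `get_document` is any callable returning a dict with a 'status' key,
    e.g. pi_client.get_document.
    """
    deadline = time.time() + timeout
    while time.time() < deadline:
        info = get_document(doc_id)
        if info["status"] == "completed":
            return info
        if info["status"] == "failed":
            raise RuntimeError(f"Processing failed for {doc_id}")
        time.sleep(poll_interval)
    raise TimeoutError(f"Document {doc_id} not ready after {timeout}s")

# Demo with a stub that flips to 'completed' on the second call:
calls = {"n": 0}
def fake_get_document(doc_id):
    calls["n"] += 1
    return {"status": "processing" if calls["n"] < 2 else "completed"}

info = wait_until_ready(fake_get_document, "pi-demo", poll_interval=0)
```

In real use you would call `wait_until_ready(pi_client.get_document, doc_id)` after submitting the document.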

Ask Questions About the Document

Use the PageIndex Chat API to ask questions:
query = "What are the evaluation methods used in this paper?"

for chunk in pi_client.chat_completions(
    messages=[{"role": "user", "content": query}],
    doc_id=doc_id,
    stream=True
):
    print(chunk, end='', flush=True)
Sample Output:
I'll help you find the evaluation methods used in this paper. Let me start by examining the document 
structure to locate the relevant sections.

Perfect! I can see there's a dedicated section on "Evaluation" (node_id: 0015) that covers pages 45-50. 
Let me extract the content from those pages to get detailed information about the evaluation methods.

Based on the comprehensive evaluation section of the paper, here are the **evaluation methods** used:

## Evaluation Framework Overview

The paper presents a comprehensive evaluation framework organized into **Component-Level Assessment** 
and **System-Level Integration Assessment**.

### 1. Component-Level Assessment (Intrinsic Evaluation)

#### Prompt Engineering Evaluation:
- **Semantic similarity metrics**
- **Response quality assessment**
- **Robustness testing** across diverse input variations
- **Contextual calibration** assessment

#### Long Context Processing Evaluation:
- **"Needle in a haystack"** evaluation paradigm
- **Multi-document reasoning tasks**
- **Position interpolation techniques** evaluation
- **Information retention, positional bias, and reasoning coherence** metrics

[...continues with detailed evaluation methods...]

Agentic Retrieval

You can easily prompt the PageIndex Chat API to act as a retrieval assistant:
retrieval_prompt = f"""
Your job is to retrieve the raw relevant content from the document based on the user's query.

Query: {query}

Return in JSON format with an array of objects containing "page" (number) and "content" (string) fields.
"""

full_response = ""

for chunk in pi_client.chat_completions(
    messages=[{"role": "user", "content": retrieval_prompt}],
    doc_id=doc_id,
    stream=True
):
    print(chunk, end='', flush=True)
    full_response += chunk
Sample Output:
I'll help you find the evaluation methods discussed in this paper. Let me start by examining the 
document structure to locate relevant sections.

Now I can see the document structure. Section 6 is dedicated to "Evaluation" and spans pages 45-50. 
Let me retrieve the content from those pages to get the detailed evaluation methods.

Perfect! I have retrieved the comprehensive evaluation section. Now let me compile the evaluation 
methods in the requested JSON format.

```json
[
  {
    "page": 45,
    "content": "## 6. Evaluation\n\nThe evaluation of context-engineered systems presents 
unprecedented challenges that transcend traditional language model assessment paradigms. These systems 
exhibit complex, multi-component architectures with dynamic, context-dependent behaviors requiring 
comprehensive evaluation frameworks..."
  },
  {
    "page": 46,
    "content": "Long context processing evaluation requires specialized metrics addressing information 
retention, positional bias, and reasoning coherence across extended sequences. The 'needle in a 
haystack' evaluation paradigm tests models' ability to retrieve specific information..."
  },
  ...
]
```

Extract the JSON Results

Parse the JSON response to extract structured retrieval results:
import json
import re

# Extract JSON from the response
json_match = re.search(r'```json\n(.*?)\n```', full_response, re.DOTALL)
if json_match:
    json_str = json_match.group(1)
    retrieved_data = json.loads(json_str)
    
    from pprint import pprint
    pprint(retrieved_data)
Output:
[{'content': '## 6. Evaluation\n'
             '\n'
             'The evaluation of context-engineered systems presents '
             'unprecedented challenges that transcend traditional language '
             'model assessment paradigms...',
  'page': 45},
 {'content': 'Long context processing evaluation requires specialized metrics '
             'addressing information retention, positional bias, and reasoning '
             'coherence across extended sequences...',
  'page': 46},
 ...
]

Key Benefits

The PageIndex Chat API provides several advantages for agentic retrieval:

  • Prompt-Driven: natural language prompts for retrieval instead of vector similarity
  • Structured Output: request specific output formats, such as JSON, for downstream processing
  • Long Documents: handle super-long documents without context limits
  • Reasoning-Based: transparent retrieval based on document structure and reasoning
