Session 33 (Season 2, Episode 6 — February 2025) takes a step back from construction and focuses on retrieval: once you have a well-structured knowledge graph in Neo4j, how do you best retrieve the right context for a RAG query? This session runs a systematic, side-by-side comparison of four major retrieval strategies — vector similarity search, keyword (full-text) search, graph traversal, and hybrid combinations — evaluating them on both answer quality and response latency against the same knowledge graph and the same set of questions.Documentation Index
Fetch the complete documentation index at: https://mintlify.com/jbarrasa/goingmeta/llms.txt
Use this file to discover all available pages before exploring further.
Watch the Recording
Full live-stream replay on YouTube
Session Materials
Session slide deck (PDF)
Why Retrieval Strategy Matters
The ontology-guided KG construction pipeline from the preceding sessions produces a graph with two distinct types of content:- Structured entities and relationships — typed nodes with ontology-defined properties connected by semantically labelled relationships
- Text chunks — document fragments stored as
Chunknodes, linked to the structured entities they mention, and embedded as vectors
The Four Retrieval Strategies
1. Vector Similarity Search
Vector search identifies the most semantically similar text chunks to the query by comparing embedding vectors in the Neo4j vector index. It is the default starting point for most RAG systems.2. Full-Text (Keyword) Search
Full-text search uses a Lucene-based index over node text properties. It excels at precise term matching — names, identifiers, dates — that vector search can miss.3. Graph Traversal
Graph traversal uses the structured entity–relationship layer of the KG to answer questions directly, without relying on text chunks at all. The ontology-defined relationships become the retrieval paths.4. Hybrid Retrieval
Hybrid retrieval combines two or more of the above strategies, merging the result sets and optionally re-ranking by score. The most effective hybrid for knowledge graphs pairs vector search (for broad semantic coverage) with graph traversal (for structured fact grounding).The
MENTIONS relationship is created during KG construction — each Chunk node is linked to the structured entities extracted from it. This bidirectional link is what makes hybrid retrieval possible.Benchmarking Dimensions
The session evaluates each strategy across two primary dimensions:Answer Quality
Measured by faithfulness (is the answer grounded in retrieved content?), relevance (does it address the question?), and completeness (does it cover all relevant facts?).
Latency
Wall-clock time from query submission to LLM response. Vector search is typically fastest; multi-hop graph traversal adds index lookup and traversal costs.
Summary of Trade-offs
| Strategy | Best For | Quality | Latency |
|---|---|---|---|
| Vector Search | Conceptual/semantic questions | High for open questions | Low |
| Full-Text Search | Exact name/identifier lookups | High for precise matches | Very Low |
| Graph Traversal | Structured fact retrieval, multi-hop | Highest for factual questions | Medium |
| Hybrid | General-purpose RAG over KGs | Highest overall | Medium–High |
Practical Recommendations
Start with vector search
Vector search is the easiest baseline and performs well for most general questions. If your answers are already satisfactory, stop here.
Add full-text search for entity lookups
If your questions frequently involve specific names, identifiers, or dates, add a full-text index and route those queries to keyword search.
Enable graph traversal for structured facts
Once your KG has sufficient entity and relationship coverage, graph traversal retrieval significantly improves factual precision — especially for questions that span multiple hops.