Query Syntax

QMD supports query documents — multi-line queries where each line specifies a search type and query text. This gives you precise control over the search pipeline.

Grammar

Query documents use a simple line-based format:

query_document = { typed_line } ;
typed_line     = type ":" text newline ;
type           = "lex" | "vec" | "hyde" ;

Query Types

Type	Method	Description
`lex`	BM25	Keyword search with exact matching
`vec`	Vector	Semantic similarity search
`hyde`	Vector	Hypothetical document embedding

lex - Lexical Search

Keyword-based search using BM25. Supports special syntax for precision:

lex: CAP theorem consistency
lex: "machine learning" -"deep learning"
lex: auth -oauth -saml

Syntax:

Pattern	Meaning	Example
`word`	Prefix match	`perf` matches “performance”
`"phrase"`	Exact phrase	`"rate limiter"`
`-word`	Exclude term	`-sports`
`-"phrase"`	Exclude phrase	`-"test data"`

Lex queries are the only type that supports exclusions (-term) and exact phrase matching ("phrase").

vec - Semantic Search

Natural language queries using vector embeddings:

vec: how does the rate limiter handle burst traffic
vec: what is the tradeoff between consistency and availability

No special syntax — just write natural language questions or descriptions.

Vector queries do not support lex syntax. This will fail:

vec: authentication -oauth  # ❌ Negation not supported

Use lex: for exclusions instead.

hyde - Hypothetical Document Embedding

Write a hypothetical answer passage (50-100 words) representing what you expect the answer to look like:

hyde: The rate limiter uses a sliding window algorithm with a 60-second window. When a client exceeds 100 requests per minute, subsequent requests return 429 Too Many Requests.

HyDE embeddings often match better than questions because they’re closer to the actual document content.

Multi-Line Query Documents

Combine multiple query types for best results:

qmd query $'lex: rate limiter algorithm\nvec: how does rate limiting work in the API\nhyde: The API implements rate limiting using a token bucket algorithm with a 100 req/min limit.'

How it works:

Each line searches independently (BM25 or vector)
Results are fused using RRF (Reciprocal Rank Fusion)
First query gets 2× weight to preserve exact matches
Top candidates are reranked by LLM

The first query line gets double weight in RRF fusion. Put your most important query first (usually lex: for exact matches).

Expand Queries (Default)

If you don’t specify query types, QMD treats input as an expand query and generates lex, vec, and hyde variants automatically:

# These are equivalent:
qmd query "error handling best practices"
qmd query "expand: error handling best practices"

The expansion model (qmd-query-expansion-1.7B) generates:

1 lexical variant
1 semantic variant
(Original query gets 2× weight)

Expansion is the default because it’s simple and works well. Use explicit typed queries when you need precise control.

Query Document Examples

Example 1: Keyword + Semantic

qmd query $'lex: "CAP theorem"\nvec: consistency vs availability tradeoffs'

Finds documents with:

Exact phrase “CAP theorem” (lex)
Semantic similarity to consistency/availability concepts (vec)

Example 2: Exclusions + Question

qmd query $'lex: performance -optimization\nvec: how to improve query speed'

Finds documents about:

Performance but NOT optimization (lex exclusion)
Improving query speed (semantic)

Example 3: All Three Types

qmd query $'lex: authentication token\nvec: how does the auth system work\nhyde: The authentication system uses JWT tokens with a 1-hour expiry. Users authenticate via OAuth 2.0 with refresh tokens.'

Combines:

Keyword matching for “authentication token”
Semantic question about auth system
Hypothetical answer matching detailed explanations

Validation Rules

Lex Query Validation

qmd query 'lex: auth -oauth'
qmd query 'lex: "machine learning" neural'

Vec/Hyde Query Validation

qmd query 'vec: how does authentication work'
qmd query 'hyde: The system uses OAuth 2.0...'

Query Document Rules

Each typed line must use lex:, vec:, or hyde: prefix
Cannot mix expand: with typed lines
Empty lines are ignored
Leading/trailing whitespace is trimmed

MCP/HTTP API Format

When using QMD via MCP, you can pass query documents as strings:

String Format

{
  "q": "lex: CAP theorem\nvec: consistency vs availability",
  "collections": ["docs"],
  "limit": 10
}

Or structured format:

Structured Format

{
  "searches": [
    { "type": "lex", "query": "CAP theorem" },
    { "type": "vec", "query": "consistency vs availability" }
  ],
  "collections": ["docs"],
  "limit": 10
}

Both formats produce identical results.

When to Use Query Documents

Precise control over search strategy

When you want to control exactly which query types are used:

qmd query $'lex: "exact phrase"\nvec: semantic question'

Combining keyword and semantic search

When you need both exact matches and conceptual similarity:

qmd query $'lex: OAuth 2.0\nvec: how does authentication work'

Using exclusions with semantic search

When you need to exclude terms while still doing semantic search:

qmd query $'lex: machine learning -"deep learning"\nvec: statistical learning algorithms'

Testing different query strategies

During development or tuning, explicit queries help test what works best:

qmd query 'lex: API design'
qmd query 'vec: API design'
qmd query $'lex: API design\nvec: API architecture patterns'

Fusion and Ranking

Multi-line queries are fused using Reciprocal Rank Fusion (RRF):

score = Σ(1 / (k + rank + 1))  where k=60

Special weighting:

First query gets 2× weight to preserve exact matches
Top-rank bonus: Documents ranking #1 get +0.05, #2-3 get +0.02
Position-aware blending after reranking:
- Rank 1-3: 75% retrieval, 25% reranker
- Rank 4-10: 60% retrieval, 40% reranker
- Rank 11+: 40% retrieval, 60% reranker

See Search Modes for pipeline details.

Best Practices

Put lex first

Place lex: queries first to give exact matches 2× weight:

qmd query $'lex: OAuth\nvec: authentication'

Use lex for exclusions

Only lex: supports -term syntax:

qmd query 'lex: auth -oauth -saml'

Keep hyde focused

Write 50-100 word passages, not entire documents:

qmd query 'hyde: The rate limiter uses a token bucket algorithm...'

Combine types strategically

Use lex for precision, vec for recall:

qmd query $'lex: "CAP theorem"\nvec: distributed systems consistency'

Search Modes - Understanding search, vsearch, and query
Embeddings - How vec and hyde work
MCP Server - Using query syntax via MCP
CLI Reference - Query command documentation

Get Started

Core Concepts

Usage Guides

Architecture

Grammar

Query Types

lex - Lexical Search

vec - Semantic Search

hyde - Hypothetical Document Embedding

Multi-Line Query Documents

Expand Queries (Default)

Query Document Examples

Example 1: Keyword + Semantic

Example 2: Exclusions + Question

Example 3: All Three Types

Validation Rules

Lex Query Validation

Vec/Hyde Query Validation

Query Document Rules

MCP/HTTP API Format

When to Use Query Documents

Fusion and Ranking

Best Practices

Put lex first

Use lex for exclusions

Keep hyde focused

Combine types strategically

Build docs developers (and LLMs) love

Get Started

Core Concepts

Usage Guides

Architecture

Documentation Index

​Grammar

​Query Types

​lex - Lexical Search

​vec - Semantic Search

​hyde - Hypothetical Document Embedding

​Multi-Line Query Documents

​Expand Queries (Default)

​Query Document Examples

​Example 1: Keyword + Semantic

​Example 2: Exclusions + Question

​Example 3: All Three Types

​Validation Rules

​Lex Query Validation

​Vec/Hyde Query Validation

​Query Document Rules

​MCP/HTTP API Format

​When to Use Query Documents

​Fusion and Ranking

​Best Practices

Put lex first

Use lex for exclusions

Keep hyde focused

Combine types strategically

​Related

Build docs developers (and LLMs) love

Grammar

Query Types

lex - Lexical Search

vec - Semantic Search

hyde - Hypothetical Document Embedding

Multi-Line Query Documents

Expand Queries (Default)

Query Document Examples

Example 1: Keyword + Semantic

Example 2: Exclusions + Question

Example 3: All Three Types

Validation Rules

Lex Query Validation

Vec/Hyde Query Validation

Query Document Rules

MCP/HTTP API Format

When to Use Query Documents

Fusion and Ranking

Best Practices

Related