NorthStar is an observability, debugging, and evaluation platform built for AI agent development. Instrument your agents with a single decorator or context manager, and get full visibility into every trace, LLM call, tool invocation, token cost, and error — all flowing into a real-time dashboard.Documentation Index
Fetch the complete documentation index at: https://mintlify.com/sidmanale643/northstar/llms.txt
Use this file to discover all available pages before exploring further.
Quickstart
Get a traced agent running in under five minutes
Auto-Instrumentation
One-line setup for OpenAI and Anthropic
Evaluations
Run deterministic and LLM-based evals on agent runs
SDK Reference
Full reference for every public API
How NorthStar Works
Data flows from your agent application through the NorthStar SDK to a Supabase-backed ingest pipeline, and is then surfaced in a Next.js dashboard.Key Features
Trace & Span Hierarchy
Sessions → Runs → Spans → Events with automatic context propagation via
ContextVarAuto-Instrumentation
Patches OpenAI and Anthropic clients to capture messages, tool calls, tokens, and USD cost automatically
Versioned Prompts
Store, label, and pull prompt templates server-side; compile with Jinja or Python variable syntax
Eval Suite
Deterministic graders (tool sequence, regex, ground truth) plus LLM judges (rubric, faithfulness) in one framework
LLM Cost Tracking
Token counting and USD pricing via LiteLLM on every model call span
Dashboard & Playground
Real-time trace viewer, prompt playground, eval runs, and alert rules in a hosted Next.js UI