Skip to main content

Documentation Index

Fetch the complete documentation index at: https://mintlify.com/vrashmanyu605-eng/devops-root-cause-analysis-agent/llms.txt

Use this file to discover all available pages before exploring further.

The RCA Agent treats every observability platform as a pluggable data source. During an analysis run, enabled integrations are queried in parallel — each fetching its signals for the configured time window — and their results are merged into a unified evidence payload before the LLM reasoning step. Credentials for each integration are supplied through environment variables; which integrations are actually active at runtime is controlled separately in agent.yaml under the sources list. This separation means you can add credentials for an integration without activating it until you’re ready.
To enable or disable specific integrations at runtime without removing credentials, edit the analysis.sources list in agent.yaml. Only sources listed there are queried during analysis. See Agent Settings for the full configuration reference.

The agent supports two metrics backends. Configure one or both; the agent queries whichever are listed under analysis.sources.

Prometheus

Prometheus is queried via its HTTP API. The agent executes PromQL expressions for CPU, memory, error-rate, and latency metrics over the analysis window and identifies time-series anomalies before passing them to the LLM.
.env
PROMETHEUS_URL=http://prometheus:9090
PROMETHEUS_BEARER_TOKEN=
PROMETHEUS_URL
string
required
Base URL of your Prometheus server, without a trailing slash. For Prometheus deployed inside a Kubernetes cluster, use the in-cluster service address (e.g. http://prometheus-operated.monitoring.svc.cluster.local:9090).
PROMETHEUS_BEARER_TOKEN
string
Bearer token for Prometheus endpoints protected by token-based authentication (common on managed Prometheus services). Leave empty for unauthenticated instances. The token is sent as Authorization: Bearer <token> on every request.
To verify connectivity, run a quick instant-query against the Prometheus HTTP API:
curl "${PROMETHEUS_URL}/api/v1/query" \
  --data-urlencode 'query=up' \
  -H "Authorization: Bearer ${PROMETHEUS_BEARER_TOKEN}"
A healthy response returns "status": "success" with a result array of scrape targets.

Datadog

When Datadog is enabled, the agent uses the Datadog Metrics API to retrieve timeseries data and the Events API to pull correlated Datadog monitors that fired during the analysis window.
.env
DATADOG_API_KEY=dd_api_...
DATADOG_APP_KEY=dd_app_...
DATADOG_SITE=datadoghq.com
DATADOG_API_KEY
string
required
Datadog API key, found under Organization Settings → API Keys. This key authorizes read access to metrics, logs, and events.
DATADOG_APP_KEY
string
required
Datadog Application key, found under Organization Settings → Application Keys. Required in addition to the API key to query metric data programmatically.
DATADOG_SITE
string
default:"datadoghq.com"
The Datadog regional site your account is hosted on. Use datadoghq.eu for EU accounts, us3.datadoghq.com for US3, or ddog-gov.com for GovCloud. This value determines the base URL for all API calls.

Adding Custom Connectors

The integrations above cover the most common observability stacks, but your environment may use different tools. See the Custom Data Sources guide to learn how to implement the DataSource interface and register a new connector with the agent’s plugin system.

Build docs developers (and LLMs) love