
Overview

Amp provides comprehensive observability through OpenTelemetry, exporting metrics and traces to compatible backends. The built-in telemetry stack includes support for Prometheus metrics and distributed tracing.

OpenTelemetry Configuration

Configure OpenTelemetry in the [opentelemetry] section:
[opentelemetry]
metrics_url = "http://localhost:4318/v1/metrics"
metrics_export_interval_secs = 60.0
trace_url = "http://localhost:4318/v1/traces"
trace_ratio = 1.0

Configuration Fields

opentelemetry.metrics_url
string
Remote OpenTelemetry metrics collector endpoint (binary HTTP protocol). The endpoint should accept OTLP metrics in binary (protobuf) format over HTTP.
[opentelemetry]
metrics_url = "http://localhost:4318/v1/metrics"
Environment variable override:
export AMP_CONFIG_OPENTELEMETRY__METRICS_URL="http://otel-collector:4318/v1/metrics"
opentelemetry.metrics_export_interval_secs
float
default: 60.0
Interval in seconds between metric exports to the collector. Lower values provide fresher metrics but increase network traffic.
[opentelemetry]
metrics_export_interval_secs = 30.0  # Export every 30 seconds
opentelemetry.trace_url
string
Remote OpenTelemetry traces collector endpoint (HTTP protocol). The endpoint should accept OTLP traces over HTTP.
[opentelemetry]
trace_url = "http://localhost:4318/v1/traces"
Environment variable override:
export AMP_CONFIG_OPENTELEMETRY__TRACE_URL="http://otel-collector:4318/v1/traces"
opentelemetry.trace_ratio
float
default: 1.0
Ratio of traces to sample (0.0 to 1.0).
  • 1.0: Sample all traces (100%)
  • 0.1: Sample 10% of traces
  • 0.01: Sample 1% of traces
Lower ratios reduce overhead in high-traffic environments.
[opentelemetry]
trace_ratio = 0.1  # Sample 10% of traces

Local Development with Grafana

For local development and testing, Amp provides a pre-configured Grafana telemetry stack using Docker Compose.

Starting the Stack

Start the Grafana OTEL stack:
docker-compose up -d
This runs the grafana/otel-lgtm image, which includes:
  • Grafana: Visualization and dashboards (port 3000)
  • OpenTelemetry Collector: Receives metrics and traces
  • Prometheus: Stores metrics
  • Tempo: Stores traces
Learn more about the image: grafana/docker-otel-lgtm

Connecting Amp to Grafana Stack

Configure Amp to send telemetry to the local Grafana stack:
[opentelemetry]
trace_url = "http://localhost:4317/v1/traces"
metrics_url = "http://localhost:4318/v1/metrics"
Port notes:
  • 4317: gRPC endpoint (for trace_url in this setup)
  • 4318: HTTP endpoint (for metrics_url)
Ensure your docker-compose.yaml exposes these ports (default configuration does).
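If you maintain your own compose file, the required port mappings can be sketched roughly like this (the service name and file layout here are illustrative, not the repository's actual compose file):

```yaml
# docker-compose.yaml (sketch)
services:
  otel-lgtm:
    image: grafana/otel-lgtm
    ports:
      - "3000:3000"   # Grafana UI
      - "4317:4317"   # OTLP gRPC receiver
      - "4318:4318"   # OTLP HTTP receiver
```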

Accessing Grafana

Open Grafana in your browser:
http://localhost:3000
Default credentials:
  • Username: admin
  • Password: admin (you’ll be prompted to change it)

Metrics

Amp exports comprehensive metrics for monitoring:

Server Metrics

  • Query execution: Query count, duration, errors
  • Streaming: Active streams, microbatch size/duration
  • HTTP metrics: Request count, duration, status codes (via OpenTelemetry instrumentation)

Worker Metrics

  • Extraction: Rows ingested, blocks processed, extraction duration
  • File operations: Files written, bytes written
  • Compaction: Files compacted, compaction success/failures
  • Garbage collection: Files deleted, expired files, collection success/failures

Extractor Metrics

  • EVM RPC: RPC requests, errors, duration
  • Firehose: Blocks received, stream errors, connection status
For a complete list of metrics, see the Metrics Reference.
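As a quick sanity check that metrics are flowing, you can run a rate query in Grafana Explore (Prometheus data source). The metric names below follow the amp_* naming used in the alert examples later on this page; confirm the exact names in the Metrics Reference:

```promql
# Fraction of queries that errored over the last 5 minutes
rate(amp_query_errors_total[5m]) / rate(amp_queries_total[5m])
```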

Traces

Distributed traces help debug query execution and extraction flows:

Trace Coverage

  • Query execution: Full query lifecycle from request to response
  • Extraction jobs: Block fetching, parsing, writing
  • File operations: Parquet writes, compaction, garbage collection
  • RPC calls: External provider requests

Viewing Traces in Grafana

  1. Navigate to Explore in Grafana
  2. Select Tempo as the data source
  3. Search by:
    • Trace ID: Direct trace lookup
    • Service name: Filter by component (e.g., amp-server, amp-worker)
    • Operation: Filter by operation type
    • Duration: Find slow traces
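These searches can also be written directly as TraceQL queries against the Tempo data source; for example, to find slow traces from the server component (the 500ms threshold is arbitrary):

```traceql
{ resource.service.name = "amp-server" && duration > 500ms }
```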

Custom Dashboards

Amp includes pre-configured Grafana dashboards in the grafana/dashboards/ directory.

Loading Custom Dashboards

  1. Navigate to: http://localhost:3000/dashboards/import
  2. Upload the dashboard JSON file or paste its contents
  3. Select Prometheus as the data source
  4. Click Import

Creating New Dashboards

  1. Create a dashboard in Grafana UI
  2. Click Share → Export → Save to file
  3. Save the JSON file to grafana/dashboards/ in the repository
  4. The dashboard will be automatically loaded on next docker-compose up
The docker-compose.yaml is configured to automatically load dashboards from grafana/dashboards/ on startup.
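For reference, Grafana's file-based dashboard provisioning typically looks like the sketch below; the provider name and the path inside the grafana/otel-lgtm container are illustrative and may differ from the actual image:

```yaml
# Grafana dashboard provisioning (sketch; paths are illustrative)
apiVersion: 1
providers:
  - name: amp-dashboards
    type: file
    options:
      path: /var/lib/grafana/dashboards
```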

Available Pre-Configured Dashboards

Check the grafana/dashboards/ directory in the source repository for:
  • Worker Overview: Extraction metrics, job status, throughput
  • Server Overview: Query performance, active streams, error rates
  • Storage Overview: File counts, bytes written, compaction status

Production Deployments

Using Grafana Cloud

Configure Amp to send telemetry to Grafana Cloud:
[opentelemetry]
metrics_url = "https://<instance>.grafana.net/otlp/v1/metrics"
trace_url = "https://<instance>.grafana.net/otlp/v1/traces"
Add authentication headers via OpenTelemetry Collector or use the Grafana Cloud OTLP endpoint with authentication.
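One way to attach those credentials is to run a small OpenTelemetry Collector in front of Grafana Cloud, using the basicauth extension from the collector-contrib distribution. A sketch, with the instance ID and API token left as placeholders:

```yaml
# OpenTelemetry Collector config (sketch)
extensions:
  basicauth/grafana:
    client_auth:
      username: "<instance-id>"
      password: "<api-token>"

receivers:
  otlp:
    protocols:
      http:

exporters:
  otlphttp/grafana:
    endpoint: https://<instance>.grafana.net/otlp
    auth:
      authenticator: basicauth/grafana

service:
  extensions: [basicauth/grafana]
  pipelines:
    metrics:
      receivers: [otlp]
      exporters: [otlphttp/grafana]
    traces:
      receivers: [otlp]
      exporters: [otlphttp/grafana]
```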

Using Custom OpenTelemetry Collector

Deploy your own OpenTelemetry Collector:
[opentelemetry]
metrics_url = "http://otel-collector.example.com:4318/v1/metrics"
trace_url = "http://otel-collector.example.com:4318/v1/traces"
The collector can forward to multiple backends (Prometheus, Tempo, Jaeger, etc.).
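A minimal collector pipeline that fans out to Prometheus (via remote write) and Tempo might look like the sketch below; the prometheusremotewrite exporter ships with the contrib distribution, and the endpoints are illustrative:

```yaml
# OpenTelemetry Collector fan-out pipeline (sketch)
receivers:
  otlp:
    protocols:
      http:
      grpc:

exporters:
  prometheusremotewrite:
    endpoint: http://prometheus:9090/api/v1/write
  otlp/tempo:
    endpoint: tempo:4317
    tls:
      insecure: true

service:
  pipelines:
    metrics:
      receivers: [otlp]
      exporters: [prometheusremotewrite]
    traces:
      receivers: [otlp]
      exporters: [otlp/tempo]
```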

Sampling Configuration for Production

Reduce trace volume in production:
[opentelemetry]
trace_ratio = 0.01  # Sample 1% of traces
Sampling strategies:
  • High traffic: 0.01 (1%) or lower
  • Medium traffic: 0.1 (10%)
  • Low traffic / debugging: 1.0 (100%)
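Following the environment-variable pattern shown earlier for the URL settings, the sampling ratio can presumably be overridden the same way at deploy time; note that the __TRACE_RATIO variable name is inferred from that pattern, not confirmed by the configuration reference:

```shell
# Override sampling per environment without editing the config file.
# NOTE: variable name inferred from the AMP_CONFIG_OPENTELEMETRY__* pattern.
export AMP_CONFIG_OPENTELEMETRY__TRACE_RATIO="0.01"
echo "$AMP_CONFIG_OPENTELEMETRY__TRACE_RATIO"
```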

Security Considerations

Never expose OpenTelemetry endpoints directly to the internet. Use authentication, VPCs, or private networks.
Best practices:
  • Use HTTPS endpoints for remote collectors
  • Implement authentication at the collector level
  • Restrict network access to telemetry endpoints
  • Avoid sending sensitive data in trace attributes
  • Use collector-side filtering for PII

Alerting

Set up alerts in Grafana for:

Critical Alerts

  • Extraction failures: Worker job failures
  • Database connection failures: Metadata database unavailable
  • High error rates: Query errors above threshold
  • Compaction failures: Failed compaction jobs

Warning Alerts

  • High query latency: P95 latency above threshold
  • Storage growth: Rapid increase in bytes written
  • File count growth: Too many small files (compaction needed)
  • Memory usage: High memory usage on server or workers

Example Alert Rules

groups:
  - name: amp-alerts
    rules:
      # Query error rate > 5%
      - alert: HighQueryErrorRate
        expr: rate(amp_query_errors_total[5m]) / rate(amp_queries_total[5m]) > 0.05
        for: 5m
        labels:
          severity: warning

      # Worker extraction failures
      - alert: ExtractionFailures
        expr: increase(amp_worker_job_failures_total[10m]) > 0
        for: 1m
        labels:
          severity: critical

Disabling Telemetry

To disable telemetry, simply omit the [opentelemetry] section from your config file or leave the URLs empty:
# Telemetry disabled (no opentelemetry section)
Or explicitly set empty values:
[opentelemetry]
# Empty URLs = no telemetry
metrics_url = ""
trace_url = ""

Troubleshooting

Metrics Not Appearing

  1. Check configuration: Verify metrics_url is correct
  2. Check collector logs: Ensure collector is receiving metrics
  3. Check network: Verify connectivity from Amp to collector
  4. Check interval: Metrics export every metrics_export_interval_secs

Traces Not Appearing

  1. Check configuration: Verify trace_url is correct
  2. Check sampling: Ensure trace_ratio is > 0
  3. Check collector: Verify collector is configured for traces
  4. Generate traffic: Some traces may not appear until traffic flows

High Memory Usage

If telemetry causes high memory usage:
  1. Reduce trace_ratio to sample fewer traces
  2. Increase metrics_export_interval_secs to export less frequently
  3. Check collector backpressure and buffering

Next Steps

Metrics Reference

Complete list of available metrics

Configuration Overview

Back to configuration overview
