LLMObs API

The LLMObs SDK is available as tracer.llmobs. It provides methods for tracing LLM operations, annotating spans, and submitting evaluation metrics.

const tracer = require('dd-trace').init({
  llmobs: {
    mlApp: 'my-llm-app',
  },
})
const llmobs = tracer.llmobs

Properties

`enabled`

Whether LLM Observability tracing is currently enabled.

enabled: boolean

Methods

`enable(options)`

Enables LLM Observability tracing programmatically.

enable(options: LLMObsEnableOptions): void

options

LLMObsEnableOptions

required

Enable options.

Show LLMObsEnableOptions properties

mlApp

string

The name of your ML application. Also configurable via DD_LLMOBS_ML_APP.

agentlessEnabled

boolean

Set to true to disable sending data that requires a Datadog Agent. Also configurable via DD_LLMOBS_AGENTLESS_ENABLED.

tracer.llmobs.enable({ mlApp: 'my-chat-app' })

`disable()`

Disables LLM Observability tracing.

disable(): void

tracer.llmobs.disable()

`trace(options, fn)`

Instruments a function by automatically creating an LLMObs span that is activated on its scope.

trace<T>(options: LLMObsNamedSpanOptions, fn: (span: Span, done: (error?: Error) => void) => T): T

options

LLMObsNamedSpanOptions

required

LLMObs span options.

Show LLMObsNamedSpanOptions properties

name

string

required

The name of the traced operation.

kind

string

required

LLM Observability span kind. One of: agent, workflow, task, tool, retrieval, embedding, llm.

modelName

string

The invoked model name. Only used on llm and embedding spans.

modelProvider

string

The model provider (e.g., openai). Only used on llm and embedding spans. Defaults to custom.

sessionId

string

The user session ID. Required for session tracking.

mlApp

string

ML application name. Overrides the default set at initialization.

Function

required

The function to instrument. Receives the active span and an optional done callback.

returns

The return value of fn.

const response = await tracer.llmobs.trace(
  {
    kind: 'llm',
    name: 'openai.chat',
    modelName: 'gpt-4o',
    modelProvider: 'openai',
    sessionId: 'user-session-123',
  },
  async (span) => {
    const result = await openai.chat.completions.create({
      model: 'gpt-4o',
      messages: [{ role: 'user', content: 'Hello!' }],
    })

    tracer.llmobs.annotate(span, {
      inputData: [{ role: 'user', content: 'Hello!' }],
      outputData: { role: 'assistant', content: result.choices[0].message.content },
      metrics: {
        inputTokens: result.usage.prompt_tokens,
        outputTokens: result.usage.completion_tokens,
        totalTokens: result.usage.total_tokens,
      },
    })

    return result
  }
)

`wrap(options, fn)`

Wraps a function so that an LLMObs span is created automatically each time the function is called.

wrap<T = (...args: any[]) => any>(options: LLMObsNamelessSpanOptions, fn: T): T

options

LLMObsNamelessSpanOptions

required

LLMObs span options. The name field is optional — if omitted, the function name is used.

Function

required

The function to wrap.

returns

A wrapped function with the same signature.

async function callLLM(prompt) {
  const result = await openai.chat.completions.create({
    model: 'gpt-4o',
    messages: [{ role: 'user', content: prompt }],
  })
  return result.choices[0].message.content
}

const tracedCallLLM = tracer.llmobs.wrap(
  { kind: 'llm', modelName: 'gpt-4o', modelProvider: 'openai' },
  callLLM
)

const response = await tracedCallLLM('What is 2 + 2?')

`annotate(options)` / `annotate(span, options)`

Sets inputs, outputs, tags, metadata, and metrics on a given LLMObs span. With the exception of tags, this method overwrites any existing values for the provided fields.

annotate(options: AnnotationOptions): void
annotate(span: Span | undefined, options: AnnotationOptions): void

span

Span

The span to annotate. Defaults to the current active LLMObs span if not provided.

options

AnnotationOptions

required

Annotation data.

Show AnnotationOptions properties

inputData

Input data. For llm spans: a string or {content, role} message(s). For embedding spans: a string or {text, ...} object(s). For other spans: any JSON-serializable value.

outputData

Output data. For llm spans: a string or {content, role} message. For retrieval spans: {name, id, text, score} object(s). For other spans: any JSON-serializable value.

metadata

object

JSON-serializable key-value metadata relevant to the operation.

metrics

object

Numeric key-value metrics, such as { inputTokens, outputTokens, totalTokens }.

`flush()`

Flushes any remaining spans and evaluation metrics to LLM Observability.

flush(): void

// At process shutdown
process.on('beforeExit', () => {
  tracer.llmobs.flush()
})

`exportSpan(span?)`

Returns an object containing the span ID and trace ID for the given span. Used with submitEvaluation().

exportSpan(span?: Span): ExportedLLMObsSpan

span

Span

The span to export. Defaults to the current active LLMObs span.

returns

ExportedLLMObsSpan

An object with spanId and traceId strings.

const spanContext = tracer.llmobs.exportSpan(span)
console.log(spanContext.spanId)
console.log(spanContext.traceId)

`submitEvaluation(spanContext, options)`

Submits a custom evaluation metric for a span.

submitEvaluation(spanContext: ExportedLLMObsSpan, options: EvaluationOptions): void

spanContext

ExportedLLMObsSpan

required

The exported span context from exportSpan().

options

EvaluationOptions

required

Evaluation metric data.

Show EvaluationOptions properties

label

string

required

The name of the evaluation metric.

metricType

string

required

One of: categorical, score, boolean, json.

value

string | number | boolean | object

required

The metric value. Must match the metricType: string for categorical, number for score, boolean for boolean, object for json.

`annotationContext(options, fn)`

Annotates all spans (including auto-instrumented spans) created within the callback with the provided context options.

annotationContext<T>(options: AnnotationContextOptions, fn: () => T): T

options

AnnotationContextOptions

required

Context annotation options.

Show AnnotationContextOptions properties

`routingContext(options, fn)`

Executes a function within a routing context that directs all LLMObs spans to a specific Datadog organization.

routingContext<T>(options: RoutingContextOptions, fn: () => T): T

options

RoutingContextOptions

required

Routing context options.

Show RoutingContextOptions properties

ddApiKey

string

required

The Datadog API key for the target organization.

ddSite

string

The Datadog site for the target organization (e.g., datadoghq.eu).

returns

The return value of fn.

tracer.llmobs.routingContext(
  { ddApiKey: 'customer-api-key', ddSite: 'datadoghq.eu' },
  () => {
    // Spans sent to customer's Datadog org
    processCustomerRequest()
  }
)

`registerProcessor(processor)`

Registers a processor function that is called on each LLMObs span before it is sent. The processor can modify the span or return null to drop it.

registerProcessor(processor: (span: LLMObservabilitySpan) => LLMObservabilitySpan | null): void

processor

Function

required

A function receiving an LLMObservabilitySpan. Return the (possibly modified) span to include it, or null to drop it.

tracer.llmobs.registerProcessor((span) => {
  // Redact sensitive content from inputs
  span.input.forEach(msg => {
    msg.content = msg.content.replace(/\d{4}[- ]?\d{4}[- ]?\d{4}[- ]?\d{4}/g, '[REDACTED]')
  })
  return span
})

`deregisterProcessor()`

Deregisters the currently registered processor.

deregisterProcessor(): void

Span kinds

Kind	Description
`llm`	A call to a language model. Use with `modelName` and `modelProvider`.
`embedding`	A call to an embedding model. Use with `modelName` and `modelProvider`.
`retrieval`	A document retrieval operation.
`tool`	A tool or function call in an agentic workflow.
`task`	A generic processing task.
`workflow`	A multi-step workflow or pipeline.
`agent`	An autonomous agent orchestrating multiple operations.

Tracer API

OpenTelemetry

OpenTracing

Additional APIs

Properties

`enabled`

Methods

`enable(options)`

`disable()`

`trace(options, fn)`

`wrap(options, fn)`

`annotate(options)` / `annotate(span, options)`

`flush()`

`exportSpan(span?)`

`submitEvaluation(spanContext, options)`

`annotationContext(options, fn)`

`routingContext(options, fn)`

`registerProcessor(processor)`

`deregisterProcessor()`

Span kinds

Build docs developers (and LLMs) love

Tracer API

OpenTelemetry

OpenTracing

Additional APIs

Documentation Index

​Properties

​enabled

​Methods

​enable(options)

​disable()

​trace(options, fn)

​wrap(options, fn)

​annotate(options) / annotate(span, options)

​flush()

​exportSpan(span?)

​submitEvaluation(spanContext, options)

​annotationContext(options, fn)

​routingContext(options, fn)

​registerProcessor(processor)

​deregisterProcessor()

​Span kinds

Build docs developers (and LLMs) love

Properties

`enabled`

Methods

`enable(options)`

`disable()`

`trace(options, fn)`

`wrap(options, fn)`

`annotate(options)` / `annotate(span, options)`

`flush()`

`exportSpan(span?)`

`submitEvaluation(spanContext, options)`

`annotationContext(options, fn)`

`routingContext(options, fn)`

`registerProcessor(processor)`

`deregisterProcessor()`

Span kinds