HandstagesAgentToolHandlers interface reference

HandstagesAgentToolHandlers is the interface you implement to connect the tool definitions in handstagesAgentTools to an actual running browser. Each method in the interface receives a typed input object (inferred from the tool’s Zod schema) and returns a typed output object. You pass your implementation’s methods as the execute function on each tool when setting up the Vercel AI SDK.

Supporting types

`HandstagesAgentContext`

HandstagesAgentContext represents the browser context that your handler implementation typically holds onto. It maps directly to the context object you get from V3.connectLocal().

import type { HandstagesAgentContext } from "@handstage/agent"

interface HandstagesAgentContext {
  pages(): Page[]
  activePage(): Page | undefined
  setActivePage(page: Page): void
  newPage(url?: string): Promise<Page>
}

Method	Description
`pages()`	Returns all currently open `Page` instances in the context.
`activePage()`	Returns the foreground `Page`, or `undefined` if none is active.
`setActivePage(page)`	Brings the given `Page` to the foreground.
`newPage(url?)`	Opens a new `Page`, optionally navigating to `url` immediately.

`HandstagesAgent` namespace

The HandstagesAgent namespace re-exports typed input and output types for every tool, inferred directly from the Zod schemas in handstagesAgentTools. Use these types to annotate your handler methods without manually writing interface types.

import type { HandstagesAgent } from "@handstage/agent"

Type	Description
`HandstagesAgent.ToolName`	Union of all 17 tool name strings.
`HandstagesAgent.PagesInput`	Input for `pages` — `{}`
`HandstagesAgent.PagesOutput`	Output for `pages` — `{ pages: PageEntry[] }`
`HandstagesAgent.PageEntry`	Single entry in `PagesOutput.pages`
`HandstagesAgent.NewPageInput`	Input for `newPage` — `{ url?: string }`
`HandstagesAgent.NewPageOutput`	Output for `newPage` — `{ pageId: string }`
`HandstagesAgent.SetActivePageInput`	Input for `setActivePage` — `{ pageId: string }`
`HandstagesAgent.SetActivePageOutput`	Output for `setActivePage` — `{ ok: true } \| { ok: false; error: string }`
`HandstagesAgent.GotoInput`	Input for `goto`
`HandstagesAgent.GotoOutput`	Output for `goto`
`HandstagesAgent.ReloadInput`	Input for `reload`
`HandstagesAgent.ReloadOutput`	Output for `reload`
`HandstagesAgent.GoBackInput`	Input for `goBack`
`HandstagesAgent.GoBackOutput`	Output for `goBack`
`HandstagesAgent.GoForwardInput`	Input for `goForward`
`HandstagesAgent.GoForwardOutput`	Output for `goForward`
`HandstagesAgent.SnapshotInput`	Input for `snapshot`
`HandstagesAgent.SnapshotOutput`	Output for `snapshot`
`HandstagesAgent.PageInfoInput`	Input for `pageInfo`
`HandstagesAgent.PageInfoOutput`	Output for `pageInfo`
`HandstagesAgent.ClickInput`	Input for `click`
`HandstagesAgent.ClickOutput`	Output for `click`
`HandstagesAgent.HoverInput`	Input for `hover`
`HandstagesAgent.HoverOutput`	Output for `hover`
`HandstagesAgent.ScrollInput`	Input for `scroll`
`HandstagesAgent.ScrollOutput`	Output for `scroll`
`HandstagesAgent.TypeInput`	Input for `type`
`HandstagesAgent.TypeOutput`	Output for `type`
`HandstagesAgent.ClickOnInput`	Input for `click_on`
`HandstagesAgent.ClickOnOutput`	Output for `click_on`
`HandstagesAgent.FillOnInput`	Input for `fill_on`
`HandstagesAgent.FillOnOutput`	Output for `fill_on`
`HandstagesAgent.TypeOnInput`	Input for `type_on`
`HandstagesAgent.TypeOnOutput`	Output for `type_on`
`HandstagesAgent.HoverOnInput`	Input for `hover_on`
`HandstagesAgent.HoverOnOutput`	Output for `hover_on`
`HandstagesAgent.OkResult`	`{ ok: true }`
`HandstagesAgent.ErrResult`	`{ ok: false; error: string }`

`HandstagesAgentToolHandlers` interface

Each method in HandstagesAgentToolHandlers corresponds to one of the 17 tools in handstagesAgentTools. Your implementation is responsible for translating tool inputs into Handstage browser operations and returning the correct output shape.

import type { HandstagesAgentToolHandlers } from "@handstage/agent"

pages(input)

Returns all open tabs in the browser context.Signature

pages(input: HandstagesAgent.PagesInput): Promise<HandstagesAgent.PagesOutput>

Input

input

{}

No fields required.

Output

pages

object[]

required

Array of tab entries, each with pageId, url, title, and activated.

newPage(input)

Opens a new browser tab, optionally navigating to a URL.Signature

newPage(input: HandstagesAgent.NewPageInput): Promise<HandstagesAgent.NewPageOutput>

Input

url

string

Optional starting URL. Defaults to "about:blank".

Output

pageId

string

required

Unique identifier for the new tab.

setActivePage(input)

Brings a tab to the foreground.Signature

setActivePage(input: HandstagesAgent.SetActivePageInput): Promise<HandstagesAgent.SetActivePageOutput>

Input

pageId

string

required

ID of the tab to focus.

Output

true | false

required

true on success; false on failure.

error

string

Error message when ok is false.

goto(input)

Navigates a tab to a URL.Signature

goto(input: HandstagesAgent.GotoInput): Promise<HandstagesAgent.GotoOutput>

Input

pageId

string

required

Target tab ID.

url

string

required

Destination URL.

waitUntil

"load" | "domcontentloaded" | "networkidle"

Lifecycle event to await.

timeoutMs

number

Navigation timeout in milliseconds.

Output

true | false

required

true on success; false on failure.

url

string

Final URL after navigation. Only present when ok is true.

error

string

Error message when ok is false.

reload(input)

Reloads the current document in a tab.Signature

reload(input: HandstagesAgent.ReloadInput): Promise<HandstagesAgent.ReloadOutput>

Input

pageId

string

required

Target tab ID.

waitUntil

"load" | "domcontentloaded" | "networkidle"

Lifecycle event to await.

timeoutMs

number

Reload timeout in milliseconds.

ignoreCache

boolean

Pass true to bypass the browser cache.

Output

true | false

required

true on success; false on failure.

url

string

URL after reload. Only present when ok is true.

error

string

Error message when ok is false.

goBack(input)

Goes back one step in a tab’s session history.Signature

goBack(input: HandstagesAgent.GoBackInput): Promise<HandstagesAgent.GoBackOutput>

Input

pageId

string

required

Target tab ID.

waitUntil

"load" | "domcontentloaded" | "networkidle"

Lifecycle event to await after navigation.

timeoutMs

number

Timeout in milliseconds.

Output

true | false

required

true on success; false on failure.

navigated

boolean

Whether the tab actually navigated back. Only present when ok is true.

url

string

Current URL after the operation. Only present when ok is true.

error

string

Error message when ok is false.

goForward(input)

Goes forward one step in a tab’s session history.Signature

goForward(input: HandstagesAgent.GoForwardInput): Promise<HandstagesAgent.GoForwardOutput>

Input

pageId

string

required

Target tab ID.

waitUntil

"load" | "domcontentloaded" | "networkidle"

Lifecycle event to await after navigation.

timeoutMs

number

Timeout in milliseconds.

Output

true | false

required

true on success; false on failure.

navigated

boolean

Whether the tab actually navigated forward. Only present when ok is true.

url

string

Current URL after the operation. Only present when ok is true.

error

string

Error message when ok is false.

snapshot(input)

Captures the accessibility tree for a tab.Signature

snapshot(input: HandstagesAgent.SnapshotInput): Promise<HandstagesAgent.SnapshotOutput>

Input

pageId

string

required

Target tab ID.

includeIframes

boolean

Whether to include nodes from embedded iframes.

Output

true | false

required

true on success; false on failure.

tree

string

Accessibility tree as multiline text. Only present when ok is true.

xpathMap

Record<string, string>

Maps encoded node IDs to XPath selectors. Only present when ok is true.

urlMap

Record<string, string>

Maps encoded node IDs to link href values. Only present when ok is true.

error

string

Error message when ok is false.

pageInfo(input)

Returns the current URL and document title for a tab.Signature

pageInfo(input: HandstagesAgent.PageInfoInput): Promise<HandstagesAgent.PageInfoOutput>

Input

pageId

string

required

Target tab ID.

Output

true | false

required

true on success; false on failure.

url

string

Current URL. Only present when ok is true.

title

string

Document title. Only present when ok is true.

error

string

Error message when ok is false.

click(input)

Dispatches a mouse click at viewport coordinates.Signature

click(input: HandstagesAgent.ClickInput): Promise<HandstagesAgent.ClickOutput>

Input

pageId

string

required

Target tab ID.

number

required

Horizontal coordinate in CSS pixels.

number

required

Vertical coordinate in CSS pixels.

button

"left" | "right" | "middle"

Mouse button. Defaults to "left".

clickCount

number

Number of clicks. Positive integer.

Output

true | false

required

true on success; false on failure.

xpathAtPoint

string

XPath of the element at the clicked point, if available. Only present when ok is true.

error

string

Error message when ok is false.

hover(input)

Moves the pointer to viewport coordinates.Signature

hover(input: HandstagesAgent.HoverInput): Promise<HandstagesAgent.HoverOutput>

Input

pageId

string

required

Target tab ID.

number

required

Horizontal coordinate in CSS pixels.

number

required

Vertical coordinate in CSS pixels.

Output

true | false

required

true on success; false on failure.

xpathAtPoint

string

XPath of the element at the pointer position. Only present when ok is true.

error

string

Error message when ok is false.

scroll(input)

Dispatches a mouse wheel event at viewport coordinates.Signature

scroll(input: HandstagesAgent.ScrollInput): Promise<HandstagesAgent.ScrollOutput>

Input

pageId

string

required

Target tab ID.

number

required

Horizontal coordinate of the wheel event in CSS pixels.

number

required

Vertical coordinate of the wheel event in CSS pixels.

deltaX

number

required

Horizontal scroll delta in pixels.

deltaY

number

required

Vertical scroll delta in pixels.

Output

true | false

required

true on success; false on failure.

xpathAtPoint

string

XPath of the element at the scroll position. Only present when ok is true.

error

string

Error message when ok is false.

type(input)

Types text at the currently focused element using key events.Signature

type(input: HandstagesAgent.TypeInput): Promise<HandstagesAgent.TypeOutput>

Input

pageId

string

required

Target tab ID.

text

string

required

Text to type.

delay

number

Milliseconds between keystrokes. Non-negative.

withMistakes

boolean

Simulate human-like typing with occasional errors and corrections.

Output

true | false

required

true on success; false on failure.

error

string

Error message when ok is false.

click_on(input)

Clicks the first element matching a CSS or XPath selector.Signature

click_on(input: HandstagesAgent.ClickOnInput): Promise<HandstagesAgent.ClickOnOutput>

Input

pageId

string

required

Target tab ID.

select

string

required

CSS selector or XPath expression (e.g., //button[@id='submit']).

Output

true | false

required

true on success; false on failure.

error

string

Error message when ok is false.

fill_on(input)

Clears and fills an input element matched by a CSS or XPath selector.Signature

fill_on(input: HandstagesAgent.FillOnInput): Promise<HandstagesAgent.FillOnOutput>

Input

pageId

string

required

Target tab ID.

select

string

required

CSS selector or XPath expression targeting the input element.

value

string

required

New value to set.

Output

true | false

required

true on success; false on failure.

error

string

Error message when ok is false.

type_on(input)

Focuses an element by selector, then types text into it using key events.Signature

type_on(input: HandstagesAgent.TypeOnInput): Promise<HandstagesAgent.TypeOnOutput>

Input

pageId

string

required

Target tab ID.

select

string

required

CSS selector or XPath expression targeting the element.

text

string

required

Text to type.

delay

number

Milliseconds between keystrokes. Non-negative.

Output

true | false

required

true on success; false on failure.

error

string

Error message when ok is false.

hover_on(input)

Moves the pointer to the first element matching a CSS or XPath selector.Signature

hover_on(input: HandstagesAgent.HoverOnInput): Promise<HandstagesAgent.HoverOnOutput>

Input

pageId

string

required

Target tab ID.

select

string

required

CSS selector or XPath expression targeting the element.

Output

true | false

required

true on success; false on failure.

error

string

Error message when ok is false.

Example implementation

Below is a minimal but complete implementation of HandstagesAgentToolHandlers. It holds a reference to a HandstagesAgentContext and delegates each method to the underlying Handstage browser APIs.

import type {
  HandstagesAgent,
  HandstagesAgentContext,
  HandstagesAgentToolHandlers,
} from "@handstage/agent"
import type { Page } from "@handstage/core"

class MyBrowserHandlers implements HandstagesAgentToolHandlers {
  constructor(private ctx: HandstagesAgentContext) {}

  async pages(_input: HandstagesAgent.PagesInput): Promise<HandstagesAgent.PagesOutput> {
    return {
      pages: this.ctx.pages().map((p) => ({
        pageId: p.targetId,
        url: p.url(),
        title: p.title(),
        activated: p === this.ctx.activePage(),
      })),
    }
  }

  async newPage(input: HandstagesAgent.NewPageInput): Promise<HandstagesAgent.NewPageOutput> {
    const page = await this.ctx.newPage(input.url)
    return { pageId: page.targetId }
  }

  async setActivePage(
    input: HandstagesAgent.SetActivePageInput,
  ): Promise<HandstagesAgent.SetActivePageOutput> {
    const page = this.ctx.pages().find((p) => p.targetId === input.pageId)
    if (!page) return { ok: false, error: `No page with id ${input.pageId}` }
    this.ctx.setActivePage(page)
    return { ok: true }
  }

  async goto(input: HandstagesAgent.GotoInput): Promise<HandstagesAgent.GotoOutput> {
    const page = this.getPage(input.pageId)
    if (!page) return { ok: false, error: `No page with id ${input.pageId}` }
    try {
      await page.goto(input.url, {
        waitUntil: input.waitUntil,
        timeout: input.timeoutMs,
      })
      return { ok: true, url: page.url() }
    } catch (err) {
      return { ok: false, error: String(err) }
    }
  }

  // ... implement remaining 13 methods following the same pattern

  private getPage(pageId: string): Page | undefined {
    return this.ctx.pages().find((p) => p.targetId === pageId)
  }
}

Complete browser agent example

The following example shows how to wire everything together: create a Handstage browser, implement the handlers, attach execution to each tool, and run a browser automation task with the Vercel AI SDK.

import { generateText, tool } from "ai"
import { openai } from "@ai-sdk/openai"
import { V3 } from "@handstage/core"
import {
  handstagesAgentTools,
  type HandstagesAgent,
  type HandstagesAgentContext,
  type HandstagesAgentToolHandlers,
} from "@handstage/agent"

// Step 1 — connect to a local Chrome instance
const browser = await V3.connectLocal()
const ctx: HandstagesAgentContext = browser.context

// Step 2 — implement HandstagesAgentToolHandlers
class BrowserHandlers implements HandstagesAgentToolHandlers {
  constructor(private ctx: HandstagesAgentContext) {}

  async pages(_: HandstagesAgent.PagesInput): Promise<HandstagesAgent.PagesOutput> {
    return {
      pages: this.ctx.pages().map((p) => ({
        pageId: p.targetId,
        url: p.url(),
        title: p.title(),
        activated: p === this.ctx.activePage(),
      })),
    }
  }

  async newPage(input: HandstagesAgent.NewPageInput): Promise<HandstagesAgent.NewPageOutput> {
    const page = await this.ctx.newPage(input.url)
    return { pageId: page.targetId }
  }

  async goto(input: HandstagesAgent.GotoInput): Promise<HandstagesAgent.GotoOutput> {
    const page = this.ctx.pages().find((p) => p.targetId === input.pageId)
    if (!page) return { ok: false, error: "Page not found" }
    try {
      await page.goto(input.url, { waitUntil: input.waitUntil, timeout: input.timeoutMs })
      return { ok: true, url: page.url() }
    } catch (err) {
      return { ok: false, error: String(err) }
    }
  }

  async snapshot(input: HandstagesAgent.SnapshotInput): Promise<HandstagesAgent.SnapshotOutput> {
    const page = this.ctx.pages().find((p) => p.targetId === input.pageId)
    if (!page) return { ok: false, error: "Page not found" }
    try {
      const result = await page.snapshot({ includeIframes: input.includeIframes })
      return { ok: true, tree: result.tree, xpathMap: result.xpathMap, urlMap: result.urlMap }
    } catch (err) {
      return { ok: false, error: String(err) }
    }
  }

  // ... implement remaining methods
}

const handlers = new BrowserHandlers(ctx)

// Step 3 — attach execute functions to each tool
const executableTools = {
  ...handstagesAgentTools,
  pages: tool({ ...handstagesAgentTools.pages, execute: (i) => handlers.pages(i) }),
  newPage: tool({ ...handstagesAgentTools.newPage, execute: (i) => handlers.newPage(i) }),
  goto: tool({ ...handstagesAgentTools.goto, execute: (i) => handlers.goto(i) }),
  snapshot: tool({ ...handstagesAgentTools.snapshot, execute: (i) => handlers.snapshot(i) }),
  // ... attach remaining tools
}

// Step 4 — run the automation task
const result = await generateText({
  model: openai("gpt-4o"),
  tools: executableTools,
  maxSteps: 30,
  system: "You are a browser automation agent. Use the provided tools to complete tasks.",
  prompt: "Open https://example.com, read the page title, and return it.",
})

console.log(result.text)

await browser.close()

The execute functions are the only thing that separates handstagesAgentTools (schema only) from a fully runnable tool set. Keep your handler class separate from the AI SDK wiring so you can test each method in isolation.

@handstage/core

@handstage/agent

Documentation Index

​Supporting types

​HandstagesAgentContext

​HandstagesAgent namespace

​HandstagesAgentToolHandlers interface

​Example implementation

​Complete browser agent example

Build docs developers (and LLMs) love

Supporting types

`HandstagesAgentContext`

`HandstagesAgent` namespace

`HandstagesAgentToolHandlers` interface

Example implementation

Complete browser agent example