Skip to main content

What is Agent Browser?

Agent Browser is a powerful headless browser automation tool designed specifically for AI agents. It provides a fast, reliable command-line interface for controlling browsers programmatically, with an architecture optimized for both performance and ease of use.

Fast & Efficient

Native Rust CLI with sub-millisecond parsing overhead for maximum performance.

AI-First Design

Built for AI agents with snapshot-ref workflow, JSON output, and optimal token usage.

Production Ready

Security features including authentication vault, domain allowlists, and action policies.

Cross-Platform

Works on macOS, Linux, and Windows with automatic fallback support.

Key Features

Snapshot-Ref Workflow

The optimal way to interact with web pages. Get a snapshot of interactive elements, then use refs to interact deterministically:
agent-browser snapshot -i
# Output shows interactive elements with refs: @e1, @e2, @e3...

agent-browser click @e2
agent-browser fill @e3 "user@example.com"

Fast Architecture

Agent Browser uses a client-daemon architecture:
  1. Rust CLI - Fast native binary for parsing and communication
  2. Node.js Daemon - Manages Playwright browser instance
  3. Automatic Fallback - Uses Node.js directly if native binary unavailable
The daemon starts automatically and persists between commands for fast subsequent operations.

Security Built-In

Opt-in security features for safe AI agent deployments:
  • Authentication Vault - Store credentials encrypted, LLM never sees passwords
  • Domain Allowlist - Restrict navigation to trusted domains
  • Action Policies - Gate destructive actions with policy files
  • Content Boundaries - Wrap page output to help LLMs distinguish trusted from untrusted content

Multiple Platforms

Run browsers anywhere:
  • Local - Chromium, Firefox, or WebKit via Playwright
  • iOS Simulator - Real Mobile Safari testing on macOS
  • Cloud Providers - Browserbase, Browser Use, Kernel
  • CDP Mode - Connect to existing browsers via Chrome DevTools Protocol

Get Started

Installation

Install Agent Browser globally, locally, or from source.

Quickstart

Get to a working example in under 2 minutes.

Core Concepts

Learn the snapshot-ref workflow and other key concepts.

AI Agents

Integrate with Claude, GPT, and other AI coding assistants.

Use Cases

AI Agent Automation

Perfect for AI agents that need to interact with web applications:
  • E2E testing with natural language instructions
  • Web scraping with intelligent navigation
  • Form filling and data entry
  • Visual regression testing

Development & Testing

Streamline your development workflow:
  • Automated testing of web applications
  • Screenshot comparison testing
  • Accessibility tree inspection
  • Network request monitoring

Production Deployments

Deploy AI agents that interact with the web:
  • Serverless environments (Vercel, AWS Lambda)
  • CI/CD pipelines
  • Monitoring and alerting systems
  • Automated workflows
Open Source: Agent Browser is Apache-2.0 licensed. Contributions welcome at github.com/vercel-labs/agent-browser

Build docs developers (and LLMs) love