Prefer a visual interface? Try Skyvern Cloud — no code required.
Installation Options
Skyvern can be run locally or in the cloud. Choose your preferred setup:- pip install
- Docker Compose
- Skyvern Cloud
Prerequisites
- Python 3.11.x (works with 3.12, not ready for 3.13)
- NodeJS & NPM
- Rust
- VS Code with C++ dev tools and Windows SDK
Install Skyvern
Run Skyvern
Your First Task
Get your API key
Sign up at app.skyvern.com and go to Settings to copy your API key.When you make API calls, Skyvern spins up a cloud browser, executes your task with AI, and returns the results.
Run your first task
Let’s scrape the title of the #1 post on Hacker News. You only need two parameters:
prompt— Natural language instructions for what the AI should dourl— The starting page
Check the status
Since tasks run asynchronously, poll every 5 seconds until the task completes:Run states:
created— Task initialized, not yet queuedqueued— Waiting for an available browserrunning— AI is navigating and executingcompleted— Task finished successfullyfailed— Task encountered an errorterminated— Task was manually stoppedtimed_out— Task exceeded time limitcanceled— Task was cancelled before starting
SDK Usage Examples
Skyvern is a Playwright extension that adds AI-powered browser automation. It gives you the full power of Playwright with additional AI capabilities.AI-Powered Page Commands
Core AI Commands
| Command | Description |
|---|---|
page.act(prompt) | Perform actions using natural language (e.g., “Click the login button”) |
page.extract(prompt, schema) | Extract structured data from the page with optional JSON schema |
page.validate(prompt) | Validate page state, returns bool (e.g., “Check if user is logged in”) |
page.prompt(prompt, schema) | Send arbitrary prompts to the LLM with optional response schema |
Extract Structured Data
Run Locally
Run Skyvern with a browser on your own machine for development, debugging, or automating internal tools.Set up local Skyvern
- Set up your database (detects local PostgreSQL or uses Docker)
- Configure your LLM provider
- Choose browser mode (headless, headful, or connect to existing Chrome)
- Generate local API credentials
- Download the Chromium browser
Helpful Commands
Next Steps
Core Concepts
Learn about Tasks, Workflows, Browser Sessions, and more
Extract Structured Data
Define a schema to get typed JSON output from your automations
Handle Logins
Store credentials securely for sites that require authentication
Build Workflows
Chain multiple steps together for complex automations