Understanding costs
Skyvern costs are primarily driven by:
- LLM token usage: the largest cost factor
- Step count: more steps mean more LLM calls
- Proxy usage: residential proxies cost more than datacenter proxies
- Browser time: time spent with a browser session active
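To make the relative weight of these drivers concrete, here is a rough back-of-the-envelope model. Every rate in it is a placeholder assumption, not Skyvern pricing, and the `RunProfile` type is invented for illustration. Substitute figures from your own LLM provider and plan.

```python
from dataclasses import dataclass


@dataclass
class RunProfile:
    steps: int                    # number of agent steps in the run
    tokens_per_step: int          # average prompt + completion tokens per step
    browser_seconds: float        # time the browser session stays open
    proxy_surcharge: float = 0.0  # flat per-run proxy cost, 0 when no proxy is used


# Placeholder rates (assumptions for illustration only).
USD_PER_1K_TOKENS = 0.005
USD_PER_BROWSER_MINUTE = 0.01


def estimate_run_cost(run: RunProfile) -> dict:
    """Split an estimated run cost into LLM, browser, and proxy components."""
    llm = run.steps * run.tokens_per_step / 1000 * USD_PER_1K_TOKENS
    browser = run.browser_seconds / 60 * USD_PER_BROWSER_MINUTE
    total = llm + browser + run.proxy_surcharge
    return {"llm": llm, "browser": browser, "proxy": run.proxy_surcharge, "total": total}


estimate = estimate_run_cost(
    RunProfile(steps=10, tokens_per_step=4000, browser_seconds=60)
)
```

Because step count multiplies token usage, halving either one halves the LLM component, which is why the sections below focus on those two levers first.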
Optimize token usage
Token usage is the biggest cost driver. Here’s how to reduce it:
Use concise prompts
Problem: Verbose prompts increase token costs on every step.
Solution: Be specific but concise.
Use SECONDARY_LLM_KEY for simple tasks
What it does: Skyvern can use a lighter, cheaper model for simple operations.
Setup: Set SECONDARY_LLM_KEY to the lighter model you want Skyvern to use.
Benefit: Can reduce costs by 50-80% for straightforward tasks.
Limit screenshot scrolls
Problem: max_screenshot_scrolls captures more content but increases tokens.
Solution: Only use scrolling when necessary.
- Default: 0 (no scrolling)
- Recommendation: Use 2-5 only when content requires scrolling
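As a minimal sketch of "only when necessary", the helper below leaves max_screenshot_scrolls out of the request unless the caller asks for it, so the default of 0 applies. The `build_task_payload` function and the payload shape are hypothetical; only the max_screenshot_scrolls name comes from the text above.

```python
from typing import Optional


def build_task_payload(prompt: str, url: str, scrolls: Optional[int] = None) -> dict:
    """Build a task request body, adding max_screenshot_scrolls only on request.

    The payload shape is an assumption for illustration, not a documented schema.
    """
    payload = {"prompt": prompt, "url": url}
    if scrolls is not None and scrolls > 0:
        # Each extra scroll adds screenshot content, and therefore tokens.
        payload["max_screenshot_scrolls"] = scrolls
    return payload


short_page = build_task_payload("Extract the page title", "https://example.com")
long_page = build_task_payload(
    "Extract every row of the pricing table", "https://example.com/pricing", scrolls=3
)
```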
Use prompt caching (when available)
What it does: LLM providers cache repeated prompts to reduce costs.
How it helps: If running similar tasks repeatedly, caching can reduce costs by 50-90%.
Note: Available with certain LLM providers (Anthropic Claude, OpenAI GPT-4, etc.)
Optimize step count
Fewer steps = fewer LLM calls = lower costs and faster execution.
Combine actions in prompts
Problem: Multiple separate prompts for related actions.
Solution: Combine related actions into a single prompt.
Benefit: 1 step instead of 3+
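For illustration, here is the same job written both ways. The prompts are invented examples, and how they are submitted to Skyvern is left out on purpose.

```python
# Three separate prompts: each one is submitted and processed on its own.
separate_prompts = [
    "Open the billing page.",
    "Select the most recent invoice.",
    "Download the invoice as a PDF.",
]

# One combined prompt covering the same actions in order.
combined_prompt = (
    "Open the billing page, select the most recent invoice, "
    "and download it as a PDF."
)
```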
Add explicit completion criteria
Problem: AI doesn’t know when to stop, wasting steps.
Solution: Be explicit about completion.
Benefit: Prevents unnecessary verification steps.
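One way to make this a habit is a small helper that appends a stop condition to every prompt. This is only a sketch: the function name and the exact wording of the stop condition are assumptions, not a required Skyvern format.

```python
def with_completion_criteria(goal: str, done_when: str) -> str:
    """Append an explicit stop condition to a task prompt."""
    goal = goal.rstrip(". ")
    done_when = done_when.rstrip(". ")
    return (
        f"{goal}. COMPLETE when {done_when}. "
        "Do not take further actions after that."
    )


prompt = with_completion_criteria(
    "Submit the contact form.",
    "the confirmation message 'Thank you' is visible",
)
```

The resulting prompt states the goal once and names a single observable condition, so the agent has no reason to keep re-checking the page.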
Use workflows for multi-step processes
Problem: Single task trying to do too much.
Solution: Break into workflow blocks with clear boundaries.
Example workflow:
- Block 1: Navigate to invoices page
- Block 2: Extract list of invoices (pass to next block)
- Block 3: Loop through invoices and download each
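The three blocks above could be written down as plain data along the following lines. The keys and block type names here are hypothetical, chosen only to show the boundaries and the hand-off between blocks. They are not Skyvern's workflow schema.

```python
# Hypothetical workflow description; keys and type names are illustrative only.
workflow = {
    "title": "Download all invoices",
    "blocks": [
        {"label": "open_invoices", "type": "navigation",
         "goal": "Navigate to the invoices page"},
        {"label": "list_invoices", "type": "extraction",
         "goal": "Extract the list of invoices", "output": "invoices"},
        {"label": "download_each", "type": "loop", "over": "invoices",
         "goal": "Download the current invoice"},
    ],
}


def unresolved_inputs(wf: dict) -> list:
    """Return labels of blocks that consume an output no earlier block produces."""
    produced = set()
    missing = []
    for block in wf["blocks"]:
        source = block.get("over")
        if source is not None and source not in produced:
            missing.append(block["label"])
        if "output" in block:
            produced.add(block["output"])
    return missing


problems = unresolved_inputs(workflow)
```

Checking the hand-offs before running is cheap, and it catches a mislabelled output before any LLM call is made.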
Set appropriate max_steps
Problem: Default max_steps may be too high or too low.
Solution: Set based on task complexity.
Benefit: Prevents wasted steps on tasks that should fail fast.
Optimize browser session time
Start at the right URL
Problem: Starting at the homepage and navigating wastes steps.
Solution: Set the task URL to the page where the work actually begins.
Benefit: Faster execution, fewer steps.
Use browser profiles for logged-in states
Problem: Logging in on every run wastes time and steps.
Solution: Use browser profiles to persist login state.
Benefit: Skip login steps entirely, faster execution, lower costs.
Batch similar tasks
Problem: Creating a new browser session for each task.
Solution: Reuse a browser session for multiple tasks.
Benefit: Avoid browser startup overhead.
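Before sessions can be reused, tasks have to be grouped so that each group targets the same site. A minimal sketch of that grouping step follows. The task dict shape is an assumption, and the actual session creation and reuse calls are deliberately not shown.

```python
from collections import defaultdict
from urllib.parse import urlparse


def group_by_site(tasks: list) -> dict:
    """Group task dicts by hostname so each group can share one browser session."""
    groups = defaultdict(list)
    for task in tasks:
        groups[urlparse(task["url"]).netloc].append(task)
    return dict(groups)


tasks = [
    {"url": "https://example.com/invoices", "prompt": "Download the latest invoice"},
    {"url": "https://example.com/profile", "prompt": "Extract the billing address"},
    {"url": "https://example.org/orders", "prompt": "Extract open orders"},
]
batches = group_by_site(tasks)
```

Each key in `batches` then maps to one browser session, and the tasks under it run back to back in that session.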
Optimize proxy usage
Use datacenter proxies when possible
Cost comparison:
- Datacenter proxy: Lower cost
- Residential proxy: Higher cost
- Residential ISP proxy: Highest cost
Recommended order:
- Try without a proxy first
- Use a datacenter proxy if blocked
- Use RESIDENTIAL_ISP only if absolutely necessary (login issues, strict anti-bot)
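The escalation order above can be sketched as a loop over tiers, cheapest first. The `run` callable stands in for whatever submits the task and reports whether it got through. It is hypothetical, as is the "DATACENTER" label. Only RESIDENTIAL_ISP is named in the text.

```python
from typing import Callable, Optional

# Cheapest first. None means "no proxy". "DATACENTER" is a placeholder label.
PROXY_TIERS = [None, "DATACENTER", "RESIDENTIAL_ISP"]


def run_with_escalation(run: Callable[[Optional[str]], bool]) -> Optional[str]:
    """Call run(proxy) for each tier in cost order and return the first that succeeds.

    Returns None when the task succeeded without a proxy.
    Raises RuntimeError if every tier is blocked.
    """
    for proxy in PROXY_TIERS:
        if run(proxy):
            return proxy
    raise RuntimeError("blocked on every proxy tier")


# Stand-in for a site that blocks direct traffic but accepts datacenter IPs.
def fake_run(proxy: Optional[str]) -> bool:
    return proxy == "DATACENTER"


chosen = run_with_escalation(fake_run)
```

Recording which tier each site ends up needing lets later runs start at that tier instead of paying for failed attempts.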
Choose the right proxy location
Problem: Using an international proxy for a domestic site.
Solution: Match the proxy location to the site’s target region.
Benefit: Better performance, less likely to be blocked.
Data extraction optimization
Use specific schemas
Problem: Open-ended extraction wastes tokens.
Solution: Provide a schema that names the exact fields to extract.
Benefit: Focused extraction, consistent output, fewer tokens.
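As an illustration, a narrow JSON-Schema-style definition for invoice data might look like the following. The field names are invented for the example, and how the schema is attached to a task is not shown here.

```python
# Illustrative schema: three named fields instead of "extract everything".
invoice_schema = {
    "type": "object",
    "properties": {
        "invoice_number": {"type": "string"},
        "issue_date": {"type": "string", "description": "ISO 8601 date"},
        "total_amount": {"type": "number"},
    },
    "required": ["invoice_number", "total_amount"],
}


def missing_required(record: dict, schema: dict) -> list:
    """List required schema fields that are absent from an extracted record."""
    return [key for key in schema["required"] if key not in record]


gaps = missing_required({"invoice_number": "A-1"}, invoice_schema)
```

A fixed schema also makes results checkable: a record with gaps can be flagged immediately rather than discovered downstream.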
Extract only what you need
Problem: Extracting unnecessary data.
Solution: Only request fields you actually need.
Benefit: Less processing, smaller responses, lower costs.
Monitoring and measurement
Track key metrics
Monitor these metrics to identify optimization opportunities:
- Average step count per task type
- Token usage per run
- Task duration (start to completion)
- Failure rate by task type
- Cost per successful run
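The metrics above can be computed from a list of run records for one task type. This is a minimal sketch. The record keys (`steps`, `tokens`, `duration_s`, `cost_usd`, `succeeded`) are hypothetical and would need to be mapped from whatever your run logs contain.

```python
from statistics import mean


def summarize_runs(runs: list) -> dict:
    """Aggregate run records for one task type into the metrics listed above."""
    successes = [r for r in runs if r["succeeded"]]
    total_cost = sum(r["cost_usd"] for r in runs)
    return {
        "avg_steps": mean(r["steps"] for r in runs),
        "avg_tokens": mean(r["tokens"] for r in runs),
        "avg_duration_s": mean(r["duration_s"] for r in runs),
        "failure_rate": 1 - len(successes) / len(runs),
        # Failed runs still cost money, so divide total spend by successes.
        "cost_per_success": total_cost / len(successes) if successes else float("inf"),
    }


runs = [
    {"steps": 8, "tokens": 30000, "duration_s": 40, "cost_usd": 0.10, "succeeded": True},
    {"steps": 12, "tokens": 50000, "duration_s": 70, "cost_usd": 0.20, "succeeded": False},
]
summary = summarize_runs(runs)
```

Cost per successful run is the figure to watch, since it rises with both per-run cost and failure rate.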
Use the timeline API
What it shows: Detailed breakdown of each step.
Use this to:
- Identify which blocks take the most time
- Find blocks with excessive steps
- Optimize high-cost operations
A/B test prompt variations
Strategy: Test different prompt formulations.
Example:
- Version A: Detailed step-by-step instructions
- Version B: Concise goal-oriented prompt
Compare:
- Success rate
- Average step count
- Cost per run
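A comparison of two prompt versions on those three measures can be sketched as follows. The run data is made up for the example, and each run is represented as a hypothetical `(succeeded, steps, cost_usd)` tuple.

```python
def summarize_variant(runs: list) -> dict:
    """Compute success rate, average steps, and average cost for one prompt variant."""
    n = len(runs)
    wins = sum(1 for succeeded, _, _ in runs if succeeded)
    return {
        "success_rate": wins / n,
        "avg_steps": sum(steps for _, steps, _ in runs) / n,
        "avg_cost": sum(cost for _, _, cost in runs) / n,
    }


# Made-up results: (succeeded, steps, cost_usd) per run.
results = {
    "A_detailed": [(True, 14, 0.22), (True, 15, 0.24)],
    "B_concise": [(True, 9, 0.12), (False, 20, 0.30)],
}
comparison = {name: summarize_variant(runs) for name, runs in results.items()}
```

Look at all three numbers together: in this invented data the concise prompt is cheaper when it works, but its failed run erases the saving.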
Cost reduction checklist
Performance benchmarks
Typical performance targets:

| Task Type | Steps | Duration | Cost (approx) |
|---|---|---|---|
| Simple form fill | 5-10 | 30-60s | $0.05-0.15 |
| Login + navigation | 10-20 | 1-2 min | $0.15-0.30 |
| Data extraction | 15-30 | 2-3 min | $0.30-0.60 |
| Complex workflow | 30-50 | 3-5 min | $0.60-1.20 |
Actual costs vary based on LLM provider, model, proxy usage, and task complexity.
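The step ranges in the table can also seed a max_steps value: take the upper bound for the task type and add some headroom. The sketch below does that with integer arithmetic. The dictionary keys and the 20% default headroom are arbitrary assumptions to tune per task.

```python
# Upper bound of the "Steps" column in the table above, per task type.
TYPICAL_MAX_STEPS = {
    "simple_form_fill": 10,
    "login_navigation": 20,
    "data_extraction": 30,
    "complex_workflow": 50,
}


def suggest_max_steps(task_type: str, headroom_pct: int = 20) -> int:
    """Return the typical upper step count plus a safety margin, rounded up."""
    base = TYPICAL_MAX_STEPS[task_type]
    # Integer ceiling of base * (100 + headroom_pct) / 100.
    return (base * (100 + headroom_pct) + 99) // 100


form_limit = suggest_max_steps("simple_form_fill")
workflow_limit = suggest_max_steps("complex_workflow")
```

A task that hits this limit is a signal to inspect the run rather than to raise the limit automatically.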
Next steps
- Error Handling: Implement robust error handling
- Webhooks: Use webhooks instead of polling
- Task Parameters: Optimize task configuration
- Workflow Blocks: Build efficient workflows