{
"url": "https://news.example.com/article",
"css_selector": "article.content",
"extraction_type": "markdown",
"main_content_only": true
}
Available Tools
get
Make stealth HTTP GET requests to fetch web pages. Best for: Low to mid protection levels, simple HTTP requests Parameters:The URL to request
Browser to impersonate (chrome, firefox, safari, etc.)
Output format:
markdown, html, or textCSS selector to extract specific content
Extract only content within
<body> tagCustom HTTP headers
Cookies to include in request
Proxy URL (format: “http://user:pass@host:port”)
Request timeout in seconds
Use real browser headers
bulk_get
Fetch multiple URLs concurrently with HTTP GET requests. Best for: Scraping multiple pages efficiently Parameters: Same asget, but accepts urls (array) instead of url (string).
List of URLs to fetch concurrently
fetch
Use Playwright browser automation for JavaScript-heavy sites. Best for: Single-page applications, sites requiring JavaScript execution Parameters:The URL to fetch
Output format:
markdown, html, or textRun browser in headless mode
Block images, fonts, media for speed boost
Wait for no network activity for 500ms
Timeout in milliseconds
Additional wait time in milliseconds
CSS selector to wait for before proceeding
State to wait for:
attached, detached, visible, hiddenUse real Chrome installation instead of Chromium
Set referer as Google search of domain
bulk_fetch
Fetch multiple URLs with browser automation concurrently. Best for: Scraping multiple JavaScript-heavy pages Parameters: Same asfetch, but accepts urls (array) instead of url (string).
Example usage:
stealthy_fetch
Advanced stealth browser automation with Cloudflare bypass. Best for: High protection sites, Cloudflare-protected pages Parameters: Allfetch parameters, plus:
Automatically solve Cloudflare challenges
Block WebRTC to prevent IP leaks
Allow WebGL (recommended for stealth)
Add noise to canvas fingerprinting
Additional Playwright context settings
bulk_stealthy_fetch
Fetch multiple protected URLs with advanced stealth. Best for: Scraping multiple Cloudflare-protected sites Parameters: Same asstealthy_fetch, but accepts urls (array) instead of url (string).
Example usage:
Response Format
All tools return a structured response:Extraction Types
- Markdown
- HTML
- Text
Converts HTML to clean Markdown format:Best for: Readable text, content processing, AI consumption
CSS Selectors
All tools support CSS selectors for targeted extraction:When
css_selector matches multiple elements, all matches are returned in the content array.Authentication
HTTP Basic Auth
Proxy Authentication
Common Patterns
Simple page fetch
Simple page fetch
Extract article content
Extract article content
Scrape SPA application
Scrape SPA application
Bypass Cloudflare
Bypass Cloudflare
Bulk scraping with stealth
Bulk scraping with stealth
Tool Selection Guide
Related Documentation
MCP Server
Learn about the MCP server
Setup Guide
Configure MCP server for AI clients