Overview
Skyvern provides comprehensive file handling capabilities including uploading files to web forms, downloading files from websites, and parsing document content (PDFs, spreadsheets, etc.) for use in workflows.File Upload
Basic File Upload
Upload files to web forms using natural language:Multiple File Upload
Upload in Workflows
Use the upload action in task prompts:File Upload Action Type
Skyvern automatically detects file upload fields:File Download
Downloading Files
Skyvern automatically captures file downloads:Download Configuration
Skyvern automatically configures download behavior:Workflow Download Actions
Download Monitoring
Skyvern waits for downloads to complete:File Parsing
Parse document content and use it in workflows:PDF Parsing
Supported File Types
- PDF: Extract text content from PDF documents
- Spreadsheets: Parse CSV, Excel files
- Images: OCR and image analysis
- Documents: Word docs, text files
File Parser Block
Use in workflow definitions:File Storage
Uploading Files to Storage
Upload files to S3/block storage:API Endpoint
File Information Schema
Advanced Use Cases
1. Invoice Processing Workflow
2. Bulk Document Upload
3. Download and Parse Pattern
4. Form Auto-Fill from Document
Best Practices
File Upload Best Practices
- Use Descriptive Prompts
- Validate File Size
- Handle Upload Errors
File Download Best Practices
- Always Check Downloaded Files
- Use Presigned URLs Quickly
File Parsing Best Practices
- Validate Parsed Content
- Handle Large Documents
Troubleshooting
File Upload Issues
File not uploading:Download Issues
Downloads not captured:- Ensure download links are clicked, not just navigation
- Check if site uses JavaScript for downloads
- Verify download completed within timeout period
Parsing Issues
Parsed content is empty:- Verify file is not encrypted or password-protected
- Check file format is supported
- Ensure file was fully downloaded before parsing
Related Documentation
- Browser Sessions - Managing browser state
- Workflows - Building multi-step automations
- Data Extraction - Extracting structured data