Overview
OpenAVM Kit supports remote storage services for syncing input data and outputs across different environments. This is especially useful for:
- Sharing datasets across team members
- Backing up your work to the cloud
- Accessing public datasets maintained by organizations
- Deploying models in production environments
Supported Storage Services
OpenAVM Kit currently supports three cloud storage methods:
- Microsoft Azure: Azure Blob Storage for enterprise deployments
- Hugging Face: Hugging Face datasets for ML workflows
- SFTP: Secure FTP for traditional file servers
Environment Configuration
All cloud storage credentials must be stored in a .env file located in the notebooks/ directory.
Creating the .env File
Create a plain text file at notebooks/.env with your configuration:
You only need to provide values for the service you’re actually using. The library will automatically detect which cloud service to use based on the available environment variables.
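As a rough illustration of the detection described above, the following Python sketch picks a service from whichever variables are present. The variable names (`AZURE_STORAGE_CONNECTION_STRING`, `HF_REPO_ID`, `SFTP_HOSTNAME`) and the function `detect_cloud_service` are assumptions made for this example, not confirmed OpenAVM Kit names, and the library's actual detection order may differ.

```python
from typing import Mapping, Optional

# Hypothetical marker variables, one per service. These names are
# assumptions for illustration, not confirmed OpenAVM Kit settings.
SERVICE_MARKERS = {
    "azure": "AZURE_STORAGE_CONNECTION_STRING",
    "huggingface": "HF_REPO_ID",
    "sftp": "SFTP_HOSTNAME",
}


def detect_cloud_service(env: Mapping[str, str]) -> Optional[str]:
    """Return the first service whose marker variable is set and non-empty."""
    for service, marker in SERVICE_MARKERS.items():
        if env.get(marker, "").strip():
            return service
    return None  # no service is configured


if __name__ == "__main__":
    import os

    print(detect_cloud_service(os.environ))
```

If more than one service is configured, this sketch simply returns the first match in the order listed, which is one reason an explicit override in settings.json can be useful.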
Azure Blob Storage
Configure Azure Blob Storage for enterprise-grade cloud storage.
Required Environment Variables
Access level for your Azure account. Options:
- read_only: Download files only
- read_write: Download and upload files
The name of your Azure storage container
The connection string for your Azure storage account. Find this in the Azure Portal under your storage account’s “Access keys” section.
Example Configuration
Getting Your Connection String
- Navigate to your Storage Account in the Azure Portal
- Go to Security + networking > Access keys
- Click Show keys
- Copy one of the connection strings
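Once copied, a connection string can be sanity-checked before it goes into the .env file. The sketch below looks for the three fields named in the Troubleshooting section (AccountName, AccountKey, and DefaultEndpointsProtocol). The helper names `parse_connection_string` and `missing_keys` are made up for this example, and the check only covers account-key connection strings of the kind shown under Access keys.

```python
REQUIRED_KEYS = ("DefaultEndpointsProtocol", "AccountName", "AccountKey")


def parse_connection_string(conn_str: str) -> dict:
    """Split 'Key=Value;Key=Value' pairs into a dict."""
    parts = {}
    for segment in conn_str.split(";"):
        if not segment.strip():
            continue  # tolerate a trailing semicolon
        # Split on the first '=' only: account keys are base64
        # and may themselves end in '='.
        key, _, value = segment.partition("=")
        parts[key.strip()] = value.strip()
    return parts


def missing_keys(conn_str: str) -> list:
    """Return the required keys that are absent or empty."""
    parts = parse_connection_string(conn_str)
    return [key for key in REQUIRED_KEYS if not parts.get(key)]
```

An empty list from `missing_keys` means the string has the expected shape; it does not prove that the key itself is valid.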
Hugging Face
Configure Hugging Face for ML-focused workflows and easy dataset sharing.
Required Environment Variables
Access level for your Hugging Face account. Options:
- read_only: Download datasets only
- read_write: Download and upload datasets
The Hugging Face repository ID, in the format username/repo-name or organization/repo-name
Your Hugging Face API token. Required for private repositories or write access. Optional for public read-only access.
Example Configuration
Getting Your API Token
- Go to huggingface.co/settings/tokens
- Click New token
- Give it a descriptive name like “openavmkit”
- Select the appropriate permissions (read or write)
- Copy the token and add it to your .env file
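Before saving these values, the repository ID format and the token rule described above can be checked with a few lines of Python. This is a minimal sketch: `looks_like_repo_id` and `token_required` are hypothetical helpers, and the pattern is a simplification that does not cover every naming rule the Hugging Face Hub enforces.

```python
import re

# Simplified pattern: "owner/name" built from letters, digits, '.', '_' or '-'.
# The Hub applies further naming rules that this check does not cover.
_REPO_ID = re.compile(r"[A-Za-z0-9._-]+/[A-Za-z0-9._-]+")


def looks_like_repo_id(repo_id: str) -> bool:
    """Return True if repo_id has the username/repo-name shape."""
    return _REPO_ID.fullmatch(repo_id) is not None


def token_required(access: str, private: bool) -> bool:
    """A token is needed for write access or for private repositories."""
    return access == "read_write" or private
```

For example, `token_required("read_only", private=False)` is False, matching the case where a public dataset is only downloaded.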
SFTP (SSH File Transfer Protocol)
Configure SFTP for traditional file server setups.
Required Environment Variables
Access level for your SFTP account. Options:
- read_only: Download files only
- read_write: Download and upload files
The hostname or IP address of your SFTP server
Your SFTP username
Your SFTP password (required if not using key-based authentication)
The port number for your SFTP server
Example Configuration
Switching Between Cloud Services
If you work with multiple projects that use different cloud storage services, you can configure the service selection in your settings.json:
Force a specific cloud service. Options:
- azure
- huggingface
- sftp
Override the access level from the .env file. Options:
- read_only
- read_write
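The precedence implied here (values in settings.json win over values derived from the .env file) could look roughly like the following Python sketch. The key names `cloud`, `type`, and `access`, and the function `resolve_cloud`, are assumptions for illustration only; check the settings.json documentation for the real key names.

```python
import json
from typing import Optional, Tuple

VALID_SERVICES = {"azure", "huggingface", "sftp"}
VALID_ACCESS = {"read_only", "read_write"}


def resolve_cloud(
    settings_text: str, env_service: Optional[str], env_access: str
) -> Tuple[Optional[str], str]:
    """Apply settings.json overrides on top of values derived from .env."""
    # "cloud", "type" and "access" are hypothetical key names.
    cloud = json.loads(settings_text).get("cloud", {})
    service = cloud.get("type", env_service)
    access = cloud.get("access", env_access)
    if service is not None and service not in VALID_SERVICES:
        raise ValueError(f"unknown cloud service: {service!r}")
    if access not in VALID_ACCESS:
        raise ValueError(f"unknown access level: {access!r}")
    return service, access
```

Rejecting unknown values early keeps a typo such as "ftp" from silently falling back to a different service.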
Using Cloud Storage in Code
The cloud storage functionality is integrated into the pipeline:
Troubleshooting
Connection Errors
Azure: Invalid connection string
Verify your connection string is complete and includes AccountName, AccountKey, and DefaultEndpointsProtocol. Copy it directly from the Azure Portal.
Hugging Face: Authentication failed
For private repositories or write access, ensure your HF_TOKEN is valid and has the appropriate permissions. Regenerate the token if necessary.
SFTP: Connection timeout
Check that:
- The hostname is correct and reachable
- The port is correct (default is 22)
- Your firewall allows outbound connections on the SFTP port
- The SFTP server is running
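The first three checks above can be approximated from Python with a plain TCP connection attempt. This sketch uses only the standard library; `can_reach` is a hypothetical helper, and a successful result only shows that something is listening on the port, not that SFTP login will succeed.

```python
import socket


def can_reach(host: str, port: int = 22, timeout: float = 5.0) -> bool:
    """Return True if a TCP connection to host:port opens within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts and DNS lookup failures.
        return False
```

If `can_reach` returns False for the configured hostname and port, the problem is likely at the network level (hostname, port, or firewall) and not in the credentials.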
Permission Errors
If you encounter permission errors when uploading:
- Verify your access is set to read_write in the .env file
- Check that your account has write permissions on the remote storage
- For Hugging Face, ensure your token has write access
- For Azure, verify your connection string is from an account with write permissions
File Not Found
If files aren’t downloading:
- Verify the file exists in the remote storage
- Check that the locality slug matches exactly
- Ensure your access level allows reading
- For Hugging Face, verify the repository ID is correct
Best Practices
Next Steps
- Settings Configuration: Learn more about configuring settings.json
- Getting Started: Return to the getting started guide