Installation
This guide walks you through setting up Alpamayo 1 on your system.Requirements
Before installing, verify that your system meets these requirements:| Requirement | Specification |
|---|---|
| Python | 3.12.x |
| GPU | NVIDIA GPU with ≥24 GB VRAM (e.g., RTX 3090, RTX 4090, A5000, H100) |
| OS | Linux (tested); other platforms unverified |
| VRAM | Minimum 24 GB for inference |
Installation steps
Install uv package manager
Install the The
uv package manager if you don’t already have it:uv package manager provides fast, reliable dependency management for Python projects.Set up the virtual environment
Create and activate a virtual environment, then sync dependencies:This will install all required dependencies from
pyproject.toml, including:- PyTorch 2.8.0
- Transformers 4.57.1
- Flash Attention 2.8.3+
- Accelerate, einops, and other core dependencies
Authenticate with HuggingFace
Alpamayo 1 requires access to gated resources on HuggingFace. First, request access to:After receiving access approval, install the HuggingFace CLI and authenticate:You’ll need to provide your HuggingFace access token, which you can create at https://huggingface.co/settings/tokens.
For more details on HuggingFace authentication, see the official documentation.
Verify installation
Test your installation by running the inference script:If the script runs successfully and outputs Chain-of-Causation reasoning and minADE metrics, your installation is complete.
The first run will download example data and model weights (22 GB). Download time depends on your network bandwidth - expect around 2.5 minutes on a 100 MB/s connection.
Key dependencies
Alpamayo 1 relies on the following core dependencies (automatically installed viauv sync):
Development dependencies
For interactive notebook usage and visualization, you can install development dependencies:matplotlib- For trajectory visualizationmediapy- For image/video displayipykernelandipywidgets- For Jupyter notebook support
Troubleshooting
Flash Attention compatibility issues
If you encounter issues with Flash Attention 2, you can switch to PyTorch’s scaled dot-product attention:CUDA out-of-memory errors
If you experience OOM errors during inference:-
Verify your GPU has at least 24 GB VRAM:
-
Reduce
num_traj_sampleswhen generating multiple trajectories - Close other GPU-intensive applications
- Ensure no other processes are using GPU memory
Python version mismatch
Alpamayo 1 requires Python 3.12.x specifically. If you have a different version:HuggingFace authentication issues
If you receive authentication errors:- Ensure you’ve requested and received access to both gated resources
- Verify your token has the necessary permissions
- Try logging out and back in:
Next steps
Now that you have Alpamayo 1 installed, you’re ready to run inference:Quickstart guide
Learn how to run inference and generate trajectory predictions