
System Requirements

Before installing QualiVision, ensure your system meets these requirements:

Python Version

Python 3.8 or higher is required. Verify with:
python --version

GPU Support

CUDA-capable GPU recommended
  • DOVER++: ~12GB VRAM
  • V-JEPA2: ~16GB VRAM
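As a quick sanity check, you can compare your GPU's total memory against these figures. The sketch below hard-codes the VRAM numbers above (the `models_that_fit` helper is ours, not part of QualiVision), and the PyTorch usage is guarded so it also runs on machines without torch or a GPU:

```python
# Approximate VRAM needed per model (GB), taken from the figures above.
MODEL_VRAM_GB = {"DOVER++": 12, "V-JEPA2": 16}

def models_that_fit(total_vram_gb: float) -> list:
    """Return the models whose approximate VRAM requirement fits in total_vram_gb."""
    return [name for name, need in MODEL_VRAM_GB.items() if total_vram_gb >= need]

if __name__ == "__main__":
    try:
        import torch
        if torch.cuda.is_available():
            gb = torch.cuda.get_device_properties(0).total_memory / 1e9
            print(f"GPU VRAM: {gb:.1f} GB -> fits: {models_that_fit(gb)}")
        else:
            print("No CUDA GPU detected; CPU-only mode will be slow.")
    except ImportError:
        print("PyTorch is not installed yet; run the install steps below first.")
```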

Storage

~10GB for models and dependencies. Allow additional space for datasets.

Operating System

Linux, macOS, or Windows. Linux is recommended for best performance.
While CPU-only execution is supported, GPU acceleration is strongly recommended for practical evaluation and training. CPU inference can be 10-50x slower.
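In practice this means scripts should select the device at runtime rather than assume a GPU. A minimal sketch (the `pick_device` helper name is ours, not part of QualiVision's API):

```python
def pick_device() -> str:
    """Return "cuda" when a usable CUDA GPU is present, otherwise fall back to "cpu"."""
    try:
        import torch
        return "cuda" if torch.cuda.is_available() else "cpu"
    except ImportError:
        # PyTorch is not installed yet, so only CPU execution is meaningful.
        return "cpu"

print(f"Running on: {pick_device()}")
```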

Installation Methods

1. Clone the Repository

First, clone the QualiVision repository from GitHub:
git clone https://github.com/RITIK-12/QualiVision.git
cd QualiVision
This will download the complete framework including:
  • Model implementations
  • Training and evaluation scripts
  • Configuration files
  • Example notebooks
2. Install Dependencies

Install all dependencies using pip:
pip install -r requirements.txt
This is the recommended method for most users.
3. Verify Installation

Verify that QualiVision is installed correctly:
python -c "import torch; print(f'PyTorch: {torch.__version__}')"
python -c "import torch; print(f'CUDA Available: {torch.cuda.is_available()}')"
python -c "import transformers; print(f'Transformers: {transformers.__version__}')"
Expected output (exact versions will vary with your environment):
PyTorch: 2.0.0+cu118
CUDA Available: True
Transformers: 4.30.0

Dependencies

QualiVision requires the following core dependencies:
torch>=2.0.0
torchvision>=0.15.0
torchaudio>=2.0.0
PyTorch and related libraries for model training and inference.
transformers>=4.30.0
sentence-transformers>=2.2.0
For text encoding using BGE-Large and other language models.
timm>=0.9.0
opencv-python>=4.7.0
decord>=0.6.0
Pillow>=9.0.0
Image and video processing libraries.
scipy>=1.10.0
scikit-learn>=1.2.0
numpy>=1.24.0
pandas>=1.5.0
Numerical computing and data processing.
accelerate>=0.20.0
flash-attn>=2.0.0
xformers>=0.0.20
einops>=0.6.0
Memory optimization and training acceleration.
tqdm>=4.65.0
wandb>=0.15.0
matplotlib>=3.6.0
seaborn>=0.11.0
pyyaml>=6.0
Progress bars, experiment tracking, and visualization.
jupyterlab>=4.0.0
ipywidgets>=8.0.0
datasets>=2.12.0
For interactive development and data loading.
From requirements.txt in the source repository:
torch>=2.0.0
torchvision>=0.15.0
torchaudio>=2.0.0
transformers>=4.30.0
sentence-transformers>=2.2.0
timm>=0.9.0
scipy>=1.10.0
scikit-learn>=1.2.0
opencv-python>=4.7.0
decord>=0.6.0
pandas>=1.5.0
numpy>=1.24.0
Pillow>=9.0.0
tqdm>=4.65.0
wandb>=0.15.0
accelerate>=0.20.0
datasets>=2.12.0
einops>=0.6.0
flash-attn>=2.0.0
xformers>=0.0.20
pyyaml>=6.0
argparse
matplotlib>=3.6.0
seaborn>=0.11.0
jupyterlab>=4.0.0
ipywidgets>=8.0.0
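Version floors like these can be checked without importing each package, via the standard library's `importlib.metadata`. The comparison below is deliberately naive (it strips local tags like `+cu118` and compares the first three numeric parts), which is enough for the pins above; the helper names are ours:

```python
from importlib.metadata import version, PackageNotFoundError

def meets_minimum(installed: str, minimum: str) -> bool:
    """Naive compare: drop local tags (e.g. +cu118), compare first 3 numeric parts."""
    def norm(v):
        parts = v.split("+")[0].split(".")[:3]
        return tuple(int("".join(ch for ch in p if ch.isdigit()) or 0) for p in parts)
    return norm(installed) >= norm(minimum)

def check(package: str, minimum: str) -> bool:
    """Print and return whether `package` is installed at >= `minimum`."""
    try:
        installed = version(package)
    except PackageNotFoundError:
        print(f"MISSING  {package} (need >={minimum})")
        return False
    ok = meets_minimum(installed, minimum)
    print(f"{'OK' if ok else 'TOO OLD'}  {package} {installed} (need >={minimum})")
    return ok

if __name__ == "__main__":
    pins = {"torch": "2.0.0", "transformers": "4.30.0", "numpy": "1.24.0"}
    results = [check(pkg, floor) for pkg, floor in pins.items()]
    print("All pins satisfied." if all(results) else "Some pins are unmet.")
```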

GPU Setup

For NVIDIA GPUs, ensure CUDA is properly installed:
1. Check CUDA Version

nvidia-smi
This shows your GPU and CUDA driver version.
2. Install PyTorch with CUDA

Visit the PyTorch Get Started page and select your CUDA version:
# For CUDA 11.8
pip3 install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu118

# For CUDA 12.1
pip3 install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu121
3. Verify CUDA

import torch

print(f"CUDA Available: {torch.cuda.is_available()}")
if torch.cuda.is_available():
    # These calls raise on CPU-only machines, so guard them.
    print(f"CUDA Version: {torch.version.cuda}")
    print(f"Device Count: {torch.cuda.device_count()}")
    print(f"Current Device: {torch.cuda.current_device()}")
    print(f"Device Name: {torch.cuda.get_device_name(0)}")

Troubleshooting

If you encounter OOM errors:
  1. Reduce batch size:
    python scripts/evaluate.py --model dover --batch-size 1 --data data/test
    
  2. Use gradient checkpointing (for training)
  3. Close other GPU applications
  4. Use CPU if necessary:
    python scripts/evaluate.py --model dover --device cpu --data data/test
    
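The usual pattern behind step 1 is to probe batch sizes from large to small until one fits. A generic sketch of that loop (the `run_step` callable stands in for your actual forward pass; PyTorch surfaces GPU OOM as a `RuntimeError` whose message contains "out of memory"):

```python
def find_max_batch_size(run_step, candidates=(8, 4, 2, 1)):
    """Try batch sizes from largest to smallest; return the first that avoids OOM."""
    for bs in candidates:
        try:
            run_step(bs)
            return bs
        except RuntimeError as err:
            if "out of memory" not in str(err).lower():
                raise  # a real error, not OOM -- do not mask it
            # On a real GPU you would also call torch.cuda.empty_cache() here.
    return None  # nothing fit; consider falling back to CPU
```

For example, with a step that only fits at batch size 2 or below, `find_max_batch_size` returns 2.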
If flash-attn fails to install:
  1. Skip flash attention (optional dependency):
    pip install -r requirements.txt --no-deps
    pip install <packages except flash-attn>
    
  2. Or install from source:
    pip install flash-attn --no-build-isolation
    
  3. Flash attention is optional for inference; it mainly benefits training speed
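If you do skip flash-attn, code can probe for it at runtime and fall back to standard attention. A minimal sketch (the helper name is ours; "flash_attention_2" and "eager" are values recent transformers versions accept for `attn_implementation`):

```python
def flash_attn_available() -> bool:
    """True when the optional flash-attn package is importable."""
    try:
        import flash_attn  # noqa: F401
        return True
    except ImportError:
        return False

# Pick an attention backend based on what is installed.
attn_impl = "flash_attention_2" if flash_attn_available() else "eager"
print(f"Using attention implementation: {attn_impl}")
```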
If video loading fails:
  1. Ensure FFmpeg is installed:
    # Ubuntu/Debian
    sudo apt-get install ffmpeg
    
    # macOS
    brew install ffmpeg
    
    # Windows
    # Download from https://ffmpeg.org/download.html
    
  2. Test decord:
    import decord
    vr = decord.VideoReader('path/to/video.mp4')
    print(f"Frames: {len(vr)}")
    
If model downloads are slow or fail:
  1. Use Hugging Face mirror (China):
    export HF_ENDPOINT=https://hf-mirror.com
    
  2. Pre-download models:
    from transformers import AutoModel
    AutoModel.from_pretrained("BAAI/bge-large-en-v1.5")
    
  3. Set cache directory:
    export HF_HOME=/path/to/cache
    
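Note that these environment variables are generally read at import time, so they must be set before `transformers` (or `huggingface_hub`) is imported. A small helper capturing that ordering (the function name is ours; the cache path is a placeholder as above):

```python
import os

def configure_hf(cache_dir, endpoint=None):
    """Set the Hugging Face cache (HF_HOME) and optional mirror (HF_ENDPOINT).

    Call this BEFORE importing transformers/huggingface_hub, since they read
    these variables when first imported.
    """
    os.environ["HF_HOME"] = cache_dir
    if endpoint:
        os.environ["HF_ENDPOINT"] = endpoint

configure_hf("/path/to/cache", endpoint="https://hf-mirror.com")
print(os.environ["HF_HOME"])
```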

Verify Installation

Run this comprehensive verification script:
import sys
import torch
import transformers
import torchvision
import decord
import cv2
import numpy as np
import pandas as pd

print("QualiVision Installation Check")
print("=" * 50)

# Python version
print(f"Python: {sys.version}")

# PyTorch
print(f"\nPyTorch: {torch.__version__}")
print(f"CUDA Available: {torch.cuda.is_available()}")
if torch.cuda.is_available():
    print(f"CUDA Version: {torch.version.cuda}")
    print(f"GPU: {torch.cuda.get_device_name(0)}")
    print(f"GPU Memory: {torch.cuda.get_device_properties(0).total_memory / 1e9:.2f} GB")

# Transformers
print(f"\nTransformers: {transformers.__version__}")

# Other key libraries
print(f"\nTorchvision: {torchvision.__version__}")
print(f"OpenCV: {cv2.__version__}")
print(f"NumPy: {np.__version__}")
print(f"Pandas: {pd.__version__}")
print(f"Decord: {decord.__version__}")

print("\n✓ All dependencies installed successfully!")
Save as check_install.py and run:
python check_install.py

Next Steps

  • Quick Start: Run your first evaluation with pre-trained models
  • Data Preparation: Learn how to structure your dataset
  • Model Configuration: Customize model settings for your use case
  • Training Guide: Fine-tune models on custom datasets
Getting Help: If you encounter issues not covered here, please open an issue on the QualiVision GitHub repository.
