Non-geospatial datasets are pre-defined benchmark datasets without coordinate information. They are designed for standard computer vision tasks using PyTorch’s DataLoader.Documentation Index
Fetch the complete documentation index at: https://mintlify.com/torchgeo/torchgeo/llms.txt
Use this file to discover all available pages before exploring further.
Base Classes
NonGeoDataset
Abstract base class for datasets lacking geospatial information.- Integer indexing (not spatial queries)
- Pre-defined image chips
- Compatible with PyTorch DataLoader
- Suitable for benchmarking and competitions
Retrieve sample by integer index:
dataset[0]Return number of samples in dataset
NonGeoClassificationDataset
Base class for classification datasets organized in folder structure (one folder per class).Root directory containing class subdirectories
Transform function applied to each sample
Function to load images from file paths
Function to filter valid image files
Common Parameters
Most non-geospatial datasets support these parameters:Root directory where dataset is stored or will be downloaded
Dataset split: typically ‘train’, ‘val’, or ‘test’
Transform function for data augmentation
If True, download the dataset if not found locally
If True, verify file integrity using MD5 checksums
Classification Datasets
Benchmark datasets for image classification tasks.EuroSAT
Sentinel-2 satellite imagery for land use classification.- 27,000 labeled images
- 13 spectral bands (Sentinel-2)
- 64x64 pixel images
- 10 classes: Annual Crop, Forest, Herbaceous Vegetation, Highway, Industrial, Pasture, Permanent Crop, Residential, River, Sea/Lake
EuroSAT: Full 13-band versionEuroSAT100: 100-sample subset for testingEuroSATSpatial: Spatial split variant
RESISC45
High-resolution remote sensing image scene classification.- 31,500 images
- RGB aerial imagery
- 256x256 pixels
- 45 scene classes (airplane, beach, bridge, forest, etc.)
UC Merced
Land use classification from high-resolution imagery.- 2,100 images
- RGB aerial imagery
- 256x256 pixels
- 21 classes (agricultural, airplane, baseball diamond, beach, etc.)
Additional Classification Datasets
Satellite Imagery:BigEarthNet: Large-scale Sentinel-2 benchmark (590,326 patches)BigEarthNetV2: Updated version with corrected labelsSo2Sat: Sentinel-1 & Sentinel-2 for local climate zonesPatternNet: 38-class remote sensing scene classificationMillionAID: Large-scale scene classification (1M images)
ADVANCE: Aerial scene understandingAID: Aerial Image Dataset (10,000 images, 30 classes)
CV4AKenyaCropType: Crop type classification (Kenya)SouthAfricaCropType: Crop type classification (South Africa)SSL4EOL: Self-supervised learning benchmarkTreeSatAI: Tree species classification
Segmentation Datasets
Benchmark datasets for semantic segmentation tasks.Inria Aerial Image Labeling
Building segmentation from aerial imagery.- RGB aerial imagery + binary building masks
- 5000x5000 pixel tiles
- 180 cities across the world
- Binary segmentation (building/background)
DeepGlobe Land Cover
Multi-class land cover segmentation.- RGB satellite imagery
- 2448x2448 pixels
- 7 classes: Urban, Agriculture, Rangeland, Forest, Water, Barren, Unknown
LoveDA
Urban/rural scene segmentation.- RGB imagery
- 1024x1024 pixels
- 7 classes: Background, Building, Road, Water, Barren, Forest, Agriculture
- Urban and rural scenes
Potsdam2D
High-resolution urban segmentation.- RGB + NIR imagery
- 6000x6000 pixel tiles
- 6 classes: Impervious surfaces, Building, Low vegetation, Tree, Car, Background
Vaihingen2D
High-resolution urban segmentation.- RGB + NIR + DSM
- Variable image sizes
- 6 classes: Same as Potsdam2D
Additional Segmentation Datasets
Urban/Building:Chesapeake: Land cover for Chesapeake Bay watershedChesapeakeCVPR: Competition variantLandCoverAI: Building/woodland/water/road segmentationSpaceNet: Building footprint extraction (multiple challenges)
PASTIS: Panoptic segmentation of satellite image time seriesAgriFieldNet: Agricultural field boundary delineationFieldsOfTheWorld: Global field boundary dataset
OSCD: Onera Satellite Change DetectionLEVIRCD: Building change detectionxBD/XView2: Building damage assessment
SEN12MS: Multi-modal (Sentinel-1/2, Landsat 8) segmentationDFC2022: Data Fusion Contest 2022EnviroAtlas: Multi-label land coverGID15: Large-scale land cover
Object Detection Datasets
Benchmark datasets for object detection tasks.VHR-10
Very high-resolution object detection.- 800 RGB images
- 10 classes: Airplane, Ship, Storage tank, Baseball diamond, Tennis court, Basketball court, Ground track field, Harbor, Bridge, Vehicle
- Bounding box annotations
DIOR
Object detection in optical remote sensing images.- 23,463 images
- RGB imagery
- 800x800 pixels
- 20 object categories
DOTA
Oriented object detection in aerial images.- Large-scale dataset
- Oriented bounding boxes
- 15-18 categories (depending on version)
- Very high resolution
Additional Detection Datasets
Object Detection:FAIR1M: Fine-grained object recognition (1M instances, 37 categories)COWC: Cars Overhead With ContextNASAMarineDebris: Marine debris detectionxView: 1M objects, 60 classes
RwandaFieldBoundary: Instance segmentation of crop fieldsIDTReeS: Individual tree crown detection
Multi-Task Datasets
Datasets supporting multiple task types.BigEarthNet
Large-scale multi-label classification.- 590,326 Sentinel-1 and Sentinel-2 patches
- Multi-label classification (19 classes)
- 120x120m patches
- Europe coverage
SEN12MS
Multi-modal semantic segmentation.- 180,662 triplets (Sentinel-1, Sentinel-2, Landsat 8)
- Global coverage
- Land cover classification
- Multi-modal learning
BioMassters
Biomass estimation from satellite imagery.- Sentinel-1 & Sentinel-2 time series
- Above-ground biomass regression
- Competition dataset
Temporal/Time Series Datasets
Datasets with temporal sequences.PASTIS
Panoptic segmentation of satellite time series.- Sentinel-2 time series
- 2,433 parcels
- 18 crop classes
- Temporal semantic segmentation
CropHarvest
Global crop type mapping.- Satellite time series
- Global coverage
- Binary crop/non-crop classification
SustainBench Crop Yield
Crop yield prediction.- Satellite imagery + weather data
- Yield regression task
- US coverage
Embedding Datasets
Pre-computed foundation model embeddings.Usage Examples
Basic Training Loop
With Transforms
Segmentation Example
See Also
- Geospatial Datasets - Datasets with coordinate information
- Transforms - Data augmentation techniques
- DataModules - Lightning DataModules for common datasets