Class Signature
Parameters
Dataset name (e.g., “ett”, “etth1”). Used to automatically load datasets from predefined sources.
Name of the datetime column in the CSV file.
Path to the CSV file containing time-series data.
Train/val/test split boundaries. If [0,0,0], defaults are computed automatically.
Batch size for dataloaders.
Dataset mode: “train” or “test”.
Stride for windowing when creating sequences.
Length of historical context to use for forecasting.
Forecast horizon length (number of future steps to predict).
Attributes
Maximum number of columns (channels) supported.
Number of samples per chunk, computed as
(length_timeseries - context_len - horizon_len) // stride + 1.Methods
get_data_loader()
Returns a PyTorch DataLoader for the dataset.PyTorch DataLoader configured with the specified batch size.
len()
Returns the number of samples in the dataset.Total number of samples available.
Usage Example
Notes
- The dataset automatically handles train/test splitting based on boundaries
- Windowing is applied with the specified stride to create overlapping sequences
- Maximum of 64 channels supported by default
- Data is read from CSV and preprocessed automatically