Overview
This workshop is built around a real-world solar irradiance dataset: four years (2020–2023) of 1-minute Global Horizontal Irradiance (GHI) measurements from station LQ, located in the La Quiaca area of northwestern Argentina at an altitude of 3,500 m. The four annual CSV files together contain 1,764,898 raw 1-minute readings before any deduplication or resampling (2020: 526,994 rows; 2021: 503,874; 2022: 492,567; 2023, a partial year: 241,463). The measured time series is paired with a clear-sky reference generated by the ARGP clear-sky model. The modeled data covers a longer historical window, starting in 2009, and serves as a physically consistent benchmark for quality control, gap filling, and validation throughout the preprocessing pipeline.
Measured Data — measured/GHI_LQyyyy.csv
The raw pyranometer files are stored as one CSV per year, named GHI_LQ2020.csv through GHI_LQ2023.csv. Each file contains the following columns:
| Column | Description |
|---|---|
| AÑO | Calendar year of the measurement |
| DIA JULIANO | Julian day of year (1–365/366) |
| HORA | Minute-of-day counter (1-based) |
| IRRADIANCIA (W/m2) | Measured GHI in watts per square metre |
| Fecha | ISO-8601 timestamp (YYYY-MM-DD HH:MM:SS) |
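The relationship between the three integer columns and Fecha can be sketched as follows. This is only an illustration: the helper name `timestamp_from_parts` is hypothetical, and the code assumes that HORA = 1 denotes the first minute of the day (00:00). The source only says the counter is 1-based, so the convention should be checked against the Fecha column of the real files.

```python
from datetime import datetime, timedelta


def timestamp_from_parts(year: int, day_of_year: int, minute_of_day: int) -> datetime:
    """Rebuild a timestamp from AÑO, DIA JULIANO and HORA.

    ASSUMPTION: HORA == 1 maps to 00:00 of that day. If the files use
    HORA == 1 for 00:01 instead, drop the "- 1" on the minutes term.
    """
    start_of_year = datetime(year, 1, 1)
    # Both counters are 1-based, so subtract 1 before adding the offsets.
    return start_of_year + timedelta(days=day_of_year - 1, minutes=minute_of_day - 1)


# Day 60 falls on 29 February in a leap year and on 1 March otherwise.
leap_example = timestamp_from_parts(2020, 60, 1)
noon_example = timestamp_from_parts(2021, 60, 721)
print(leap_example)  # 2020-02-29 00:00:00
print(noon_example)  # 2021-03-01 12:00:00
```

Since Fecha already carries the full timestamp, a helper like this is mainly useful as a consistency check between the integer columns and Fecha.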
GHI_LQ2020.csv:
Note: Negative values near zero are normal; they represent instrument noise in the night-time baseline.
The two columns actually used when loading the data are column index 4 (Fecha) and column index 3 (IRRADIANCIA (W/m2)), counting from zero.
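A minimal sketch of selecting those two columns by position with pandas is shown below. The rows are synthetic and only stand in for a real file. The comma delimiter, the presence of a header row, and the short names `ghi` and `timestamp` are assumptions, not details confirmed by the dataset.

```python
import io

import pandas as pd

# Synthetic rows for illustration only; they are NOT taken from the real files.
# ASSUMPTION: comma-separated values with a header row.
sample_csv = io.StringIO(
    "AÑO,DIA JULIANO,HORA,IRRADIANCIA (W/m2),Fecha\n"
    "2020,1,1,-1.2,2020-01-01 00:00:00\n"
    "2020,1,2,-0.8,2020-01-01 00:01:00\n"
    "2020,1,721,1050.3,2020-01-01 12:00:00\n"
)

# Keep only column index 3 (irradiance) and column index 4 (Fecha).
# pandas returns the selected columns in file order, so irradiance comes first.
df = pd.read_csv(sample_csv, usecols=[3, 4])
df.columns = ["ghi", "timestamp"]  # hypothetical short names

# Parse the ISO-8601 strings and use them as the index.
df["timestamp"] = pd.to_datetime(df["timestamp"])
ghi = df.set_index("timestamp")["ghi"]

# Small negative night-time readings are expected, so they are counted here
# rather than removed.
n_negative = int((ghi < 0).sum())
print(ghi)
print("negative readings:", n_negative)
```

For a real file, the `io.StringIO` object would be replaced by the path to one of the yearly CSVs.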
Modeled Data — modelated/LQ.csv
The ARGP clear-sky model output is stored in modelated/LQ.csv and contains two columns:
| Column | Description |
|---|---|
| date | ISO-8601 timestamp |
| GHI | Modeled GHI in watts per square metre |
modelated/LQ.csv:
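One way the modeled series might be used as a benchmark is to align it with the measurements and look at the ratio of measured to clear-sky GHI. The sketch below uses synthetic values throughout. The ratio check, the `MIN_CLEAR_WM2` threshold, and the assumption that both series share the same timestamps are illustrative choices, not the workshop's documented procedure.

```python
import io

import pandas as pd

# Synthetic rows for illustration only; they are NOT real ARGP model output.
model_csv = io.StringIO(
    "date,GHI\n"
    "2020-01-01 00:00:00,0.0\n"
    "2020-01-01 12:00:00,1100.0\n"
    "2020-01-01 12:01:00,1100.5\n"
)
clear = pd.read_csv(model_csv, parse_dates=["date"], index_col="date")["GHI"]
clear = clear.rename("ghi_clear")

# Synthetic measured values on the same timestamps (also made up).
measured = pd.Series(
    [-1.2, 1045.0, 550.25],
    index=pd.to_datetime(
        ["2020-01-01 00:00:00", "2020-01-01 12:00:00", "2020-01-01 12:01:00"]
    ),
    name="ghi",
)

# Keep only the timestamps present in both series.
both = pd.concat([measured, clear], axis=1, join="inner")

# Hypothetical threshold: ignore night and twilight samples, where the
# clear-sky value is too small for the ratio to be meaningful.
MIN_CLEAR_WM2 = 10.0
daylight = both["ghi_clear"] > MIN_CLEAR_WM2

# Dividing by a masked (NaN) denominator leaves non-daylight rows as NaN.
both["ratio"] = both["ghi"] / both["ghi_clear"].where(daylight)
print(both)
```

A ratio well above 1 or persistently near 0 during daylight would be a candidate for closer inspection, under whatever limits the quality-control step actually adopts.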
Atmospheric Reanalysis Reference — modelated/LQ_regs_60.csv
A complementary file modelated/LQ_regs_60.csv provides hourly atmospheric reanalysis variables that are used as regression features in later modeling stages:
| Column | Description |
|---|---|
| datetime | ISO-8601 timestamp (hourly) |
| tm | Near-surface temperature (normalised) |
| uw | Eastward (zonal) wind component |
| dw | Northward (meridional) wind component |
| tcwv | Total column water vapour (kg m⁻²) |
| d2m | 2-metre dewpoint temperature (K) |
modelated/LQ_regs_60.csv:
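Because these variables are hourly while the measurements are 1-minute, the two have to be brought onto a common time grid before they can be used together. A minimal sketch, assuming hourly means and assuming each reanalysis timestamp labels the start of its hour (neither is stated in the source); all values are synthetic:

```python
import pandas as pd

# Synthetic 1-minute GHI covering two hours (made-up values 1000, 1001, ...).
minute_index = pd.date_range("2020-01-01 12:00", periods=120, freq="min")
ghi = pd.Series([1000.0 + i for i in range(120)], index=minute_index, name="ghi")

# Synthetic hourly regressors using the column names from the table above.
regs = pd.DataFrame(
    {
        "tm": [0.40, 0.45],
        "uw": [1.5, 2.0],
        "dw": [-0.5, -0.2],
        "tcwv": [8.0, 8.5],
        "d2m": [271.0, 271.5],
    },
    index=pd.to_datetime(["2020-01-01 12:00:00", "2020-01-01 13:00:00"]),
)
regs.index.name = "datetime"

# Average the 1-minute samples into 60-minute bins labelled by the bin start.
# ASSUMPTION: the reanalysis timestamps also label the start of each hour.
ghi_hourly = ghi.resample("60min").mean()

# Keep only the hours that appear in both tables.
features = regs.join(ghi_hourly, how="inner")
print(features)
```

If the reanalysis timestamps turn out to label the end or the centre of the hour, the resampling call would need matching `label`, `closed`, or offset settings.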
Dataset at a Glance
| Property | Value |
|---|---|
| Station | LQ (La Quiaca area, Argentina) |
| Altitude | 3,500 m a.s.l. |
| Measurement period | 2020 – 2023 |
| Temporal resolution | 1 minute |
| Total raw rows (all years) | 1,764,898 |
| Total years | 4 |
| Modeled reference start | 2009 |
| Sensor type | Pyranometer (real field measurements) |
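The row total can be checked against the per-year counts given in the Overview, and each count can be compared with the number of minutes in that calendar year. The sketch below is only a rough indication: raw rows may still contain duplicates, so the ratio is not a true coverage figure, and 2023 is a partial year by design.

```python
import calendar

# Raw row counts per year, as stated in the Overview.
raw_rows = {2020: 526_994, 2021: 503_874, 2022: 492_567, 2023: 241_463}

total_rows = sum(raw_rows.values())
print("total raw rows:", total_rows)


def minutes_in_year(year: int) -> int:
    """Number of 1-minute slots in a full calendar year."""
    days = 366 if calendar.isleap(year) else 365
    return days * 24 * 60


# Ratio of raw rows to possible minutes; duplicates would inflate this value.
row_ratio = {year: n / minutes_in_year(year) for year, n in raw_rows.items()}
for year, ratio in row_ratio.items():
    print(year, f"{ratio:.1%}")
```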
Continue Exploring
Loading Data
Step-by-step guide to reading, renaming, parsing and reindexing the multi-year CSV files with pandas.
Measurement Stations
Reference for all 16 GHI stations in the South American network, including geographic coordinates.