The SQLMorph paper evaluates three state-of-the-art Text-to-SQL systems — CHESS, DIN-SQL, and MAC-SQL — on the BIRD dev set using both Join Query Expansion (JQE) and the relaxed evaluation metrics framework. This page shows you how to reproduce every experiment. System predictions are already included in the repository underDocumentation Index
Fetch the complete documentation index at: https://mintlify.com/dais-polymtl/sqlmorph/llms.txt
Use this file to discover all available pages before exploring further.
data/experiments/<system>/, so you do not need to re-run inference; the experiment scripts read those outputs directly.
Systems evaluated
| System | Output data location |
|---|---|
| CHESS | data/experiments/CHESS/ |
| DIN-SQL | data/experiments/DIN-SQL/ |
| MAC-SQL | data/experiments/MAC-SQL/ |
JQE experiments
Experiments 1 and 2 — Connectivity and cyclicity
These experiments measure the structural properties of the expansion set versus the original BIRD dev queries. The full set of expansion graphs is stored in:experiments/augmentation/:
| Output file | Contents |
|---|---|
augmented_join_details.csv | Average degree and cycle presence for each augmented query |
original_join_details.csv | Average degree and cycle presence for each original BIRD dev query |
Experiment 3 — Systems performance on unique expansions
CHESS, DIN-SQL, and MAC-SQL were evaluated on 58 unique expansion queries derived from the BIRD dev set. To compute each system’s scores on the original queries, the unique expansions, and the Delta EX metric (EX_expanded − EX_original):data/experiments/augmentation/:
| Output file | Contents |
|---|---|
<system>_aug_mode_results.csv | Per-query results on the unique expansions |
<system>_dev_mode_results.csv | Per-query results on the original dev queries |
<system>_delta_ex_results.csv | Delta EX value for each unique expansion query |
Experiment 3 — Systems performance on sampled expansions
The same three systems were also evaluated on 408 queries sampled from the full expansion set. To generate the results filedata/experiments/join_sampling_results.csv:
Both Experiment 3 variants read system prediction files from
data/experiments/<system>/. Ensure the data directory was downloaded correctly before running these scripts.