Documentation Index
Fetch the complete documentation index at: https://mintlify.com/atomind-ai/mlip-arena/llms.txt
Use this file to discover all available pages before exploring further.
Physical motivation
Combustion is a prototypical example of reactive chemistry: bonds break and form during the simulation, and the final state of the system is chemically different from the initial state. Most MLIPs are trained on static DFT calculations and do not explicitly model bond breaking and forming — they are expected to learn these processes implicitly from their training data. This benchmark tests whether an MLIP can:- Drive a high-temperature chemical reaction to completion
- Predict the correct products (water molecules from hydrogen combustion)
- Reproduce the experimental reaction enthalpy
- Maintain simulation stability throughout the highly exothermic reaction event
Reaction and system
The benchmark simulates hydrogen combustion:H256O128) — equivalent to 64 formula units of the stoichiometric 2H₂ + O₂ mixture. The initial configuration is provided in benchmarks/combustion/H256O128.extxyz.
The Prefect flow is importable from:
benchmarks/combustion/run.ipynb for a complete walkthrough.
Temperature protocol
The simulation follows a three-stage temperature ramp:- Ramp up from 300 K to 3000 K over the first third of the simulation
- Hold at 3000 K for the middle third (the combustion event occurs here)
- Ramp down from 3000 K back to 300 K over the final third
Metrics
| Metric | Description |
|---|---|
| Reaction yield (%) | Fraction of H₂O molecules formed relative to the stoichiometric maximum (128 molecules) |
| Reaction enthalpy ΔH (kcal/mol) | Energy change per water molecule, compared to experimental reference of −68.3 kcal/mol |
| Number of products vs. timestep | Time series of water molecule count — reveals reaction kinetics and completeness |
| Temperature vs. timestep | Actual temperature trajectory vs. the target ramp |
| Energy drift ΔE (kcal/mol) | Total energy change per water molecule over the simulation |
| Center of mass drift (Å) | Displacement of the system’s center of mass — a stability diagnostic |
| Steps per second | MD throughput on a single A100 GPU |
The experimental reference enthalpy (−68.3 kcal/mol) comes from the CRC Handbook of Chemistry and Physics (Lide, 2004). The conversion factor used is 23.0609 eV to kcal/mol.
Experimental reference
Two external references are used for comparison:- Flame temperature window (512,345 – 666,667 timesteps): from Hasche et al. (2023), Fuel, 352, 128964 — experimental assessment of hydrogen combustion flame temperatures.
- Reaction enthalpy: −68.3 kcal/mol from the CRC Handbook of Chemistry and Physics, Vol. 85 (Lide, 2004).
Model support
The following models support this benchmark (gpu-tasks: combustion in the model registry):
| Model | Family | Training data |
|---|---|---|
| MACE-MP(M) | mace-mp | MPTrj |
| MACE-MPA | mace-mp | MPTrj, Alexandria |
| CHGNet | chgnet | MPTrj |
| M3GNet | matgl | MPF |
| MatterSim | mattersim | MPTrj, Alexandria |
| ORBv2 | orb | MPTrj, Alexandria |
| ORB | orb | MPTrj, Alexandria |
| SevenNet | sevennet | MPTrj |
| EquiformerV2(OC20) | equiformer | OC20 |
| eSCN(OC20) | escn | OC20 |
How to run
benchmarks/combustion/run.ipynb.
Results for each model are saved as JSON files in benchmarks/combustion/<family>/<model>_H256O128.json, containing per-timestep arrays of:
timestepnproducts(number of water molecules)temperaturesenergiescom_drifts(center of mass drift in x, y, z)yield(final fractional yield)steps_per_second