Generate Publication-Ready ECG Classification Figures

The ssrl_ecg.visualization module provides a small set of functions that produce consistent, publication-quality matplotlib figures for all key SSRL-ECG experiments. Every function calls set_publication_style() to enforce 300 DPI resolution, uniform font sizes, and tight bounding boxes — so figures are ready to drop into a paper or report without further editing.

Setting the Publication Style

Before generating any figure, call set_publication_style() once. It configures global rcParams for the current Python session, affecting every subsequent plt call.

from ssrl_ecg.visualization import set_publication_style

set_publication_style()

What it configures

figure.dpi / savefig.dpi

int = 300

Both screen display and saved files are rendered at 300 DPI, meeting the minimum requirement for most journal submissions.

figure.figsize

tuple = (8, 6)

Default figure size in inches. Individual plot functions may override this for specific layouts (e.g. the robustness bar chart uses (9, 6)).

font.family / font.size

str / int

Sans-serif font family at 11 pt base size, with axes labels at 12 pt and titles at 13 pt for clear readability at journal column widths.

axes.grid / grid.alpha

bool / float = True / 0.3

Subtle background grid (30% opacity) aids quantitative reading without cluttering the figure.

savefig.bbox

str = 'tight'

Saves figures with a tight bounding box to avoid whitespace around the axes, which is the expected format for most submission systems.

`plot_roc_curve()`

Plots a ROC curve for a binary or multi-label classifier, annotates the legend with the AUC value, and draws the random-classifier diagonal.

from ssrl_ecg.visualization import plot_roc_curve, set_publication_style
import matplotlib.pyplot as plt

set_publication_style()
fig, ax = plt.subplots()
plot_roc_curve(y_true, y_prob, label='SimCLR', ax=ax, color='blue')
plt.savefig('figures/roc_curve.png')

Parameters

y_true

np.ndarray

required

Ground-truth labels. Accepts 1-D binary arrays or 2-D multi-label matrices of shape (N, C). For multi-label input, the mean across classes is used as the aggregate score.

y_prob

np.ndarray

required

Predicted probabilities matching the shape of y_true. For multi-label input, the per-sample mean probability is used.

label

str

default:"\"\""

Legend label for this ROC curve. The computed AUC is appended automatically, e.g. "SimCLR (AUC = 0.872)".

matplotlib.axes.Axes

default:"None"

Axes object to draw on. If None, a new figure and axes are created automatically.

color

str

default:"\"b\""

Line colour for the ROC curve. Accepts any matplotlib colour string or hex code.

Return values

fig

matplotlib.figure.Figure | None

The figure object. Returns None if an existing ax was passed in (figure ownership remains with the caller).

matplotlib.axes.Axes

The axes object with the ROC curve drawn on it. Useful for overlaying multiple curves.

roc_auc

float

The computed AUC value for this curve, calculated with sklearn.metrics.auc.

Overlaying multiple models

To compare several checkpoints on the same axes, pass the same ax object to each call:

from ssrl_ecg.visualization import plot_roc_curve, set_publication_style
import matplotlib.pyplot as plt

set_publication_style()
fig, ax = plt.subplots()

_, ax, _ = plot_roc_curve(y_true, y_prob_simclr, label='SimCLR', ax=ax, color='steelblue')
_, ax, _ = plot_roc_curve(y_true, y_prob_byol,   label='BYOL',   ax=ax, color='darkorange')
_, ax, _ = plot_roc_curve(y_true, y_prob_sup,    label='Supervised', ax=ax, color='seagreen')

plt.savefig('figures/roc_curve.png')

Generating All Figures at Once

The generate_visualizations.py script at the project root loads a fine-tuned checkpoint, runs inference on the PTB-XL test set, and writes all three publication figures to the figures/ directory.

python generate_visualizations.py

Ensure prerequisites

The script requires a fine-tuned checkpoint at checkpoints/ssl_finetuned_10pct.pt and the PTB-XL dataset at data/PTB-XL. Make sure the figures/ directory exists (it is created automatically).

Run the script

python generate_visualizations.py

The script prints progress for each figure as it is saved:

Generating publication-ready figures...

[1] Generating ROC curves...
  Saved: figures/roc_curve.png

[2] Generating label efficiency curve...
  Saved: figures/label_efficiency.png

[3] Generating robustness comparison...
  Saved: figures/robustness_comparison.png

[OK] All figures generated successfully!

Find the output files

All figures are saved to the figures/ directory at 300 DPI with tight bounding boxes.

Output Figures

figures/roc_curve.png

ROC curves for each of the five cardiovascular classes plus the macro average. Shows per-class AUC and the random-classifier baseline diagonal.

figures/label_efficiency.png

AUROC vs label fraction (log scale x-axis) for SSL fine-tuned and supervised baseline, with shaded ±1 std bands. Illustrates the SSL advantage at low label fractions.

figures/robustness_comparison.png

Grouped bar chart comparing AUROC under clean, noisy (noise_std=0.1), and masked (mask_ratio=0.2) conditions for each model variant.

Evaluation Scripts in `evaluation/`

The evaluation/ directory contains four standalone scripts for deeper analysis. Each script is self-contained and can be run independently after checkpoints are available.

compare_baselines.py

Evaluates SimCLR, BYOL, ResNet-1D, and supervised checkpoints side-by-side on the PTB-XL test set. Prints a ranked summary table sorted by AUROC and identifies the best-performing method.

per_class_analysis.py

Computes per-class AUROC for BCE baseline vs focal-loss + oversampling. Highlights improvements for underrepresented classes such as HYP and CD, which were the original focus of the class-balancing work.

compare_losses.py

Runs a comprehensive comparison of BCE vs focal-loss training strategies on the test set, printing absolute and percentage AUROC/F1 differences and flagging whether improvements are positive on both metrics.

generate_publication_report.py

Combines multi-seed phase-2 results (from results/phase2_multiseed_results.json) with phase-3 baseline comparisons to produce a formatted markdown table and statistical significance report (t-test, Cohen’s d).

# Run any evaluation script directly
python evaluation/compare_baselines.py
python evaluation/per_class_analysis.py
python evaluation/compare_losses.py
python evaluation/generate_publication_report.py

Run generate_publication_report.py last — it aggregates outputs from the other scripts and the multi-seed training runs into a single printable summary suitable for inclusion in a paper appendix.

Get Started

Concepts

Training

Evaluation & Analysis

Guides

Generate Publication-Ready ECG Classification Figures

Setting the Publication Style

What it configures

`plot_roc_curve()`

Parameters

Return values

Overlaying multiple models

Generating All Figures at Once

Output Figures

figures/roc_curve.png

figures/label_efficiency.png

figures/robustness_comparison.png

Evaluation Scripts in `evaluation/`

compare_baselines.py

per_class_analysis.py

compare_losses.py

generate_publication_report.py

Build docs developers (and LLMs) love

Get Started

Concepts

Training

Evaluation & Analysis

Guides

Documentation Index

​Setting the Publication Style

​What it configures

​plot_roc_curve()

​Parameters

​Return values

​Overlaying multiple models

​Generating All Figures at Once

​Output Figures

figures/roc_curve.png

figures/label_efficiency.png

figures/robustness_comparison.png

​Evaluation Scripts in evaluation/

compare_baselines.py

per_class_analysis.py

compare_losses.py

generate_publication_report.py

Build docs developers (and LLMs) love

Setting the Publication Style

What it configures

`plot_roc_curve()`

Parameters

Return values

Overlaying multiple models

Generating All Figures at Once

Output Figures

Evaluation Scripts in `evaluation/`