OpenComic AI models are trained using traiNNer-redux, an open-source image restoration and super-resolution training framework. The repository includes ready-to-use traiNNer-redux configuration files underDocumentation Index
Fetch the complete documentation index at: https://mintlify.com/ollm/opencomic-ai-training/llms.txt
Use this file to discover all available pages before exploring further.
options/train/ that match the architecture and data paths used for the official OpenComic AI model releases. You can use these configs directly or adapt them to your own dataset paths and hyperparameters.
Full documentation for traiNNer-redux configuration options, architecture references, and loss functions is available at trainner-redux.readthedocs.io.
Model Architectures
OpenComic AI ships three architecture tiers across all task families:| Architecture | Config folder | Description |
|---|---|---|
| ESRGAN | options/train/ESRGAN/ | Full-size ESRGAN generator. Highest quality; highest VRAM and inference time. |
| ESRGAN Lite | options/train/ESRGAN/ | Lighter ESRGAN variant (*-lite configs). Balanced quality and speed. |
| Compact | options/train/Compact/ | Smallest architecture (*-compact configs). Fastest inference; suitable for real-time or CPU use. |
network_g.type field and the pretrained model path differ between configs.
Pre-Training Model Chain
The OpenComic AI release models follow a deliberate pre-training chain that transfers general restoration knowledge before specializing for a harder task:- Artifact removal — trained from scratch (no pretrained base).
- Descreen — initialized from the
artifact-removalweights of the matching architecture tier, then fine-tuned on descreen data. Artifact removal features are a prerequisite because halftone patterns are a class of compression artifact. - Upscale 2x — initialized from
artifact-removalweights. Real-world scanned comics almost always carry compression and halftone artifacts before upscaling, so artifact-removal priors improve upscale quality on degraded inputs. - Upscale 3x / 4x — initialized from
upscale-2xweights of the matching tier, not fromartifact-removal.
path.pretrain_network_g field of each training config.
Training Workflow
Generate the paired dataset
Follow the steps in Generate Paired Training Datasets to produce a This writes image pairs to
clean/ and degraded/ folder. For example, to generate the upscale-2x dataset:datasets/opencomic-ai-upscale-2x/clean/ and datasets/opencomic-ai-upscale-2x/degraded/.Validate the dataset (recommended)
Before training, run
fix-images.mjs to remove any incomplete or dimension-mismatched pairs that could cause data loader errors. See Validate and Fix Paired Dataset Consistency for details.Point the training config to your dataset
Open the relevant config in Update
options/train/ESRGAN/. The upscale-2x config at options/train/ESRGAN/opencomic-ai-upscale-2x.yml contains the following dataset section:dataroot_gt and dataroot_lq to point at your generated dataset folders if they differ from the defaults. The lq_size of 64 corresponds to a 128×128 ground-truth crop at 2× scale (lq_size = gt_size / scale).Set the pretrained model path
For upscale-2x and descreen models, set For artifact-removal training from scratch, set
path.pretrain_network_g to the artifact-removal checkpoint of the matching architecture:pretrain_network_g to ~ (null).Configure network and scale
The top of each config sets the scale and generator architecture:For artifact-removal and descreen models,
scale: 1 and the clean and degraded images are the same spatial size.Training Config Reference (upscale-2x)
The following excerpt covers the key fields fromoptions/train/ESRGAN/opencomic-ai-upscale-2x.yml: