Beyond inference and training, the Applio CLI includes a set of utility subcommands for inspecting models, merging model weights, downloading models from the internet, setting up the runtime environment, analyzing audio files, and launching TensorBoard. These commands are independent of the training pipeline and can be run at any time from the Applio root directory with the virtual environment activated.Documentation Index
Fetch the complete documentation index at: https://mintlify.com/IAHispano/Applio/llms.txt
Use this file to discover all available pages before exploring further.
model_information
Themodel_information subcommand prints metadata stored inside a .pth model file — including the model architecture, the epoch at which it was saved, the sample rate it was trained at, and the embedder model used. This is useful for auditing unknown or shared models before using them for inference or blending.
Flags
Path to the
.pth model file to inspect.Example
model_blender
Themodel_blender subcommand fuses two RVC models by linearly interpolating their weights at a configurable ratio. The result is a new model file that combines vocal characteristics from both inputs. This technique — often called model merging or model interpolation — can produce hybrid voices that blend the timbre or style of two different source models.
Flags
Name for the output blended model. The resulting
.pth file will be saved to logs/<model_name>/.Path to the first source
.pth model file.Path to the second source
.pth model file.Blend ratio between the two models. Range:
0.0–1.0 in steps of 0.1. A value of 0.0 produces a copy of model 1; 1.0 produces a copy of model 2; 0.5 produces an equal blend. Intermediate values interpolate smoothly between the two.Example
Both models must have been trained with the same architecture (same sample rate and vocoder settings) for the blend to produce meaningful results. Blending incompatible architectures may fail or produce silent / degraded output.
download
Thedownload subcommand downloads a model archive from a direct URL or a Hugging Face model repository link, extracts the contents, and places the resulting files in the logs/ directory so they are immediately available for inference.
Flags
A direct download URL (e.g., a
.zip file) or a Hugging Face repository URL pointing to a model archive. The link must be publicly accessible.Example
logs/ ready for use with infer or tts.
prerequisites
Theprerequisites subcommand downloads and installs the files that Applio requires at runtime: official pretrained base model weights, supporting models (e.g., RMVPE), and platform-specific executables. Run this command once after a fresh installation, or again if any required files are missing.
Flags
Download the official Applio pretrained HiFi-GAN generator and discriminator weights used as a training starting point. Strongly recommended. Accepts
True or False.Download additional supporting models, including RMVPE and other pitch-extraction or embedding models. Accepts
True or False.Download required platform-specific executables (e.g., FFmpeg binaries used for audio processing). Accepts
True or False.Example
audio_analyzer
Theaudio_analyzer subcommand reads an audio file and prints a summary of its technical properties: sample rate, duration, bit depth, channel count, and RMS signal level. It also generates a waveform and spectrogram visualization saved as a PNG file at logs/audio_analysis.png.
Flags
Path to the audio file to analyze. Supports common audio formats (WAV, MP3, FLAC, etc.).
Outputs
The command prints the following information to stdout:- Sample rate (Hz)
- Duration (seconds)
- Bit depth
- Number of channels (mono / stereo)
- RMS level (dBFS)
logs/audio_analysis.png.
Example
tensorboard
Thetensorboard subcommand launches a TensorBoard server pointed at Applio’s logs/ directory. This lets you monitor training loss curves, inspect audio samples written during training, and compare multiple training runs in a browser-based UI.
Flags
This subcommand takes no arguments.Example
http://localhost:6006) that you can open in a browser. The server will remain running until you interrupt it with Ctrl+C.
TensorBoard data is only available while the
logs/ directory contains event files written by one or more training runs. Start a training job first, then open TensorBoard to view live or historical metrics.