Applio includes a built-in audio effects chain that runs immediately after voice conversion, before the output file is written to disk. These effects are powered by Spotify’sDocumentation Index
Fetch the complete documentation index at: https://mintlify.com/IAHispano/Applio/llms.txt
Use this file to discover all available pages before exploring further.
pedalboard library, a high-quality Python audio effects processor. Every effect is optional and independently toggled — you can chain as many or as few as you need. Effects are applied in the order they appear in the pipeline: reverb → pitch shift → limiter → gain → distortion → chorus → bitcrush → clipping → compressor → delay.
Enabling Post-Processing
Post-processing is disabled by default. To activate it, setpost_process=True in either the Python API or the CLI. Each individual effect also requires its own flag (e.g. reverb=True). Setting post_process=True without enabling any individual effect flag is a no-op.
Post-processing is applied to the full converted audio signal. If you use
split_audio=True, the effects are applied after the segments are merged back together.Available Effects
Reverb
Reverb
Simulates the acoustic reflections of a physical space. Higher room sizes produce longer, more diffuse tails; damping controls how quickly high frequencies decay.Enable with
reverb=True.Size of the simulated room. Range:
0.0 (dry, tight) to 1.0 (large hall). Higher values produce longer reverb tails.High-frequency damping applied to the reverb tail. Range:
0.0 to 1.0. Higher values make the tail sound darker and decay faster.Level of the wet (reverberated) signal in the mix. Range:
0.0 to 1.0.Level of the dry (original) signal in the mix. Range:
0.0 to 1.0. Set to 0.0 for a fully wet signal.Stereo width of the reverb effect. Range:
0.0 (mono) to 1.0 (full stereo spread).Controls infinite reverb sustain. Range:
0.0 (normal decay) to 1.0 (reverb sustains indefinitely without decaying).Pitch Shift
Pitch Shift
Transposes the pitch of the output audio by a fixed number of semitones, independent of the model’s pitch parameter. Useful for fine-tuning the final pitch of the converted audio without re-running inference.Enable with
pitch_shift=True.Number of semitones to shift the pitch. Positive values raise the pitch; negative values lower it. For example,
12.0 shifts up one octave.Limiter
Limiter
A brickwall limiter that prevents the output from exceeding a set threshold in decibels. Essential for preventing clipping when using effects that increase gain (e.g. reverb, distortion, gain).Enable with
limiter=True.Maximum output level in dBFS. Anything above this value is transparently attenuated. A value of
-6 gives 6 dB of headroom below 0 dBFS.Release time in seconds. Controls how quickly the limiter stops attenuating after the signal falls below the threshold. Shorter values sound more responsive; longer values sound smoother.
Gain
Gain
Applies a fixed gain (amplification or attenuation) in decibels to the output signal. Use in combination with the limiter to boost quiet conversions safely.Enable with
gain=True.Gain to apply in decibels. Positive values amplify; negative values attenuate. For example,
6.0 doubles the amplitude, -6.0 halves it.Distortion
Distortion
Applies soft-clipping harmonic distortion to the signal. Adds grit and harmonic overtones. At high drive values the output will be heavily saturated.Enable with
distortion=True.Drive amount in dB. Higher values produce heavier distortion. A value around
10–20 gives mild saturation; values above 30 produce heavy clipping.Chorus
Chorus
Creates a thick, doubling effect by layering slightly pitch-modulated and time-delayed copies of the signal. Useful for thickening vocal conversions.Enable with
chorus=True.Modulation rate in Hz. Controls how fast the pitch oscillates. Lower values create a slow, wide sweep; higher values create a rapid vibrato-like effect.
Modulation depth as a fraction (
0.0–1.0). Higher values create a more pronounced pitch variation between the original and chorus voices.Centre delay time in milliseconds. The average time offset of the chorus copies from the dry signal.
Amount of the chorus output fed back into the input (
0.0–1.0). Higher values create a more resonant, metallic effect.Wet/dry mix ratio.
0.0 is fully dry; 1.0 is fully wet (chorus only).Bitcrush
Bitcrush
Reduces the bit depth of the audio to simulate the lo-fi sound of vintage digital samplers or game audio hardware.Enable with
bitcrush=True.Target bit depth. Lower values produce more quantization noise and a grittier sound.
8 gives a classic 8-bit character; 4 is extremely degraded.Clipping
Clipping
Hard-clips the audio at a threshold, producing harsh distortion. Unlike the limiter (which attenuates transparently), clipping introduces audible harmonic distortion at the threshold boundary.Enable with
clipping=True.Threshold in dBFS at which hard clipping occurs. Any sample exceeding this level is clipped flat. Values closer to
0 allow more signal through before clipping.Compressor
Compressor
Reduces the dynamic range of the signal by attenuating loud passages. Useful for evening out volume inconsistencies in the converted voice.Enable with
compressor=True.Level in dBFS above which compression is applied. Signals below this level pass through unaffected.
Compression ratio (e.g.
4 means 4:1 — for every 4 dB above the threshold, only 1 dB comes through). A ratio of 1 means no compression.Attack time in milliseconds. Controls how quickly the compressor reacts when the signal exceeds the threshold. Shorter values clamp transients harder.
Release time in milliseconds. Controls how quickly the compressor stops compressing after the signal falls below the threshold.
Delay
Delay
Adds an echo effect by mixing a time-delayed copy of the signal back into the output. The feedback parameter controls how many echo repeats you hear.Enable with
delay=True.Delay time in seconds. For example,
0.5 produces an echo half a second after the original signal.Amount of the delayed signal fed back into the delay line (
0.0–1.0). Higher values create multiple repeating echoes. Values approaching 1.0 may cause runaway feedback — use with caution.Wet/dry mix ratio.
0.0 is the dry signal only; 1.0 is the delayed signal only.CLI Example
The following example applies reverb and a compressor to the output. All effect flags must be explicitly set toTrue: