
zsxkib/realistic-voice-cloning

Create song covers with any RVC v2 trained AI voice from audio files.

Capabilities

No capability data available

Cost

Community model (estimated from hardware time)

Input Parameters

backup_vocals_volume_change number

Control volume of backup AI vocals.

Default: 0
crepe_hop_length integer

When `pitch_detection_algorithm` is set to `mangio-crepe`, this controls how often it checks for pitch changes, in milliseconds. Lower values lead to longer conversions and a higher risk of voice cracks, but better pitch accuracy.

Default: 128
custom_rvc_model_download_url string

URL to download a custom RVC model. If provided, the model will be downloaded (if it doesn't already exist) and used for prediction, regardless of the 'rvc_model' value.

filter_radius integer

If >=3: apply median filtering to the harvested pitch results.

Default: 3 min: 0, max: 7
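
To illustrate what median filtering does to a pitch track, here is a minimal sketch in plain Python. The function name `median_filter`, the `window` argument, and the edge handling (truncating the window at both ends) are assumptions made for illustration; this is not the model's actual implementation, and how `filter_radius` maps to a window size inside the model is not documented here.

```python
from statistics import median


def median_filter(values, window=3):
    """Replace each sample with the median of a window centred on it.

    The window is truncated at the edges (an illustrative choice).
    """
    if window < 1 or window % 2 == 0:
        raise ValueError("window must be a positive odd integer")
    half = window // 2
    return [
        median(values[max(0, i - half): i + half + 1])
        for i in range(len(values))
    ]


# A hypothetical pitch track in Hz with one spurious spike at index 2.
pitch_track = [220.0, 221.0, 440.0, 222.0, 223.0]
smoothed = median_filter(pitch_track, window=3)
print(smoothed)
```

The isolated 440 Hz outlier is replaced by a neighbouring value, while the slowly changing samples around it are left nearly untouched.
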
index_rate number

Control how much of the AI's accent to leave in the vocals.

Default: 0.5 min: 0, max: 1
instrumental_volume_change number

Control volume of the background music/instrumentals.

Default: 0
main_vocals_volume_change number

Control volume of main AI vocals. Use -3 to decrease the volume by 3 decibels, or 3 to increase the volume by 3 decibels.

Default: 0
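
For intuition about what a decibel change means, the sketch below converts a value such as -3 or 3 into a linear amplitude multiplier using the standard amplitude convention (20 dB per factor of 10). The helper name `db_to_gain` is hypothetical, and whether the model applies the change in exactly this way internally is an assumption.

```python
def db_to_gain(db):
    """Convert a decibel change to a linear amplitude multiplier."""
    return 10 ** (db / 20)


# Show the multiplier for a 3 dB cut, no change, and a 3 dB boost.
for change in (-3, 0, 3):
    print(f"{change:+d} dB -> x{db_to_gain(change):.3f}")
```

Under this convention, -3 dB scales the amplitude to roughly 0.71 of the original and +3 dB to roughly 1.41, so the two changes cancel each other out.
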
output_format string

`wav` gives the best quality at a large file size; `mp3` gives decent quality at a small file size.

Default: "mp3"
Options: mp3, wav
pitch_change string

Adjust pitch of AI vocals. Options: `no-change`, `male-to-female`, `female-to-male`.

Default: "no-change"
Options: no-change, male-to-female, female-to-male
pitch_change_all number

Change pitch/key of background music, backup vocals and AI vocals in semitones. Reduces sound quality slightly.

Default: 0
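
As a rough guide to what a semitone shift does, the sketch below computes the frequency ratio for a given number of semitones, assuming twelve-tone equal temperament (each semitone multiplies frequency by the twelfth root of 2). The helper name `semitone_ratio` is hypothetical, and the model's internal pitch-shifting method is not described here.

```python
def semitone_ratio(semitones):
    """Frequency ratio for a shift of `semitones` in equal temperament."""
    return 2 ** (semitones / 12)


# Shift a 440 Hz reference tone up and down by two semitones.
reference_hz = 440.0
for shift in (-2, 0, 2):
    print(f"{shift:+d} semitones -> {reference_hz * semitone_ratio(shift):.2f} Hz")
```

A shift of 12 doubles every frequency (one octave up), and a shift of -12 halves it.
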
pitch_detection_algorithm string

Best option is rmvpe (clarity in vocals), then mangio-crepe (smoother vocals).

Default: "rmvpe"
Options: rmvpe, mangio-crepe
protect number

Control how much of the original vocals' breath and voiceless consonants to leave in the AI vocals. Set 0.5 to disable.

Default: 0.33 min: 0, max: 0.5
reverb_damping number

Absorption of high frequencies in the reverb.

Default: 0.7 min: 0, max: 1
reverb_dryness number

Level of AI vocals without reverb.

Default: 0.8 min: 0, max: 1
reverb_size number

The larger the room, the longer the reverb time.

Default: 0.15 min: 0, max: 1
reverb_wetness number

Level of AI vocals with reverb.

Default: 0.2 min: 0, max: 1
rms_mix_rate number

Control how much to use the original vocal's loudness (0) or a fixed loudness (1).

Default: 0.25 min: 0, max: 1
rvc_model string

RVC model for a specific voice. If using a custom model, this should match the name of the downloaded model. If a 'custom_rvc_model_download_url' is provided, this will be automatically set to the name of the downloaded model.

Default: "Squidward"
Options: Squidward, MrKrabs, Plankton, Drake, Vader, Trump, Biden, Obama, Guitar, Voilin, CUSTOM
song_input string

Upload your audio file here.
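
Before submitting a request, it can help to check values against the ranges and options listed above. The following is a minimal sketch of such a check; the names `RANGES`, `CHOICES`, and `check_input`, and the placeholder file name `song.mp3`, are hypothetical and not part of any official client. It only covers the parameters for which a min/max or a fixed option set is documented.

```python
# Documented (min, max) ranges, copied from the parameter list above.
RANGES = {
    "filter_radius": (0, 7),
    "index_rate": (0, 1),
    "protect": (0, 0.5),
    "reverb_damping": (0, 1),
    "reverb_dryness": (0, 1),
    "reverb_size": (0, 1),
    "reverb_wetness": (0, 1),
    "rms_mix_rate": (0, 1),
}

# Documented option sets for the string parameters with fixed choices.
CHOICES = {
    "output_format": {"mp3", "wav"},
    "pitch_change": {"no-change", "male-to-female", "female-to-male"},
    "pitch_detection_algorithm": {"rmvpe", "mangio-crepe"},
}


def check_input(params):
    """Return a list of human-readable problems; empty means none found."""
    problems = []
    for name, value in params.items():
        if name in RANGES:
            low, high = RANGES[name]
            if not low <= value <= high:
                problems.append(f"{name}={value} is outside [{low}, {high}]")
        elif name in CHOICES and value not in CHOICES[name]:
            allowed = sorted(CHOICES[name])
            problems.append(f"{name}={value!r} is not one of {allowed}")
    return problems


# Hypothetical input: a placeholder file name plus a few overrides.
example = {
    "song_input": "song.mp3",
    "rvc_model": "Squidward",
    "protect": 0.33,
    "output_format": "wav",
}
print(check_input(example))
print(check_input({"protect": 0.9, "output_format": "flac"}))
```

Parameters without a documented range, such as the volume changes and `pitch_change_all`, pass through unchecked in this sketch.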

Version: 0a9c7c558af4 Updated: 2/26/2026 1.5M runs