zsxkib/realistic-voice-cloning
Create song covers with any RVC v2 trained AI voice from audio files.
Capabilities
Cost
Community model (estimated from hardware time)
Input Parameters
| Name | Type | Description | Default | Constraints |
|---|---|---|---|---|
backup_vocals_volume_change | number | Control volume of backup AI vocals. | 0 | — |
crepe_hop_length | integer | When `pitch_detection_algo` is set to `mangio-crepe`, this controls how often it checks for pitch changes in milliseconds. Lower values lead to longer conversions and higher risk of voice cracks, but better pitch accuracy. | 128 | — |
custom_rvc_model_download_url | string | URL to download a custom RVC model. If provided, the model will be downloaded (if it doesn't already exist) and used for prediction, regardless of the 'rvc_model' value. | — | — |
filter_radius | integer | If >=3: apply median filtering median filtering to the harvested pitch results. | 3 | min: 0, max: 7 |
index_rate | number | Control how much of the AI's accent to leave in the vocals. | 0.5 | min: 0, max: 1 |
instrumental_volume_change | number | Control volume of the background music/instrumentals. | 0 | — |
main_vocals_volume_change | number | Control volume of main AI vocals. Use -3 to decrease the volume by 3 decibels, or 3 to increase the volume by 3 decibels. | 0 | — |
output_format | string | wav for best quality and large file size, mp3 for decent quality and small file size. | "mp3" | mp3 wav |
pitch_change | string | Adjust pitch of AI vocals. Options: `no-change`, `male-to-female`, `female-to-male`. | "no-change" | no-change male-to-female female-to-male |
pitch_change_all | number | Change pitch/key of background music, backup vocals and AI vocals in semitones. Reduces sound quality slightly. | 0 | — |
pitch_detection_algorithm | string | Best option is rmvpe (clarity in vocals), then mangio-crepe (smoother vocals). | "rmvpe" | rmvpe mangio-crepe |
protect | number | Control how much of the original vocals' breath and voiceless consonants to leave in the AI vocals. Set 0.5 to disable. | 0.33 | min: 0, max: 0.5 |
reverb_damping | number | Absorption of high frequencies in the reverb. | 0.7 | min: 0, max: 1 |
reverb_dryness | number | Level of AI vocals without reverb. | 0.8 | min: 0, max: 1 |
reverb_size | number | The larger the room, the longer the reverb time. | 0.15 | min: 0, max: 1 |
reverb_wetness | number | Level of AI vocals with reverb. | 0.2 | min: 0, max: 1 |
rms_mix_rate | number | Control how much to use the original vocal's loudness (0) or a fixed loudness (1). | 0.25 | min: 0, max: 1 |
rvc_model | string | RVC model for a specific voice. If using a custom model, this should match the name of the downloaded model. If a 'custom_rvc_model_download_url' is provided, this will be automatically set to the name of the downloaded model. | "Squidward" | Squidward MrKrabs Plankton Drake Vader Trump Biden Obama Guitar Voilin CUSTOM |
song_input | string (uri) | Upload your audio file here. | — | — |
backup_vocals_volume_change number Control volume of backup AI vocals.
0 crepe_hop_length integer When `pitch_detection_algo` is set to `mangio-crepe`, this controls how often it checks for pitch changes in milliseconds. Lower values lead to longer conversions and higher risk of voice cracks, but better pitch accuracy.
128 custom_rvc_model_download_url string URL to download a custom RVC model. If provided, the model will be downloaded (if it doesn't already exist) and used for prediction, regardless of the 'rvc_model' value.
filter_radius integer If >=3: apply median filtering median filtering to the harvested pitch results.
3 min: 0, max: 7 index_rate number Control how much of the AI's accent to leave in the vocals.
0.5 min: 0, max: 1 instrumental_volume_change number Control volume of the background music/instrumentals.
0 main_vocals_volume_change number Control volume of main AI vocals. Use -3 to decrease the volume by 3 decibels, or 3 to increase the volume by 3 decibels.
0 output_format string wav for best quality and large file size, mp3 for decent quality and small file size.
"mp3" pitch_change string Adjust pitch of AI vocals. Options: `no-change`, `male-to-female`, `female-to-male`.
"no-change" pitch_change_all number Change pitch/key of background music, backup vocals and AI vocals in semitones. Reduces sound quality slightly.
0 pitch_detection_algorithm string Best option is rmvpe (clarity in vocals), then mangio-crepe (smoother vocals).
"rmvpe" protect number Control how much of the original vocals' breath and voiceless consonants to leave in the AI vocals. Set 0.5 to disable.
0.33 min: 0, max: 0.5 reverb_damping number Absorption of high frequencies in the reverb.
0.7 min: 0, max: 1 reverb_dryness number Level of AI vocals without reverb.
0.8 min: 0, max: 1 reverb_size number The larger the room, the longer the reverb time.
0.15 min: 0, max: 1 reverb_wetness number Level of AI vocals with reverb.
0.2 min: 0, max: 1 rms_mix_rate number Control how much to use the original vocal's loudness (0) or a fixed loudness (1).
0.25 min: 0, max: 1 rvc_model string RVC model for a specific voice. If using a custom model, this should match the name of the downloaded model. If a 'custom_rvc_model_download_url' is provided, this will be automatically set to the name of the downloaded model.
"Squidward" song_input string Upload your audio file here.
0a9c7c558af4 Updated: 2/26/2026 1.5M runs
cinemasetfree.com