← Back to all generators
minimax/voice-cloning
Official
View on Replicate →
Clone voices to use with Minimax's speech-02-hd and speech-02-turbo
Capabilities
No capability data available
Cost
Community model (estimated from hardware time)
Input Parameters
| Name | Type | Description | Default | Constraints |
|---|---|---|---|---|
voice_file * | string (uri) | Voice file to clone. Must be MP3, M4A, or WAV format, 10s to 5min duration, and less than 20MB. | — | — |
accuracy | number | Text validation accuracy threshold (0-1) | 0.7 | min: 0, max: 1 |
model | string | The text-to-speech model to train | "speech-02-turbo" | speech-2.6-turbo speech-2.6-hd speech-02-turbo speech-02-hd |
need_noise_reduction | boolean | Enable noise reduction. Use this if the voice file has background noise. | false | — |
need_volume_normalization | boolean | Enable volume normalization | false | — |
voice_file required string Voice file to clone. Must be MP3, M4A, or WAV format, 10s to 5min duration, and less than 20MB.
accuracy number Text validation accuracy threshold (0-1)
Default:
0.7 min: 0, max: 1 model string The text-to-speech model to train
Default:
"speech-02-turbo" speech-2.6-turbo speech-2.6-hd speech-02-turbo speech-02-hd
need_noise_reduction boolean Enable noise reduction. Use this if the voice file has background noise.
Default:
false need_volume_normalization boolean Enable volume normalization
Default:
false Version:
fff8a670880f Updated: 2/26/2026 44.7K runs
cinemasetfree.com