← Back to all generators

minimax/voice-cloning

Clone voices to use with Minimax's speech-02-hd and speech-02-turbo

Capabilities

No capability data available

Cost

Community model (estimated from hardware time)

Input Parameters

voice_file required string

Voice file to clone. Must be MP3, M4A, or WAV format, 10s to 5min duration, and less than 20MB.

accuracy number

Text validation accuracy threshold (0-1)

Default: 0.7 min: 0, max: 1
model string

The text-to-speech model to train

Default: "speech-02-turbo"
speech-2.6-turbo speech-2.6-hd speech-02-turbo speech-02-hd
need_noise_reduction boolean

Enable noise reduction. Use this if the voice file has background noise.

Default: false
need_volume_normalization boolean

Enable volume normalization

Default: false
Version: fff8a670880f Updated: 2/26/2026 44.7K runs