← Back to all generators
lucataco/xtts-v2
Official
View on Replicate →
Coqui XTTS-v2: Multilingual Text To Speech Voice Cloning
Capabilities
No capability data available
Cost
Community model (estimated from hardware time)
Input Parameters
| Name | Type | Description | Default | Constraints |
|---|---|---|---|---|
speaker * | string (uri) | Original speaker audio (wav, mp3, m4a, ogg, or flv) | — | — |
cleanup_voice | boolean | Whether to apply denoising to the speaker audio (microphone recordings) | false | — |
language | string | Output language for the synthesised speech | "en" | en es fr de it pt pl tr ru nl cs ar zh hu ko hi |
text | string | Text to synthesize | "Hi there, I'm your new voice clone. Try your best to upload quality audio" | — |
speaker required string Original speaker audio (wav, mp3, m4a, ogg, or flv)
cleanup_voice boolean Whether to apply denoising to the speaker audio (microphone recordings)
Default:
false language string Output language for the synthesised speech
Default:
"en" en es fr de it pt pl tr ru nl cs ar zh hu ko hi
text string Text to synthesize
Default:
"Hi there, I'm your new voice clone. Try your best to upload quality audio" Version:
684bc3855b37 Updated: 2/26/2026 5.0M runs
cinemasetfree.com