lucataco/xtts-v2

Coqui XTTS-v2: Multilingual Text To Speech Voice Cloning

Cost

Community model (cost estimated from hardware time)

Input Parameters

speaker string (required)

Original speaker audio (wav, mp3, m4a, ogg, or flv)

cleanup_voice boolean

Whether to apply denoising to the speaker audio (e.g. for microphone recordings)

Default: false

language string

Output language for the synthesized speech

Default: "en"
Allowed values: en, es, fr, de, it, pt, pl, tr, ru, nl, cs, ar, zh, hu, ko, hi

text string

Text to synthesize

Default: "Hi there, I'm your new voice clone. Try your best to upload quality audio"
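
The parameters above can be assembled into a single input payload. The following is a minimal sketch, not the hosting platform's client: `build_xtts_input` is a hypothetical helper name, and the file-extension check on `speaker` is an assumption based on the listed audio formats (the page only says the value is a string). Defaults and language codes are taken from the parameter list; no network call is made.

```python
from typing import Any

# Language codes listed for the `language` parameter.
SUPPORTED_LANGUAGES = (
    "en", "es", "fr", "de", "it", "pt", "pl", "tr",
    "ru", "nl", "cs", "ar", "zh", "hu", "ko", "hi",
)

# Audio formats listed for the `speaker` parameter.
SPEAKER_FORMATS = (".wav", ".mp3", ".m4a", ".ogg", ".flv")

# Default value documented for the `text` parameter.
DEFAULT_TEXT = (
    "Hi there, I'm your new voice clone. "
    "Try your best to upload quality audio"
)


def build_xtts_input(
    speaker: str,
    text: str = DEFAULT_TEXT,
    language: str = "en",
    cleanup_voice: bool = False,
) -> dict[str, Any]:
    """Hypothetical helper: validate and assemble the model's input parameters."""
    if not speaker:
        raise ValueError("speaker is required")

    # Assumption: `speaker` is a path or URL ending in one of the listed
    # formats. Any query string is ignored for the extension check.
    path = speaker.split("?", 1)[0].lower()
    if not path.endswith(SPEAKER_FORMATS):
        raise ValueError(f"speaker must be one of {SPEAKER_FORMATS}")

    if language not in SUPPORTED_LANGUAGES:
        raise ValueError(f"unsupported language: {language!r}")

    return {
        "speaker": speaker,
        "text": text,
        "language": language,
        "cleanup_voice": cleanup_voice,
    }


if __name__ == "__main__":
    # Example: clone a voice from a placeholder URL and speak French text.
    payload = build_xtts_input(
        "https://example.com/voice.wav",
        text="Bonjour tout le monde",
        language="fr",
        cleanup_voice=True,
    )
    print(payload)
```

The resulting dictionary would then be passed to whatever client the hosting platform provides; that call is not shown here because this page does not document it.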

Version: 684bc3855b37
Updated: 2/26/2026
Runs: 5.0M