resemble-ai/chatterbox-multilingual

Generate expressive, natural speech in 23 languages. Features instant voice cloning from short audio, emotion control, and seamless cross-language voice transfer.

Capabilities

Seed

Cost

Community model (estimated from hardware time)

Input Parameters

text required string

Text to synthesize into speech (maximum 300 characters)

cfg_weight number

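CFG/pace weight controlling generation guidance (0.2-1.0). Use 0.5 for balanced results. Lower values suit language transfer; the value of 0 originally suggested for that case falls below the listed minimum of 0.2.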

Default: 0.5, min: 0.2, max: 1

exaggeration number

Controls speech expressiveness (0.25-2.0; 0.5 is neutral). Extreme values may produce unstable output.

Default: 0.5, min: 0.25, max: 2

language string

Language for synthesis

Default: "en"

Allowed values: ar da de el en es fi fr he hi it ja ko ms nl no pl pt ru sv sw tr zh

reference_audio string

Reference audio file for voice cloning (optional). If not provided, uses default voice for the selected language.

seed integer

Random seed for reproducible results (0 for random generation)

Default: 0

temperature number

Controls randomness in generation (0.05-5.0, higher=more varied)

Default: 0.8, min: 0.05, max: 5
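
Example

The limits listed above can be checked on the client before a request is sent. The following is a minimal sketch, not an official client: build_input and split_text are hypothetical helper names, and how the resulting dictionary is submitted to the model is left out because this page does not describe the API call. The ranges, defaults, and language codes are copied from the parameter list.

```python
from typing import Any, Dict, List, Optional

# Values copied from the parameter list on this page.
LANGUAGES = {
    "ar", "da", "de", "el", "en", "es", "fi", "fr", "he", "hi", "it", "ja",
    "ko", "ms", "nl", "no", "pl", "pt", "ru", "sv", "sw", "tr", "zh",
}
MAX_TEXT_CHARS = 300
RANGES = {
    "cfg_weight": (0.2, 1.0),
    "exaggeration": (0.25, 2.0),
    "temperature": (0.05, 5.0),
}
DEFAULTS = {
    "cfg_weight": 0.5,
    "exaggeration": 0.5,
    "language": "en",
    "seed": 0,
    "temperature": 0.8,
}


def build_input(text: str, reference_audio: Optional[str] = None,
                **overrides: Any) -> Dict[str, Any]:
    """Hypothetical helper: merge overrides into the documented defaults
    and reject values outside the documented limits."""
    if not text:
        raise ValueError("text is required")
    if len(text) > MAX_TEXT_CHARS:
        raise ValueError(f"text exceeds {MAX_TEXT_CHARS} characters")
    unknown = set(overrides) - set(DEFAULTS)
    if unknown:
        raise ValueError(f"unknown parameters: {sorted(unknown)}")

    params = {**DEFAULTS, **overrides}
    if params["language"] not in LANGUAGES:
        raise ValueError(f"unsupported language: {params['language']!r}")
    for name, (low, high) in RANGES.items():
        if not low <= params[name] <= high:
            raise ValueError(f"{name} must be between {low} and {high}")
    if isinstance(params["seed"], bool) or not isinstance(params["seed"], int):
        raise ValueError("seed must be an integer")

    payload = {"text": text, **params}
    if reference_audio is not None:
        # Optional; when omitted the model uses the language's default voice.
        payload["reference_audio"] = reference_audio
    return payload


def split_text(text: str, limit: int = MAX_TEXT_CHARS) -> List[str]:
    """Greedy split on whitespace so each chunk fits the character limit."""
    chunks: List[str] = []
    current = ""
    for word in text.split():
        if len(word) > limit:
            raise ValueError("a single word exceeds the character limit")
        candidate = f"{current} {word}".strip()
        if len(candidate) <= limit:
            current = candidate
        else:
            chunks.append(current)
            current = word
    if current:
        chunks.append(current)
    return chunks
```

Longer passages can be passed through split_text first, with one request built per chunk. Reusing the same non-zero seed and reference_audio for every chunk is one plausible way to keep the voice consistent across requests, though this page does not state that it is guaranteed.
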
Version: 9cfba4c265e6 | Updated: 2/26/2026 | 24.1K runs