
resemble-ai/chatterbox-turbo

The fastest open-source TTS model without sacrificing quality.

Capabilities

Seed, Top-P

Cost

Community model (estimated from hardware time)

Input Parameters

text required string

Text to synthesize into speech (maximum 500 characters). You can include the following paralinguistic tags in the text: [clear throat], [sigh], [sush], [cough], [groan], [sniff], [gasp], [chuckle], [laugh]. Example: "Oh, that's hilarious! [chuckle] Let me tell you more."
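A client-side check can catch over-long text or misspelled tags before a request is sent. The sketch below is illustrative only: `check_text` is a hypothetical helper, the tag list is copied verbatim from the description above, and it assumes that tags count toward the 500-character limit, which the page does not state. How the service treats unknown tags is also not documented.

```python
import re

MAX_CHARS = 500  # limit stated in the parameter description

# Tag names copied verbatim from the description above.
SUPPORTED_TAGS = {
    "clear throat", "sigh", "sush", "cough", "groan",
    "sniff", "gasp", "chuckle", "laugh",
}


def check_text(text: str) -> list:
    """Return a list of problems found in `text`; an empty list means none."""
    problems = []
    if not text:
        problems.append("text is required")
    # Assumption: bracketed tags count toward the character limit.
    if len(text) > MAX_CHARS:
        problems.append(f"text is {len(text)} characters; the limit is {MAX_CHARS}")
    # Flag any [bracketed] token that is not in the supported set.
    for tag in re.findall(r"\[([^\[\]]+)\]", text):
        if tag not in SUPPORTED_TAGS:
            problems.append(f"unsupported tag: [{tag}]")
    return problems
```

For example, `check_text("Hi [yawn]")` reports the unknown tag, while the example sentence from the description passes with no problems.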

reference_audio string

Reference audio file for voice cloning (optional). Must be longer than 5 seconds. If provided, overrides the voice selection.

repetition_penalty number

Penalizes token repetition. Higher values reduce repetition.

Default: 1.2, min: 1, max: 2
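The page does not say how the penalty is applied internally. One widely used formulation rescales the logits of tokens that have already been generated: positive logits are divided by the penalty and negative logits are multiplied by it, so a value of 1 leaves everything unchanged. The sketch below shows that formulation on a toy list of logits; `apply_repetition_penalty` is a hypothetical name, and this is an assumption about the general technique, not a description of this model's code.

```python
def apply_repetition_penalty(logits, generated_ids, penalty=1.2):
    """Return a copy of `logits` with already-generated token ids made less likely."""
    out = list(logits)
    for token_id in set(generated_ids):
        if out[token_id] > 0:
            out[token_id] = out[token_id] / penalty  # shrink positive scores
        else:
            out[token_id] = out[token_id] * penalty  # push negative scores lower
    return out
```

With `penalty=2.0`, logits `[3.0, -1.0, 0.5]` and previously generated ids `[0, 1]`, the first two scores become `1.5` and `-2.0`, and the third is untouched.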
seed integer

Random seed for reproducible results. Leave blank for random generation.

temperature number

Controls randomness in generation. Higher values produce more varied speech.

Default: 0.8, min: 0.05, max: 2

top_k integer

Top-k sampling. Restricts sampling to the k most likely tokens at each step.

Default: 1000, min: 1, max: 2000

top_p number

Nucleus sampling threshold. Lower values make output more focused.

Default: 0.95, min: 0.5, max: 1
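To make the interaction between seed, temperature, top_k, and top_p concrete, here is a toy sampler over a small list of logits. It is a generic sketch of these techniques, not this model's implementation: `sample_token` is a hypothetical function, and the order of operations (temperature, then top-k, then top-p) is an assumption.

```python
import math
import random


def sample_token(logits, temperature=0.8, top_k=1000, top_p=0.95, seed=None):
    """Pick one token index from `logits` using temperature, top-k, and top-p."""
    # Temperature: values below 1 sharpen the distribution, values above 1 flatten it.
    scaled = [x / temperature for x in logits]
    peak = max(scaled)
    weights = [math.exp(x - peak) for x in scaled]  # numerically stable softmax
    total = sum(weights)
    # (probability, index) pairs, most likely first.
    ranked = sorted(((w / total, i) for i, w in enumerate(weights)), reverse=True)
    # Top-k: keep only the k most likely tokens.
    ranked = ranked[:top_k]
    # Top-p: keep the smallest prefix whose cumulative probability reaches top_p.
    kept, cumulative = [], 0.0
    for prob, index in ranked:
        kept.append((prob, index))
        cumulative += prob
        if cumulative >= top_p:
            break
    # A fixed seed makes the draw repeatable; None uses fresh randomness.
    rng = random.Random(seed)
    indices = [index for _, index in kept]
    probs = [prob for prob, _ in kept]
    return rng.choices(indices, weights=probs, k=1)[0]
```

Setting `top_k=1` always returns the most likely token, and calling the function twice with the same `seed` and arguments returns the same index, which mirrors the reproducibility the seed parameter describes.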
voice string

Pre-made voice to use for synthesis. Ignored if reference_audio is provided.

Default: "Andy"
Options: Aaron, Abigail, Anaya, Andy, Archer, Brian, Chloe, Dylan, Emmanuel, Ethan, Evelyn, Gavin, Gordon, Ivan, Laura, Lucy, Madison, Marisol, Meera, Walter
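The parameters above can be gathered into a single input payload. The sketch below validates the documented ranges and applies the documented precedence rule (a reference audio overrides the voice). `build_input` is a hypothetical helper; the page does not say how the payload is submitted or in what form `reference_audio` must be supplied (URL, path, or upload), so it is treated here as an opaque string. Dropping `voice` when a reference audio is present is a design choice, since the page says it would be ignored anyway.

```python
VOICES = {
    "Aaron", "Abigail", "Anaya", "Andy", "Archer", "Brian", "Chloe", "Dylan",
    "Emmanuel", "Ethan", "Evelyn", "Gavin", "Gordon", "Ivan", "Laura", "Lucy",
    "Madison", "Marisol", "Meera", "Walter",
}

# (min, max) bounds as listed for each numeric parameter.
RANGES = {
    "repetition_penalty": (1, 2),
    "temperature": (0.05, 2),
    "top_k": (1, 2000),
    "top_p": (0.5, 1),
}


def build_input(text, reference_audio=None, voice="Andy", seed=None, **sampling):
    """Return a dict of input parameters, raising ValueError on invalid values."""
    if not text or len(text) > 500:
        raise ValueError("text is required and limited to 500 characters")
    payload = {"text": text}
    if reference_audio is not None:
        # A reference audio overrides the voice, so the voice is left out.
        payload["reference_audio"] = reference_audio
    else:
        if voice not in VOICES:
            raise ValueError(f"unknown voice: {voice}")
        payload["voice"] = voice
    if seed is not None:
        payload["seed"] = int(seed)  # omit the seed for random generation
    for name, value in sampling.items():
        if name not in RANGES:
            raise ValueError(f"unknown parameter: {name}")
        low, high = RANGES[name]
        if not low <= value <= high:
            raise ValueError(f"{name} must be between {low} and {high}")
        payload[name] = value
    return payload
```

Parameters that are not passed are left out of the payload, so the service's own defaults apply. For instance, `build_input("Hello", temperature=0.5, seed=42)` yields a payload with `text`, `voice`, `seed`, and `temperature` only.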
Version: 95c87b883ff3 | Updated: 2/26/2026 | 138.4K runs