google/gemini-3.1-flash-tts
Official
Google's fast, expressive text-to-speech model with 30 voices and support for 70+ languages
Capabilities
No capability data available
Cost
Community model (estimated from hardware time)
Input Parameters
| Name | Type | Description | Default | Constraints |
|---|---|---|---|---|
| text (required) | string | The text to convert to speech. Supports markup tags like [sigh], [laughing], [whispering], [shouting], [extremely fast] for expressive delivery. Maximum 4,000 bytes. | — | — |
| language_code | string | Language for the speech output | "en-US" | af-ZA am-ET ar-001 ar-EG az-AZ be-BY bg-BG bn-BD ca-ES ceb-PH cmn-CN cmn-tw cs-CZ da-DK de-DE el-GR en-AU en-GB en-IN en-US es-419 es-ES es-MX et-EE eu-ES fa-IR fi-FI fil-PH fr-CA fr-FR gl-ES gu-IN he-IL hi-IN hr-HR ht-HT hu-HU hy-AM id-ID is-IS it-IT ja-JP jv-JV ka-GE kn-IN ko-KR kok-IN la-VA lb-LU lo-LA lt-LT lv-LV mai-IN mg-MG mk-MK ml-IN mn-MN mr-IN ms-MY my-MM nb-NO ne-NP nl-NL nn-NO or-IN pa-IN pl-PL ps-AF pt-BR pt-PT ro-RO ru-RU sd-IN si-LK sk-SK sl-SI sq-AL sr-RS sv-SE sw-KE ta-IN te-IN th-TH tr-TR uk-UA ur-PK vi-VN |
| prompt | string | Style instructions to control how the text is spoken. Use natural language to describe the desired tone, pace, accent, and emotion. For example: "Say this in a calm, professional tone" or "Speak with excitement and energy". Maximum 4,000 bytes. | "Say the following." | — |
| voice | string | Voice to use for speech generation | "Kore" | Achernar Achird Algenib Algieba Alnilam Aoede Autonoe Callirrhoe Charon Despina Enceladus Erinome Fenrir Gacrux Iapetus Kore Laomedeia Leda Orus Pulcherrima Puck Rasalgethi Sadachbia Sadaltager Schedar Sulafat Umbriel Vindemiatrix Zephyr Zubenelgenubi |
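The parameters above can be assembled into an input payload before calling the model. The following is a minimal sketch, not an official client: `build_tts_input` is a hypothetical helper, and it assumes the 4,000-byte limits are measured on the UTF-8 encoding of each string, which the listing does not state. It checks `text`, `prompt`, and `voice`; it does not check `language_code` against the supported list.

```python
# Hypothetical helper for assembling inputs to google/gemini-3.1-flash-tts.
# Names and validation rules here are illustrative assumptions based on the
# parameter table, not part of any official SDK.

MODEL = "google/gemini-3.1-flash-tts"
MAX_BYTES = 4000  # assumed to apply to the UTF-8 encoded string

VOICES = frozenset(
    """Achernar Achird Algenib Algieba Alnilam Aoede Autonoe Callirrhoe Charon
    Despina Enceladus Erinome Fenrir Gacrux Iapetus Kore Laomedeia Leda Orus
    Pulcherrima Puck Rasalgethi Sadachbia Sadaltager Schedar Sulafat Umbriel
    Vindemiatrix Zephyr Zubenelgenubi""".split()
)


def build_tts_input(text, language_code="en-US",
                    prompt="Say the following.", voice="Kore"):
    """Return an input dict using the defaults listed in the table."""
    if not text:
        raise ValueError("text is required")
    # Both text and prompt are capped at 4,000 bytes in the listing.
    for name, value in (("text", text), ("prompt", prompt)):
        size = len(value.encode("utf-8"))
        if size > MAX_BYTES:
            raise ValueError(f"{name} is {size} bytes; the limit is {MAX_BYTES}")
    if voice not in VOICES:
        raise ValueError(f"unknown voice: {voice!r}")
    return {
        "text": text,
        "language_code": language_code,
        "prompt": prompt,
        "voice": voice,
    }


# Markup tags such as [whispering] go inline in the text itself.
payload = build_tts_input(
    "[whispering] The show starts in five minutes.",
    prompt="Say this in a calm, professional tone",
    voice="Puck",
)
```

With the third-party `replicate` Python package installed and an API token configured, the payload could then be passed as `replicate.run(MODEL, input=payload)`. The shape of the returned audio output is not described in this listing, so treat that call as a starting point rather than a complete recipe.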
Version: e165f56d71b9
Updated: 6/26/2026
Runs: 205.2K
cinemasetfree.com