openai/gpt-4o-mini-transcribe
Official
A speech-to-text model that uses GPT-4o mini to transcribe audio
Capabilities
No capability data available
Cost
Community model (estimated from hardware time)
Input Parameters
| Name | Type | Description | Default | Constraints |
|---|---|---|---|---|
| audio_file * | string (uri) | The audio file to transcribe. Supported formats: mp3, mp4, mpeg, mpga, m4a, ogg, wav, or webm. | — | — |
| language | string | The language of the input audio. Supplying the input language in ISO-639-1 format (e.g. en) will improve accuracy and latency. | — | — |
| prompt | string | Optional text to guide the model's style or continue a previous audio segment. The prompt should match the audio language. | — | — |
| temperature | number | Sampling temperature between 0 and 1. | 0 | min: 0, max: 1 |
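The parameters in the table above can be assembled into an input payload before a request is sent. The following is a minimal sketch, not the model's own validation logic: the helper name `build_transcribe_input` is hypothetical, the extension check is a heuristic based on the URI path (the model may accept files this check rejects), and the two-letter test is only an approximation of ISO-639-1.

```python
from pathlib import PurePosixPath
from typing import Any, Dict, Optional
from urllib.parse import urlparse

# Formats listed in the parameter table for audio_file.
SUPPORTED_FORMATS = {"mp3", "mp4", "mpeg", "mpga", "m4a", "ogg", "wav", "webm"}


def build_transcribe_input(
    audio_file: str,
    language: Optional[str] = None,
    prompt: Optional[str] = None,
    temperature: float = 0.0,
) -> Dict[str, Any]:
    """Hypothetical helper: build an input dict matching the parameter table."""
    # Heuristic: infer the format from the extension of the URI path.
    suffix = PurePosixPath(urlparse(audio_file).path).suffix.lstrip(".").lower()
    if suffix not in SUPPORTED_FORMATS:
        raise ValueError(f"unsupported audio format: {suffix or '(none)'}")

    # The table constrains temperature to the range [0, 1], default 0.
    if not 0 <= temperature <= 1:
        raise ValueError("temperature must be between 0 and 1")

    payload: Dict[str, Any] = {"audio_file": audio_file, "temperature": temperature}

    if language is not None:
        # Approximation of ISO-639-1: exactly two alphabetic characters.
        if len(language) != 2 or not language.isalpha():
            raise ValueError("language should be a two-letter ISO-639-1 code")
        payload["language"] = language.lower()

    if prompt is not None:
        payload["prompt"] = prompt

    return payload


if __name__ == "__main__":
    # example.com is a placeholder URI, not a real audio file.
    print(build_transcribe_input("https://example.com/clip.mp3", language="en"))
```

Assuming the `replicate` Python client is installed and an API token is configured, the resulting dict could then be passed as `replicate.run("openai/gpt-4o-mini-transcribe", input=payload)`. Optional fields are omitted rather than sent as empty values, so the model's defaults apply.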
Version: 684265b6c4d2
Updated: 2/26/2026
Runs: 11.4K