openai/gpt-4o-transcribe

A speech-to-text model that uses GPT-4o to transcribe audio

Name	Type	Description	Default	Constraints
`audio_file`*	string(uri)	The audio file to transcribe. Supported formats: mp3, mp4, mpeg, mpga, m4a, ogg, wav, or webm	`—`	—
`language`	string	The language of the input audio. Supplying it in ISO 639-1 format (e.g. `en`) improves accuracy and latency.	`—`	—
`prompt`	string	Optional text to guide the model's style or to continue a previous audio segment. The prompt should be in the same language as the audio.	`—`	—
`temperature`	number	Sampling temperature between 0 and 1	`0`	min: 0, max: 1
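
The parameters in the table can be assembled into an input payload before a request is made. The following is a minimal sketch, not the model's official client: `build_input` and `SUPPORTED_FORMATS` are hypothetical names introduced here, and the file-extension check is an assumed client-side heuristic (the page does not say how the format is detected). Only the parameter names, the format list, and the temperature range come from the table.

```python
import os
from typing import Optional
from urllib.parse import urlparse

# Formats listed in the `audio_file` description.
SUPPORTED_FORMATS = {"mp3", "mp4", "mpeg", "mpga", "m4a", "ogg", "wav", "webm"}


def build_input(
    audio_file: str,
    language: Optional[str] = None,
    prompt: Optional[str] = None,
    temperature: float = 0.0,
) -> dict:
    """Build an input dict using the parameter names from the table (hypothetical helper)."""
    # Assumption: the format is inferred from the URI's file extension.
    # URIs without an extension are passed through unchecked.
    ext = os.path.splitext(urlparse(audio_file).path)[1].lstrip(".").lower()
    if ext and ext not in SUPPORTED_FORMATS:
        raise ValueError(f"unsupported audio format: {ext}")

    # Constraint from the table: min 0, max 1.
    if not 0 <= temperature <= 1:
        raise ValueError("temperature must be between 0 and 1")

    payload = {"audio_file": audio_file, "temperature": temperature}
    # Optional parameters are included only when supplied.
    if language is not None:
        payload["language"] = language
    if prompt is not None:
        payload["prompt"] = prompt
    return payload


example = build_input("https://example.com/meeting.mp3", language="en")
```

If the model is served through a hosting platform with a Python client, the resulting dict would typically be passed as the request input, for example `replicate.run("openai/gpt-4o-transcribe", input=example)` with Replicate's client. Which platform hosts this page's model is an assumption here, since the page does not name it.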

Version: cc7638666fc8
Updated: 7/25/2026
35.8K runs