openai/gpt-4o-transcribe

A speech-to-text model that uses GPT-4o to transcribe audio

Capabilities

No capability data available

Cost

Community model; cost is estimated from hardware time.

Input Parameters

audio_file required string

The audio file to transcribe. Supported formats: mp3, mp4, mpeg, mpga, m4a, ogg, wav, and webm.

language string

The language of the input audio. Supplying it as an ISO-639-1 code (e.g. en) improves accuracy and latency.

prompt string

Optional text to guide the model's style or to continue a previous audio segment. The prompt should be in the same language as the audio.

temperature number

Sampling temperature between 0 and 1

Default: 0, min: 0, max: 1

Version: cc7638666fc8, Updated: 2/26/2026, 35.8K runs
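
The input parameters above can be checked before a request is sent. The following is a minimal sketch, not part of the model's own tooling: the helper name `build_transcribe_input` is hypothetical, it assumes `audio_file` is a path or URL whose extension reflects the format, and the language check only verifies the two-letter shape of an ISO-639-1 code rather than looking it up in the registry.

```python
from pathlib import Path
from typing import Any, Dict, Optional

# Formats listed in the audio_file description.
SUPPORTED_FORMATS = {"mp3", "mp4", "mpeg", "mpga", "m4a", "ogg", "wav", "webm"}


def build_transcribe_input(
    audio_file: str,
    language: Optional[str] = None,
    prompt: Optional[str] = None,
    temperature: float = 0.0,
) -> Dict[str, Any]:
    """Hypothetical helper: validate and assemble the documented inputs."""
    # Assumption: the format can be read from the file extension.
    # Drop any query string so URLs like ".../clip.mp3?token=x" still match.
    suffix = Path(audio_file.split("?", 1)[0]).suffix.lower().lstrip(".")
    if suffix not in SUPPORTED_FORMATS:
        raise ValueError(f"unsupported audio format: {suffix!r}")

    if language is not None:
        # Shape check only: ISO-639-1 codes are two ASCII letters.
        if len(language) != 2 or not (language.isascii() and language.isalpha()):
            raise ValueError(f"expected a two-letter ISO-639-1 code, got {language!r}")
        language = language.lower()

    # The documented range is 0 to 1 inclusive, with a default of 0.
    if not 0 <= temperature <= 1:
        raise ValueError(f"temperature must be between 0 and 1, got {temperature}")

    payload: Dict[str, Any] = {"audio_file": audio_file, "temperature": temperature}
    # Optional parameters are included only when supplied.
    if language is not None:
        payload["language"] = language
    if prompt:
        payload["prompt"] = prompt
    return payload
```

The returned dictionary uses the parameter names from this page as keys, so it could be passed as the input to whichever client is used to run the model. That step is left out here because the page does not specify one.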