
rafaelgalle/whisper-diarization-advanced

Ultra-fast, customizable speech-to-text and speaker diarization for noisy, multi-speaker audio. Includes advanced noise reduction, stereo channel support, and flexible audio preprocessing. Suited to call centers, meetings, and podcasts.

Cost

Community model (estimated from hardware time)

Input Parameters

file_path string

Audio file

file_string string

Base64 audio

file_url string

Direct URL

highpass_freq integer

Default: 45

language string

Language code like 'en', 'pt'

lowpass_freq integer

Default: 8000

num_speakers integer

Leave empty to autodetect

min: 1, max: 50

preprocess integer

0=None, 1=Sanitize, 2=+Filter, 3=+ReduceNoise, 4=+Normalize

Default: 0, min: 0, max: 4

prompt string

Names/acronyms, separated by punctuation

prop_decrease number

Default: 0.3

stationary boolean

Default: true

target_dBFS number

Default: -18

translate boolean

Translate to English

Default: false
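
The parameters above can be assembled into an input payload along these lines. This is a minimal sketch, not the model's own code: the helper names (preprocess_steps, encode_audio, build_input) are hypothetical, the reading of the preprocess levels as cumulative is inferred from the "+" prefixes in the description, and the rule that exactly one audio source is supplied is an assumption the listing does not state.

```python
import base64

# Step names are copied from the preprocess description. Treating the levels
# as cumulative is an assumption based on the "+" prefixes.
PREPROCESS_STEPS = ["Sanitize", "Filter", "ReduceNoise", "Normalize"]


def preprocess_steps(level):
    """Return the steps assumed to run at a given preprocess level (0-4)."""
    if not 0 <= level <= 4:
        raise ValueError("preprocess must be between 0 and 4")
    return PREPROCESS_STEPS[:level]


def encode_audio(raw_bytes):
    """Base64-encode raw audio bytes for the file_string parameter."""
    return base64.b64encode(raw_bytes).decode("ascii")


def build_input(file_url=None, file_string=None, file_path=None,
                num_speakers=None, preprocess=0, **extra):
    """Assemble an input dict and check the documented ranges."""
    sources = {
        "file_url": file_url,
        "file_string": file_string,
        "file_path": file_path,
    }
    given = {name: value for name, value in sources.items() if value is not None}
    if len(given) != 1:
        # Assumption: the model expects exactly one audio source.
        raise ValueError("provide exactly one of file_url, file_string, file_path")

    preprocess_steps(preprocess)  # raises if the level is outside 0-4

    if num_speakers is not None and not 1 <= num_speakers <= 50:
        raise ValueError("num_speakers must be between 1 and 50")

    payload = dict(given)
    payload["preprocess"] = preprocess
    if num_speakers is not None:
        # Omitting the key corresponds to "leave empty to autodetect".
        payload["num_speakers"] = num_speakers
    payload.update(extra)  # e.g. language, prompt, translate
    return payload
```

For example, build_input(file_url="https://example.com/call.wav", num_speakers=2, preprocess=2, language="en") returns a dict with those four keys (the URL is a placeholder), and preprocess_steps(2) returns ["Sanitize", "Filter"] under the cumulative reading. How the resulting dict is submitted depends on the hosting platform's client, which is not covered here.
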

Version: 56dcb55b658e
Updated: 2/26/2026
387.3K runs