openai/whisper

Convert speech in audio to text

Input

Audio file

Choose the format for the transcription

Translate the text to English when set to True

Language spoken in the audio; specify 'auto' for automatic language detection

temperature to use for sampling

optional patience value to use in beam decoding, as in https://arxiv.org/abs/2204.05424; the default (1.0) is equivalent to conventional beam search

comma-separated list of token ids to suppress during sampling; '-1' will suppress most special characters except common punctuation
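
As a rough illustration of the expected input format, the comma-separated string could be turned into a list of integer ids like this. The function name `parse_suppress_tokens` is hypothetical and not part of the model's interface; how the model interprets '-1' internally is not shown here.

```python
def parse_suppress_tokens(value: str) -> list[int]:
    """Parse a comma-separated string of token ids, e.g. "-1" or "11, 13, 250".

    Hypothetical helper: empty entries are skipped, and whitespace around
    each id is tolerated because int() ignores it.
    """
    return [int(part) for part in value.split(",") if part.strip()]


if __name__ == "__main__":
    print(parse_suppress_tokens("-1"))
    print(parse_suppress_tokens("11, 13, 250"))
```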

optional text to provide as a prompt for the first window.

if True, provide the previous output of the model as a prompt for the next window; disabling may make the text inconsistent across windows, but the model becomes less prone to getting stuck in a failure loop

amount by which to increase the temperature on each fallback, when the decoding fails to meet either of the thresholds below
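
One way to picture the fallback is as a list of temperatures to try in order, starting from the sampling temperature and stepping up by the increment. The sketch below assumes the schedule stops at a ceiling of 1.0; that ceiling, the function name, and the example values are assumptions for illustration, not something this page states.

```python
def temperature_schedule(start: float, increment: float, maximum: float = 1.0) -> list[float]:
    """Build the ordered list of temperatures to try during fallback.

    Assumption: temperatures never exceed `maximum` (1.0 here). A
    non-positive increment disables fallback, leaving only `start`.
    """
    if increment <= 0:
        return [start]
    temps = []
    step = 0
    # The small epsilon keeps floating-point error from dropping the last step.
    while start + step * increment <= maximum + 1e-6:
        temps.append(round(start + step * increment, 6))
        step += 1
    return temps or [start]


if __name__ == "__main__":
    print(temperature_schedule(0.0, 0.2))
```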

if the gzip compression ratio is higher than this value, treat the decoding as failed
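
A compression ratio is a cheap signal for repetitive output: text that repeats itself compresses very well, so its ratio is high. The sketch below uses Python's `zlib` module (DEFLATE, the same algorithm gzip uses) as a stand-in; the exact compressor used by the model and the threshold value in the example are assumptions.

```python
import zlib


def compression_ratio(text: str) -> float:
    """Ratio of raw UTF-8 size to DEFLATE-compressed size.

    Highly repetitive text yields a large ratio; ordinary prose stays low.
    """
    data = text.encode("utf-8")
    return len(data) / len(zlib.compress(data))


if __name__ == "__main__":
    threshold = 2.4  # illustrative value, not taken from this page
    looping = "la " * 200
    normal = "The quick brown fox jumps over the lazy dog."
    for sample in (looping, normal):
        ratio = compression_ratio(sample)
        print(f"{ratio:.2f}", "failed" if ratio > threshold else "ok")
```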

if the average log probability is lower than this value, treat the decoding as failed

if the probability of the <|nospeech|> token is higher than this value AND the decoding has failed due to `logprob_threshold`, consider the segment as silence
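
The three thresholds above could interact roughly as follows: retry at successively higher temperatures until a decoding passes both the compression-ratio and log-probability checks, then treat the segment as silence only if the no-speech probability is high and the log-probability check failed. This is a sketch under stated assumptions, not the model's actual implementation; `Decoded`, `decode_with_fallback`, `is_silence`, and the `decode` callback are all hypothetical names.

```python
from dataclasses import dataclass
from typing import Callable, Sequence


@dataclass
class Decoded:
    """Hypothetical container for one decoding attempt."""
    text: str
    avg_logprob: float
    compression_ratio: float
    no_speech_prob: float


def decode_with_fallback(
    decode: Callable[[float], Decoded],
    temperatures: Sequence[float],
    compression_ratio_threshold: float,
    logprob_threshold: float,
) -> Decoded:
    """Try each temperature in order; stop at the first attempt that passes."""
    if not temperatures:
        raise ValueError("at least one temperature is required")
    result = None
    for temperature in temperatures:
        result = decode(temperature)
        failed = (
            result.compression_ratio > compression_ratio_threshold
            or result.avg_logprob < logprob_threshold
        )
        if not failed:
            break
    # If every attempt failed, the last (highest-temperature) result is returned.
    return result


def is_silence(result: Decoded, no_speech_threshold: float, logprob_threshold: float) -> bool:
    """Silence requires BOTH a high no-speech probability and a failed logprob check."""
    return (
        result.no_speech_prob > no_speech_threshold
        and result.avg_logprob < logprob_threshold
    )
```

Passing the decoder in as a callback keeps the sketch independent of any particular model API; in practice `decode` would wrap whatever call produces a transcription at a given temperature.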
