Installing Whisper
-
wrote on Dec 27, 2024, 6:01 PM last edited by
Out of curiosity because a quick search didn't yield the results I was looking for: Is it possible to install Whisper in OpenWebUI? Doesn't seem so.
-
wrote on Dec 28, 2024, 12:25 AM last edited by
What do you mean by installing whisper?
-
wrote on Dec 28, 2024, 8:27 AM last edited by
@firmansi Running the OpenAI Whisper model for transcription purposes
-
wrote on Dec 28, 2024, 8:28 AM last edited by
You mean from voice to text? If that you mean then you can enable it through Admin Settting > Audio
-
You mean from voice to text? If that you mean then you can enable it through Admin Settting > Audio
wrote on Dec 28, 2024, 8:35 AM last edited by andreasdueren Dec 28, 2024, 8:45 AM@firmansi Hmm I see. But how do I select that then in the interface? I.e. I have an audio file. What are the steps in OpenWebUI to get the transcription from said file.
-
wrote on Dec 28, 2024, 8:46 AM last edited by
Steps will be
- Choose OpenAI STT setting
- Fill openAI API
- Fill API Key
- Fill STT Model with whisper-large-v3 or any other that you prefer
-
Steps will be
- Choose OpenAI STT setting
- Fill openAI API
- Fill API Key
- Fill STT Model with whisper-large-v3 or any other that you prefer
wrote on Dec 28, 2024, 8:48 AM last edited by@firmansi said in Installing Whisper:
Steps will be
- Choose OpenAI STT setting
- Fill openAI API
- Fill API Key
- Fill STT Model with whisper-large-v3 or any other that you prefer
Yes I#ve gotten that far. But when you select a new chat, I can not select the whisper models.
-
wrote on Dec 28, 2024, 11:55 AM last edited by
Seams like I'm not the only one struggling to understand the whisper implementation: https://github.com/open-webui/open-webui/issues/2248
-
wrote on Dec 28, 2024, 11:30 PM last edited by robi Dec 28, 2024, 11:31 PM
Do you have a GPU for this? Otherwise it will be very slow.
There are speedups that have been made with JAX, but that still runs on GPUs.
Once you get it working, let me know how long it takes for x minutes of audio.
-
Do you have a GPU for this? Otherwise it will be very slow.
There are speedups that have been made with JAX, but that still runs on GPUs.
Once you get it working, let me know how long it takes for x minutes of audio.
wrote on Dec 29, 2024, 11:08 AM last edited by@robi Nope, just CPU. But seems to be very manageable. Faster Whisper says in the repo that a 13 minute on 8 threads Intel Core i7-12700K CPU takes roughly a minute.
-
wrote on Jan 14, 2025, 1:13 PM last edited by
BTW it seems to automatically recognize any audio file you throw at it. Either via Microphone or file upload and process TTS locally before passing it on to the selected model. Logs and graphs are pretty clear about this. Documentation however is not lol.