Cloudron makes it easy to run web apps like WordPress, Nextcloud, GitLab on your server. Find out more or install now.


Skip to content
  • Categories
  • Recent
  • Tags
  • Popular
  • Bookmarks
  • Search
Skins
  • Light
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (No Skin)
  • No Skin
Collapse
Brand Logo

Cloudron Forum

Apps | Demo | Docs | Install
  1. Cloudron Forum
  2. OpenWebUI
  3. Installing Whisper

Installing Whisper

Scheduled Pinned Locked Moved OpenWebUI
11 Posts 3 Posters 1.3k Views 3 Watching
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • andreasduerenA Offline
    andreasduerenA Offline
    andreasdueren
    wrote on last edited by
    #1

    Out of curiosity because a quick search didn't yield the results I was looking for: Is it possible to install Whisper in OpenWebUI? Doesn't seem so.

    1 Reply Last reply
    0
    • firmansiF Online
      firmansiF Online
      firmansi
      wrote on last edited by
      #2

      What do you mean by installing whisper?

      andreasduerenA 1 Reply Last reply
      0
      • firmansiF firmansi

        What do you mean by installing whisper?

        andreasduerenA Offline
        andreasduerenA Offline
        andreasdueren
        wrote on last edited by
        #3

        @firmansi Running the OpenAI Whisper model for transcription purposes

        1 Reply Last reply
        0
        • firmansiF Online
          firmansiF Online
          firmansi
          wrote on last edited by
          #4

          You mean from voice to text? If that you mean then you can enable it through Admin Settting > Audio

          andreasduerenA 1 Reply Last reply
          0
          • firmansiF firmansi

            You mean from voice to text? If that you mean then you can enable it through Admin Settting > Audio

            andreasduerenA Offline
            andreasduerenA Offline
            andreasdueren
            wrote on last edited by andreasdueren
            #5

            @firmansi Hmm I see. But how do I select that then in the interface? I.e. I have an audio file. What are the steps in OpenWebUI to get the transcription from said file.

            1 Reply Last reply
            0
            • firmansiF Online
              firmansiF Online
              firmansi
              wrote on last edited by
              #6

              Steps will be

              1. Choose OpenAI STT setting
              2. Fill openAI API
              3. Fill API Key
              4. Fill STT Model with whisper-large-v3 or any other that you prefer
              andreasduerenA 1 Reply Last reply
              1
              • firmansiF firmansi

                Steps will be

                1. Choose OpenAI STT setting
                2. Fill openAI API
                3. Fill API Key
                4. Fill STT Model with whisper-large-v3 or any other that you prefer
                andreasduerenA Offline
                andreasduerenA Offline
                andreasdueren
                wrote on last edited by
                #7

                @firmansi said in Installing Whisper:

                Steps will be

                1. Choose OpenAI STT setting
                2. Fill openAI API
                3. Fill API Key
                4. Fill STT Model with whisper-large-v3 or any other that you prefer

                Screenshot 2024-12-28 at 09.47.29.png

                Yes I#ve gotten that far. But when you select a new chat, I can not select the whisper models.

                1 Reply Last reply
                0
                • andreasduerenA Offline
                  andreasduerenA Offline
                  andreasdueren
                  wrote on last edited by
                  #8

                  Seams like I'm not the only one struggling to understand the whisper implementation: https://github.com/open-webui/open-webui/issues/2248

                  1 Reply Last reply
                  1
                  • robiR Offline
                    robiR Offline
                    robi
                    wrote on last edited by robi
                    #9

                    Do you have a GPU for this? Otherwise it will be very slow.

                    There are speedups that have been made with JAX, but that still runs on GPUs.

                    Once you get it working, let me know how long it takes for x minutes of audio.

                    Conscious tech

                    andreasduerenA 1 Reply Last reply
                    0
                    • robiR robi

                      Do you have a GPU for this? Otherwise it will be very slow.

                      There are speedups that have been made with JAX, but that still runs on GPUs.

                      Once you get it working, let me know how long it takes for x minutes of audio.

                      andreasduerenA Offline
                      andreasduerenA Offline
                      andreasdueren
                      wrote on last edited by
                      #10

                      @robi Nope, just CPU. But seems to be very manageable. Faster Whisper says in the repo that a 13 minute on 8 threads Intel Core i7-12700K CPU takes roughly a minute.

                      1 Reply Last reply
                      1
                      • andreasduerenA Offline
                        andreasduerenA Offline
                        andreasdueren
                        wrote on last edited by
                        #11

                        BTW it seems to automatically recognize any audio file you throw at it. Either via Microphone or file upload and process TTS locally before passing it on to the selected model. Logs and graphs are pretty clear about this. Documentation however is not lol.

                        1 Reply Last reply
                        1
                        Reply
                        • Reply as topic
                        Log in to reply
                        • Oldest to Newest
                        • Newest to Oldest
                        • Most Votes


                        • Login

                        • Don't have an account? Register

                        • Login or register to search.
                        • First post
                          Last post
                        0
                        • Categories
                        • Recent
                        • Tags
                        • Popular
                        • Bookmarks
                        • Search