Cloudron makes it easy to run web apps like WordPress, Nextcloud, GitLab on your server. Find out more or install now.


Skip to content
  • Categories
  • Recent
  • Tags
  • Popular
  • Bookmarks
  • Search
Skins
  • Light
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (No Skin)
  • No Skin
Collapse
Brand Logo

Cloudron Forum

Apps | Demo | Docs | Install
  1. Cloudron Forum
  2. Discuss
  3. AI on Cloudron

AI on Cloudron

Scheduled Pinned Locked Moved Discuss
a.i
250 Posts 15 Posters 92.9k Views 18 Watching
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • micmcM micmc

    Much ChatGPT competition is quickly getting into the AI race.

    Llama 2: The New Open LLM SOTA
    https://www.latent.space/p/llama2

    L Offline
    L Offline
    LoudLemur
    wrote on last edited by LoudLemur
    #53

    @micmc said in AI on Cloudron:

    SOTA

    Thanks, @micmc!

    SOTA (State Of The Art)

    People can play with Lama2 here:
    https://huggingface.co/spaces/ysharma/Explore_llamav2_with_TGI

    What I notice is that the response time between question and answer is split second. I would like to know why! If I try and run this locally, it takes AI half a minute or longer to get going, and then an age to s-l-o-w-l-y print out its reply.

    Is it a hardware issue? What would be needed to run that locally at similar speed to the hugging face demo?

    What I am hoping, is that it is not 8x Nvidia A100s

    brave_CnzBuWBnS6.png

    brave_iGWblwAmOM.png

    1 Reply Last reply
    1
    • micmcM Offline
      micmcM Offline
      micmc
      wrote on last edited by
      #54

      I guess you've come to realize the GPU power required to run a decent ML model 🙂

      One might want to experience on a lower scale there already is models that work with consumer grade hardware. That you can install on your local PC, and since it runs on Docker theoretically it should run on a VPS as well as maybe even Cloudron then. I've not yet tested this on a VPS but I'm working on it.

      Check out https://localai.io/ which does NOT require GPU and uses several GPT models as well, like llama.cpp, gpt4all.cpp and whisper.cpp and more... This site is very interesting for AI enthusiasts who want to learn more and tests things deeper, it's fascinating.

      Enjoy!

      Ignorance is not an excuse anymore!
      https://AutomateKit.com

      L 1 Reply Last reply
      2
      • micmcM Offline
        micmcM Offline
        micmc
        wrote on last edited by
        #55

        I'm not sure if anyone here, power users, has realized that there already is AI tools and modules being added to Cloudron via several app carried and maintained by our fearless developers team here.

        First I discover is in Joplin as I'm a Joplin power user (this app is amazing) and there exists several plugins to add to enhance the app and one of them is called Jarvis and offer SEVERAL AI features to use right inside your notes everywhere.

        Then recently in the latest major release of NextCloud they've added an OpenAI integration module.

        And just now, in a major release of ChatWoot: OpenAI integration : ( Reply suggestions, summarization, and ability to improve drafts ). This is just a beginning this app was already very much powerful and getting more and more almost by the day.

        I guess, RocketChat and MatterMost* might quickly follow if it's not already done. (?)

        Roughly, I know it can also be integrated into LibreOffice somehow.

        What else, anyone?

        Ignorance is not an excuse anymore!
        https://AutomateKit.com

        L 2 Replies Last reply
        3
        • micmcM micmc

          I'm not sure if anyone here, power users, has realized that there already is AI tools and modules being added to Cloudron via several app carried and maintained by our fearless developers team here.

          First I discover is in Joplin as I'm a Joplin power user (this app is amazing) and there exists several plugins to add to enhance the app and one of them is called Jarvis and offer SEVERAL AI features to use right inside your notes everywhere.

          Then recently in the latest major release of NextCloud they've added an OpenAI integration module.

          And just now, in a major release of ChatWoot: OpenAI integration : ( Reply suggestions, summarization, and ability to improve drafts ). This is just a beginning this app was already very much powerful and getting more and more almost by the day.

          I guess, RocketChat and MatterMost* might quickly follow if it's not already done. (?)

          Roughly, I know it can also be integrated into LibreOffice somehow.

          What else, anyone?

          L Offline
          L Offline
          LoudLemur
          wrote on last edited by
          #56

          @micmc said in AI on Cloudron:

          What else, anyone?

          There is some AI tagging of faces in the image applications, for example, Immich.

          I think OCR (Optical Character Recognition) extraction of text from images could count as AI, too.

          By the way, @micmc, thank you very much indeed for your brilliant links which are very worthwhile following.

          1 Reply Last reply
          2
          • L Offline
            L Offline
            LoudLemur
            wrote on last edited by
            #57

            We know Automatic1111 and Serge, there is also oobabooga, which is like a Serge rival, I suppose and which can install Llama2:
            https://github.com/oobabooga/text-generation-webui
            there is a video explaining how to do it here:
            https://vid.puffyan.us/watch?v=SbuhznykQBg&quality=dash

            1 Reply Last reply
            1
            • micmcM micmc

              I guess you've come to realize the GPU power required to run a decent ML model 🙂

              One might want to experience on a lower scale there already is models that work with consumer grade hardware. That you can install on your local PC, and since it runs on Docker theoretically it should run on a VPS as well as maybe even Cloudron then. I've not yet tested this on a VPS but I'm working on it.

              Check out https://localai.io/ which does NOT require GPU and uses several GPT models as well, like llama.cpp, gpt4all.cpp and whisper.cpp and more... This site is very interesting for AI enthusiasts who want to learn more and tests things deeper, it's fascinating.

              Enjoy!

              L Offline
              L Offline
              LoudLemur
              wrote on last edited by LoudLemur
              #58

              @micmc said in AI on Cloudron:

              I guess you've come to realize the GPU power required to run a decent ML model

              I sure have! Some people have commented that Meta deliberately didn't release a 30b version of Llama2 as that would be just about possible to run on consumer grade hardware.

              That 70b Llama2 I tried on huggingface chat and I loved it! I want to try and host it, and am trying to figure out how to go about doing that. This is what I am looking at. Maybe others here have some better ideas!

              https://xethub.com/XetHub/Llama2

              brave_z1bOx9ZmdR.png

              Is it relatively easy to just "turn on" a pre-saved instance when you are ready for a session and then go full-steam for a couple of hours, then when you are finished for the day, "turn off" and just pay the $4? It is a highly efficient.

              brave_8CbxzEvhhR.png

              1 Reply Last reply
              0
              • humptydumptyH Offline
                humptydumptyH Offline
                humptydumpty
                wrote on last edited by humptydumpty
                #59

                I was reading through localai.io github pages and it mentions this nifty tool for upscaling images (Upscayl). It converted a 640px image to 2560px and the detail improvement is insane. No more blurriness or pixelation, and it even enhances the colors to make it "pop". I used the ULTRASHARP mode. The first (default) mode in the list does the same job minus the color pop. Here are the images for comparison.

                640px ring
                ring.png


                2560px upscayle + compressed (TinyPNG; forum limits size to 4MB)

                TINY_ring_upscayl_4x_ultrasharp.png

                1 Reply Last reply
                6
                • micmcM Offline
                  micmcM Offline
                  micmc
                  wrote on last edited by
                  #60

                  Talking about LocalAI.io There is now a OpenAI plugin that can be added to OnlyOffice in NextCloud.
                  You can use OpenAI API by putting your API key, but it can also be use, and this is what they even recommend, with a LocalAI instance running on the same server as NextCloud.

                  I guess we should start looking if a local instance of LocalAI could be run smoothly on a LAMP instance on Cloudron, and even be added as an app to Cloudron since it is built to run on Docker already?

                  Ignorance is not an excuse anymore!
                  https://AutomateKit.com

                  robiR 1 Reply Last reply
                  5
                  • micmcM micmc

                    Talking about LocalAI.io There is now a OpenAI plugin that can be added to OnlyOffice in NextCloud.
                    You can use OpenAI API by putting your API key, but it can also be use, and this is what they even recommend, with a LocalAI instance running on the same server as NextCloud.

                    I guess we should start looking if a local instance of LocalAI could be run smoothly on a LAMP instance on Cloudron, and even be added as an app to Cloudron since it is built to run on Docker already?

                    robiR Offline
                    robiR Offline
                    robi
                    wrote on last edited by
                    #61

                    @micmc https://forum.cloudron.io/topic/9399/how-to-run-ai-models-in-lamp-app

                    Conscious tech

                    micmcM 1 Reply Last reply
                    4
                    • robiR robi

                      @micmc https://forum.cloudron.io/topic/9399/how-to-run-ai-models-in-lamp-app

                      micmcM Offline
                      micmcM Offline
                      micmc
                      wrote on last edited by
                      #62

                      @robi said in AI on Cloudron:

                      @micmc https://forum.cloudron.io/topic/9399/how-to-run-ai-models-in-lamp-app

                      Exactly what we need to deepened right!
                      Thanks mate.

                      Ignorance is not an excuse anymore!
                      https://AutomateKit.com

                      1 Reply Last reply
                      0
                      • L Offline
                        L Offline
                        LoudLemur
                        wrote on last edited by
                        #63

                        Stackoverflow usage has plummeted with people switching to AI for solutions, so they have now launched Overflow AI:

                        https://stackoverflow.co/labs/

                        1 Reply Last reply
                        3
                        • micmcM micmc

                          I'm not sure if anyone here, power users, has realized that there already is AI tools and modules being added to Cloudron via several app carried and maintained by our fearless developers team here.

                          First I discover is in Joplin as I'm a Joplin power user (this app is amazing) and there exists several plugins to add to enhance the app and one of them is called Jarvis and offer SEVERAL AI features to use right inside your notes everywhere.

                          Then recently in the latest major release of NextCloud they've added an OpenAI integration module.

                          And just now, in a major release of ChatWoot: OpenAI integration : ( Reply suggestions, summarization, and ability to improve drafts ). This is just a beginning this app was already very much powerful and getting more and more almost by the day.

                          I guess, RocketChat and MatterMost* might quickly follow if it's not already done. (?)

                          Roughly, I know it can also be integrated into LibreOffice somehow.

                          What else, anyone?

                          L Offline
                          L Offline
                          LoudLemur
                          wrote on last edited by
                          #64

                          @micmc said in AI on Cloudron:

                          Then recently in the latest major release of NextCloud they've added an OpenAI integration module.

                          Nextcloud have an article on this:
                          https://nextcloud.com/blog/ai-in-nextcloud-what-why-and-how/

                          1 Reply Last reply
                          2
                          • L Offline
                            L Offline
                            LoudLemur
                            wrote on last edited by
                            #65

                            Prompt Engineering World Championships:
                            https://app.openpipe.ai/world-champs/signup

                            RoboCup 2023 - AI/Robot football:
                            https://www.robocup.org/
                            https://vid.puffyan.us/watch?v=vwIuQKKg-sY&quality=dash

                            1 Reply Last reply
                            2
                            • L Offline
                              L Offline
                              LoudLemur
                              wrote on last edited by
                              #66

                              Nvidia reveal new chip:
                              https://nvidianews.nvidia.com/news/gh200-grace-hopper-superchip-with-hbm3e-memory

                              1 Reply Last reply
                              0
                              • L Offline
                                L Offline
                                LoudLemur
                                wrote on last edited by
                                #67

                                Useful collection of AI prompts:
                                https://huggingface.co/datasets/fka/awesome-chatgpt-prompts

                                1 Reply Last reply
                                4
                                • L Offline
                                  L Offline
                                  LoudLemur
                                  wrote on last edited by
                                  #68

                                  Has anybody had success finetuning a language model so that it becomes an expert on local data? What free software tools were used and was it an enjoyable, fruitful process?

                                  micmcM 1 Reply Last reply
                                  1
                                  • L LoudLemur

                                    Has anybody had success finetuning a language model so that it becomes an expert on local data? What free software tools were used and was it an enjoyable, fruitful process?

                                    micmcM Offline
                                    micmcM Offline
                                    micmc
                                    wrote on last edited by
                                    #69

                                    @LoudLemur said in AI on Cloudron:

                                    Has anybody had success finetuning a language model so that it becomes an expert on local data? What free software tools were used and was it an enjoyable, fruitful process?

                                    For my part, I'd been trying to play a little but not much yet, I'm actually busy working on developing and launching a SaaS Web UI through API connections to different LLM providers.
                                    The service makes it very easy to anyone to create content (and well, much more like consulting with powerful coach bots in several fields) with a few clicks, and answering a few questions.
                                    Local data labs is next...

                                    Ignorance is not an excuse anymore!
                                    https://AutomateKit.com

                                    L 1 Reply Last reply
                                    2
                                    • micmcM micmc

                                      @LoudLemur said in AI on Cloudron:

                                      Has anybody had success finetuning a language model so that it becomes an expert on local data? What free software tools were used and was it an enjoyable, fruitful process?

                                      For my part, I'd been trying to play a little but not much yet, I'm actually busy working on developing and launching a SaaS Web UI through API connections to different LLM providers.
                                      The service makes it very easy to anyone to create content (and well, much more like consulting with powerful coach bots in several fields) with a few clicks, and answering a few questions.
                                      Local data labs is next...

                                      L Offline
                                      L Offline
                                      LoudLemur
                                      wrote on last edited by
                                      #70

                                      @micmc https://www.deeplearning.ai/short-courses/finetuning-large-language-models

                                      1 Reply Last reply
                                      1
                                      • L Offline
                                        L Offline
                                        LoudLemur
                                        wrote on last edited by LoudLemur
                                        #71

                                        Fine-Tuned CodeLlama-34B now beats ChatGPT4 on HumanEval.
                                        https://www.phind.com/blog/code-llama-beats-gpt4

                                        https://huggingface.co/Phind

                                        https://huggingface.co/sbeall/Phind-CodeLlama-34B-v1-q5_K_M-GGUF/tree/main

                                        You can run it on Ollama today:
                                        https://ollama.ai/library/phind-codellama/tags

                                        OpenRouter allows you cheap access to Free Software Language Model APIs:
                                        https://openrouter.ai/docs#models

                                        brave_ZrqYQKLa0W.png

                                        1 Reply Last reply
                                        3
                                        • L Offline
                                          L Offline
                                          LoudLemur
                                          wrote on last edited by LoudLemur
                                          #72

                                          Falcon-180b (that is 180 billion parameter) Free Software model has now been released.
                                          This is the chat version:

                                          https://huggingface.co/tiiuae/falcon-180B-chat

                                          The open source dataset was created in the UAE through a process of web crawling and stringent filtering out of "adult" sites based on their URL. It is multi-modal friendly with image tagging.

                                          The main site is being absolutely hammered at the moment. Archive:
                                          https://archive.ph/trCCZ

                                          This is by far the largest Free Software model available at the moment, and it is outperforming Llama 2.

                                          500GB download, 2.8TB storage after unpacking, 400 GB of memory will be needed to swiftly run inference o Falcon-180B.

                                          "I think we are going to need a bigger boat."

                                          Demo: https://huggingface.co/spaces/tiiuae/falcon-180b-demo

                                          micmcM 1 Reply Last reply
                                          1
                                          Reply
                                          • Reply as topic
                                          Log in to reply
                                          • Oldest to Newest
                                          • Newest to Oldest
                                          • Most Votes


                                          • Login

                                          • Don't have an account? Register

                                          • Login or register to search.
                                          • First post
                                            Last post
                                          0
                                          • Categories
                                          • Recent
                                          • Tags
                                          • Popular
                                          • Bookmarks
                                          • Search