AI on Cloudron

LoudLemur

Fine-Tuned CodeLlama-34B now beats ChatGPT4 on HumanEval.
https://www.phind.com/blog/code-llama-beats-gpt4

https://huggingface.co/Phind

https://huggingface.co/sbeall/Phind-CodeLlama-34B-v1-q5_K_M-GGUF/tree/main

You can run it on Ollama today:
https://ollama.ai/library/phind-codellama/tags

OpenRouter allows you cheap access to Free Software Language Model APIs:
https://openrouter.ai/docs#models

LoudLemur

Falcon-180b (that is 180 billion parameter) Free Software model has now been released.
This is the chat version:

https://huggingface.co/tiiuae/falcon-180B-chat

The open source dataset was created in the UAE through a process of web crawling and stringent filtering out of "adult" sites based on their URL. It is multi-modal friendly with image tagging.

The main site is being absolutely hammered at the moment. Archive:
https://archive.ph/trCCZ

This is by far the largest Free Software model available at the moment, and it is outperforming Llama 2.

500GB download, 2.8TB storage after unpacking, 400 GB of memory will be needed to swiftly run inference o Falcon-180B.

"I think we are going to need a bigger boat."

Demo: https://huggingface.co/spaces/tiiuae/falcon-180b-demo

LoudLemur

Petals.dev

Run large language models, including Falcon-180b, bittorrent style:
https://petals.dev/

micmc

@LoudLemur said in AI on Cloudron:

500GB download, 2.8TB storage after unpacking, 400 GB of memory will be needed to swiftly run inference o Falcon-180B.

I'm going to install 2 right away!

LoudLemur

OpenAI ChatGPT Enterprise:
https://openai.com/blog/introducing-chatgpt-enterprise

We’re launching ChatGPT Enterprise, which offers enterprise-grade security and privacy, unlimited higher-speed GPT-4 access, longer context windows for processing longer inputs, advanced data analysis capabilities, customization options, and much more.

LoudLemur

Talk to President Obama: https://gptcall.net/

LoudLemur

Chat with AI Characters Offline.
Runs locally. Zero-configuration.
https://faraday.dev/

This library aims to help make gguf downloading/management easier:
https://github.com/ahoylabs/gguf.js

nebulon

@LoudLemur said in AI on Cloudron:

Talk to President Obama: https://gptcall.net/

This is indeed quite fun!

micmc

@nebulon said in AI on Cloudron:

@LoudLemur said in AI on Cloudron:

Talk to President Obama: https://gptcall.net/

This is indeed quite fun!

Yep, and in reality it's relatively easy, and fun too of course, to build such web UI. At the base, it's simply built through prompt engineering. Meaning a fair knowledge of how to talk to the machine, to make it spit according to your will loll
Which means such cool site could also be built by almost anyone with a good idea, using ChatGPT, or almost any good GPT LLMs, and incorporating the right "prompts" in a database like Notion (or even Directus on Cloudron? I'm testing it...), that has capabilities to publish public websites based on their databases.
As anyone ever tried, for example, to just tell ChatGPT to act as Obama or any well known personality, just before starting a conversation with it?

marcusquinn

Seems to be an above average chance that all our digital content creation is going to become an AI Avatar that future generations will be able to chat with. Wild times!

LoudLemur

RTX3090 and Llama 70b @ 20 tokens/second:
https://old.reddit.com/r/Oobabooga/comments/16gxqzt/exllamav2_20_tokenss_for_llama270bchat_on_a_rtx/

Haven't yet verified

https://github.com/turboderp/exllamav2

LoudLemur

@marcusquinn Somebody just created a Lora which they trained on all of the art they created at school, pencil drawings. The AI can now make art as though it were their younger self.

marcusquinn

@LoudLemur We're being uploaded to the Matrix! Maybe we'll become the aliens popping in on other planets

micmc

@marcusquinn said in AI on Cloudron:

@LoudLemur We're being uploaded to the Matrix! Maybe we'll become the aliens popping in on other planets

That's why we 'start' lurking in the IPFS direction

marcusquinn

Here's a fun one!

https://labs.heygen.com/video-translate

micmc

Here's an AMAZING one.

Are you ready for NExT GPT?
https://www.marktechpost.com/2023/09/14/meet-next-gpt-an-end-to-end-general-purpose-any-to-any-multimodal-large-language-models-mm-llms/

Make sure to look at the video on that page.
We knew it, we ain't seen nothing yet (here's something you never will forget... bbb baby you ain't seen nothing yet )

LoudLemur

Agents - a framework for Autonomous Language AI Agents:
https://github.com/aiwaves-cn/agents

There is a very popular demo and a website:
http://www.aiwaves-agents.com/

LoudLemur

"Can my GPU run this?" tool (I thought this would be more useful, but maybe somebody else can get more mileage out of it.)

https://rahulschand.github.io/gpu_poor/
https://github.com/RahulSChand/gpu_poor

LoLLMS WebUI (Lord of Large Language Models: One tool to rule them all) (Elevated privileges might be needed for proper installation. After you install a model, you need to click that tick to select it for use.)

https://github.com/ParisNeo/lollms-webui
TheBloke says this is great, with many unique features. I like it because its licence is clear and Apache 2.0

micmc

@LoudLemur said in AI on Cloudron:

LoLLMS WebUI (Lord of Large Language Models: One tool to rule them all)

I thought it was: Laughing Out Loud at Large Language Models

robi

@micmc Well we already have a company named in the same space.

Cloudron makes it easy to run web apps like WordPress, Nextcloud, GitLab on your server. Find out more or install now.

Cloudron Forum

AI on Cloudron