AI on Cloudron
-
Fine-Tuned CodeLlama-34B now beats ChatGPT4 on HumanEval.
https://www.phind.com/blog/code-llama-beats-gpt4https://huggingface.co/sbeall/Phind-CodeLlama-34B-v1-q5_K_M-GGUF/tree/main
You can run it on Ollama today:
https://ollama.ai/library/phind-codellama/tagsOpenRouter allows you cheap access to Free Software Language Model APIs:
https://openrouter.ai/docs#models -
Falcon-180b (that is 180 billion parameter) Free Software model has now been released.
This is the chat version:https://huggingface.co/tiiuae/falcon-180B-chat
The open source dataset was created in the UAE through a process of web crawling and stringent filtering out of "adult" sites based on their URL. It is multi-modal friendly with image tagging.
The main site is being absolutely hammered at the moment. Archive:
https://archive.ph/trCCZThis is by far the largest Free Software model available at the moment, and it is outperforming Llama 2.
500GB download, 2.8TB storage after unpacking, 400 GB of memory will be needed to swiftly run inference o Falcon-180B.
"I think we are going to need a bigger boat."
-
Petals.dev
Run large language models, including Falcon-180b, bittorrent style:
https://petals.dev/ -
@LoudLemur said in AI on Cloudron:
500GB download, 2.8TB storage after unpacking, 400 GB of memory will be needed to swiftly run inference o Falcon-180B.
I'm going to install 2 right away!
-
OpenAI ChatGPT Enterprise:
https://openai.com/blog/introducing-chatgpt-enterpriseWeβre launching ChatGPT Enterprise, which offers enterprise-grade security and privacy, unlimited higher-speed GPT-4 access, longer context windows for processing longer inputs, advanced data analysis capabilities, customization options, and much more.
-
Talk to President Obama: https://gptcall.net/
-
Chat with AI Characters Offline.
Runs locally. Zero-configuration.
https://faraday.dev/This library aims to help make gguf downloading/management easier:
https://github.com/ahoylabs/gguf.js -
@LoudLemur said in AI on Cloudron:
Talk to President Obama: https://gptcall.net/
This is indeed quite fun!
-
@nebulon said in AI on Cloudron:
@LoudLemur said in AI on Cloudron:
Talk to President Obama: https://gptcall.net/
This is indeed quite fun!
Yep, and in reality it's relatively easy, and fun too of course, to build such web UI. At the base, it's simply built through prompt engineering. Meaning a fair knowledge of how to talk to the machine, to make it spit according to your will loll
Which means such cool site could also be built by almost anyone with a good idea, using ChatGPT, or almost any good GPT LLMs, and incorporating the right "prompts" in a database like Notion (or even Directus on Cloudron? I'm testing it...), that has capabilities to publish public websites based on their databases.
As anyone ever tried, for example, to just tell ChatGPT to act as Obama or any well known personality, just before starting a conversation with it? -
Seems to be an above average chance that all our digital content creation is going to become an AI Avatar that future generations will be able to chat with. Wild times!
-
RTX3090 and Llama 70b @ 20 tokens/second:
https://old.reddit.com/r/Oobabooga/comments/16gxqzt/exllamav2_20_tokenss_for_llama270bchat_on_a_rtx/Haven't yet verified
-
@marcusquinn Somebody just created a Lora which they trained on all of the art they created at school, pencil drawings. The AI can now make art as though it were their younger self.
-
@LoudLemur We're being uploaded to the Matrix! Maybe we'll become the aliens popping in on other planets
-
@marcusquinn said in AI on Cloudron:
@LoudLemur We're being uploaded to the Matrix! Maybe we'll become the aliens popping in on other planets
That's why we 'start' lurking in the IPFS direction
-
Here's a fun one!
-
Here's an AMAZING one.
Are you ready for NExT GPT?
https://www.marktechpost.com/2023/09/14/meet-next-gpt-an-end-to-end-general-purpose-any-to-any-multimodal-large-language-models-mm-llms/Make sure to look at the video on that page.
We knew it, we ain't seen nothing yet (here's something you never will forget... bbb baby you ain't seen nothing yet ) -
Agents - a framework for Autonomous Language AI Agents:
https://github.com/aiwaves-cn/agentsThere is a very popular demo and a website:
http://www.aiwaves-agents.com/ -
"Can my GPU run this?" tool (I thought this would be more useful, but maybe somebody else can get more mileage out of it.)
https://rahulschand.github.io/gpu_poor/
https://github.com/RahulSChand/gpu_poorLoLLMS WebUI (Lord of Large Language Models: One tool to rule them all) (Elevated privileges might be needed for proper installation. After you install a model, you need to click that tick to select it for use.)
https://github.com/ParisNeo/lollms-webui
TheBloke says this is great, with many unique features. I like it because its licence is clear and Apache 2.0 -
@LoudLemur said in AI on Cloudron:
LoLLMS WebUI (Lord of Large Language Models: One tool to rule them all)
I thought it was: Laughing Out Loud at Large Language Models