<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0"><channel><title><![CDATA[Old CPU &#x2F; No GPU &#x2F; Ollama Language Model?]]></title><description><![CDATA[<p dir="auto">gemma3:1b</p>
<p dir="auto">This works. we gave it a whopping amount of RAM  (32GB). You might be able to get it to run with less RAM.</p>
<p dir="auto">qwen3:4b was too slow and hit the proxy timeout.</p>
<p dir="auto">In the Ollama terminal you can set some environment variables to help too:</p>
<pre><code>export OLLAMA_KEEP_ALIVE=24h
export OLLAMA_FLASH_ATTENTION=false
</code></pre>
<p dir="auto">OLLAMA_KEEP_ALIVE<br />
24h<br />
Keeps the model loaded in RAM (prevents reloading every request)<br />
OLLAMA_FLASH_ATTENTION<br />
false<br />
More stable on older CPUs</p>
<p dir="auto">After you have Ollama running on cloudron and have its api key, you can go into the Ollama terminal and:</p>
<pre><code>ollama pull gemma3:1b
</code></pre>
<p dir="auto">Then, using your own URL and your own API token, you can run this from your local machine to get gemma to tell you a joke and see if it is working:</p>
<pre><code>curl -X POST "https://YOUR_REAL_OLLAMA_URL/api/chat" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer YOUR_TOKEN" \
  -d '{
    "model": "gemma3:1b",
    "messages": [{"role": "user", "content": "Hello! Tell me a short joke."}],
    "stream": false,
    "options": {
      "num_ctx": 1024,
      "num_thread": 6
    }
  }' | jq
</code></pre>
<p dir="auto">You will hopefully see a joke in the output and maybe some smilies laughing!<img src="https://forum.cloudron.io/assets/plugins/nodebb-plugin-emoji/emoji/android/1f600.png?v=8b6d81684d0" class="not-responsive emoji emoji-android emoji--grinning" style="height:23px;width:auto;vertical-align:middle" title=":grinning:" alt="😀" /></p>
]]></description><link>https://forum.cloudron.io/topic/15639/old-cpu-no-gpu-ollama-language-model</link><generator>RSS for Node</generator><lastBuildDate>Wed, 01 Jul 2026 17:36:52 GMT</lastBuildDate><atom:link href="https://forum.cloudron.io/topic/15639.rss" rel="self" type="application/rss+xml"/><pubDate>Wed, 24 Jun 2026 14:52:24 GMT</pubDate><ttl>60</ttl><item><title><![CDATA[Reply to Old CPU &#x2F; No GPU &#x2F; Ollama Language Model? on Fri, 26 Jun 2026 16:25:55 GMT]]></title><description><![CDATA[<p dir="auto"><a class="plugin-mentions-user plugin-mentions-a" href="/user/andreasdueren" aria-label="Profile: andreasdueren">@<bdi>andreasdueren</bdi></a> Thank you for Hermes! It is a great choice for us and it also tells a funny chicken joke!</p>
<p dir="auto">hermes-4.3-36b</p>
<p dir="auto">Sure! Here's a clucktastic one:</p>
<p dir="auto">Why did the chicken join a band?</p>
<p dir="auto">To learn how to make some "eggcellent" beats! <img src="https://forum.cloudron.io/assets/plugins/nodebb-plugin-emoji/emoji/android/1f414.png?v=8b6d81684d0" class="not-responsive emoji emoji-android emoji--chicken" style="height:23px;width:auto;vertical-align:middle" title="🐔" alt="🐔" /><img src="https://forum.cloudron.io/assets/plugins/nodebb-plugin-emoji/emoji/android/1f3b8.png?v=8b6d81684d0" class="not-responsive emoji emoji-android emoji--guitar" style="height:23px;width:auto;vertical-align:middle" title="🎸" alt="🎸" /></p>
<p dir="auto">(If you want more, just say the word!)</p>
]]></description><link>https://forum.cloudron.io/post/126160</link><guid isPermaLink="true">https://forum.cloudron.io/post/126160</guid><dc:creator><![CDATA[LoudLemur]]></dc:creator><pubDate>Fri, 26 Jun 2026 16:25:55 GMT</pubDate></item><item><title><![CDATA[Reply to Old CPU &#x2F; No GPU &#x2F; Ollama Language Model? on Fri, 26 Jun 2026 15:23:11 GMT]]></title><description><![CDATA[<p dir="auto"><a class="plugin-mentions-user plugin-mentions-a" href="/user/timconsidine" aria-label="Profile: timconsidine">@<bdi>timconsidine</bdi></a> <img src="https://forum.cloudron.io/assets/plugins/nodebb-plugin-emoji/emoji/android/1f920.png?v=8b6d81684d0" class="not-responsive emoji emoji-android emoji--face_with_cowboy_hat" style="height:23px;width:auto;vertical-align:middle" title=":face_with_cowboy_hat:" alt="🤠" /></p>
]]></description><link>https://forum.cloudron.io/post/126157</link><guid isPermaLink="true">https://forum.cloudron.io/post/126157</guid><dc:creator><![CDATA[LoudLemur]]></dc:creator><pubDate>Fri, 26 Jun 2026 15:23:11 GMT</pubDate></item><item><title><![CDATA[Reply to Old CPU &#x2F; No GPU &#x2F; Ollama Language Model? on Fri, 26 Jun 2026 15:03:30 GMT]]></title><description><![CDATA[<p dir="auto">Why did the AI engine search for a chicken joke ?</p>
<p dir="auto">Because it was looking for poultry in motion! <img src="https://forum.cloudron.io/assets/plugins/nodebb-plugin-emoji/emoji/android/1f414.png?v=8b6d81684d0" class="not-responsive emoji emoji-android emoji--chicken" style="height:23px;width:auto;vertical-align:middle" title="🐔" alt="🐔" /></p>
]]></description><link>https://forum.cloudron.io/post/126155</link><guid isPermaLink="true">https://forum.cloudron.io/post/126155</guid><dc:creator><![CDATA[timconsidine]]></dc:creator><pubDate>Fri, 26 Jun 2026 15:03:30 GMT</pubDate></item><item><title><![CDATA[Reply to Old CPU &#x2F; No GPU &#x2F; Ollama Language Model? on Fri, 26 Jun 2026 14:30:38 GMT]]></title><description><![CDATA[<p dir="auto"><a class="plugin-mentions-user plugin-mentions-a" href="/user/joseph" aria-label="Profile: joseph">@<bdi>joseph</bdi></a> Thanks for this story. We asked a smallish model (Qwen 9B) running on a lot of VRAM to tell us a joke.</p>
<p dir="auto">Qwen didn't tell us a joke, it just started thinking about which chicken joke to tell us. <img src="https://forum.cloudron.io/assets/plugins/nodebb-plugin-emoji/emoji/android/1f414.png?v=8b6d81684d0" class="not-responsive emoji emoji-android emoji--chicken" style="height:23px;width:auto;vertical-align:middle" title=":chicken:" alt="🐔" /></p>
<p dir="auto">We looked at its thinking and it had created and considered over 200 chicken jokes before we decided the best thing to do was ... not wait for it!</p>
<p dir="auto">We wish we had left it running to find out which joke it would have eventually chosen for us!</p>
]]></description><link>https://forum.cloudron.io/post/126152</link><guid isPermaLink="true">https://forum.cloudron.io/post/126152</guid><dc:creator><![CDATA[LoudLemur]]></dc:creator><pubDate>Fri, 26 Jun 2026 14:30:38 GMT</pubDate></item><item><title><![CDATA[Reply to Old CPU &#x2F; No GPU &#x2F; Ollama Language Model? on Fri, 26 Jun 2026 00:16:31 GMT]]></title><description><![CDATA[<p dir="auto">Yes, compare it to LFM2.5-270M and 350M which are being geared for on phone inference. The prompts you are used to need to change though and be much more explicit so they don't feel so dumb.</p>
]]></description><link>https://forum.cloudron.io/post/126137</link><guid isPermaLink="true">https://forum.cloudron.io/post/126137</guid><dc:creator><![CDATA[robi]]></dc:creator><pubDate>Fri, 26 Jun 2026 00:16:31 GMT</pubDate></item><item><title><![CDATA[Reply to Old CPU &#x2F; No GPU &#x2F; Ollama Language Model? on Thu, 25 Jun 2026 15:24:40 GMT]]></title><description><![CDATA[<blockquote>
<p dir="auto"><a class="plugin-mentions-user plugin-mentions-a" href="/user/robi" aria-label="Profile: robi">@<bdi>robi</bdi></a> <a href="/post/126118">said</a>:</p>
<p dir="auto">200 tok/sec no less</p>
</blockquote>
<p dir="auto">nice!</p>
]]></description><link>https://forum.cloudron.io/post/126119</link><guid isPermaLink="true">https://forum.cloudron.io/post/126119</guid><dc:creator><![CDATA[LoudLemur]]></dc:creator><pubDate>Thu, 25 Jun 2026 15:24:40 GMT</pubDate></item><item><title><![CDATA[Reply to Old CPU &#x2F; No GPU &#x2F; Ollama Language Model? on Thu, 25 Jun 2026 15:21:41 GMT]]></title><description><![CDATA[<p dir="auto"><a class="plugin-mentions-user plugin-mentions-a" href="/user/loudlemur" aria-label="Profile: LoudLemur">@<bdi>LoudLemur</bdi></a> of course it can and at 200 tok/sec no less.</p>
<p dir="auto">It's just makes a lot of mistakes. Had trouble tool calling and web access</p>
]]></description><link>https://forum.cloudron.io/post/126118</link><guid isPermaLink="true">https://forum.cloudron.io/post/126118</guid><dc:creator><![CDATA[robi]]></dc:creator><pubDate>Thu, 25 Jun 2026 15:21:41 GMT</pubDate></item><item><title><![CDATA[Reply to Old CPU &#x2F; No GPU &#x2F; Ollama Language Model? on Thu, 25 Jun 2026 15:18:50 GMT]]></title><description><![CDATA[<p dir="auto"><a class="plugin-mentions-user plugin-mentions-a" href="/user/robi" aria-label="Profile: robi">@<bdi>robi</bdi></a> Can it tell jokes?</p>
<p dir="auto">I had to find out!<br />
You be the judge:</p>
<p dir="auto">Tell us a quick joke about a chicken</p>
<p dir="auto">Why don't chicken birds fly? <img src="https://forum.cloudron.io/assets/plugins/nodebb-plugin-emoji/emoji/android/1f414.png?v=8b6d81684d0" class="not-responsive emoji emoji-android emoji--chicken" style="height:23px;width:auto;vertical-align:middle" title=":chicken:" alt="🐔" /><br />
Because they are too small.</p>
<p dir="auto"><img src="https://forum.cloudron.io/assets/plugins/nodebb-plugin-emoji/emoji/android/1f923.png?v=8b6d81684d0" class="not-responsive emoji emoji-android emoji--rolling_on_the_floor_laughing" style="height:23px;width:auto;vertical-align:middle" title=":rolling_on_the_floor_laughing:" alt="🤣" /></p>
]]></description><link>https://forum.cloudron.io/post/126116</link><guid isPermaLink="true">https://forum.cloudron.io/post/126116</guid><dc:creator><![CDATA[LoudLemur]]></dc:creator><pubDate>Thu, 25 Jun 2026 15:18:50 GMT</pubDate></item><item><title><![CDATA[Reply to Old CPU &#x2F; No GPU &#x2F; Ollama Language Model? on Thu, 25 Jun 2026 15:10:15 GMT]]></title><description><![CDATA[<p dir="auto">There's a gemma3 270m instruct model that is fast, but it's also not very smart</p>
]]></description><link>https://forum.cloudron.io/post/126115</link><guid isPermaLink="true">https://forum.cloudron.io/post/126115</guid><dc:creator><![CDATA[robi]]></dc:creator><pubDate>Thu, 25 Jun 2026 15:10:15 GMT</pubDate></item><item><title><![CDATA[Reply to Old CPU &#x2F; No GPU &#x2F; Ollama Language Model? on Thu, 25 Jun 2026 15:06:04 GMT]]></title><description><![CDATA[<p dir="auto"><a class="plugin-mentions-user plugin-mentions-a" href="/user/joseph" aria-label="Profile: joseph">@<bdi>joseph</bdi></a></p>
<p dir="auto">Hey, I hope it tells you that joke some day, Joseph! :<img src="https://forum.cloudron.io/assets/plugins/nodebb-plugin-emoji/emoji/android/1f921.png?v=8b6d81684d0" class="not-responsive emoji emoji-android emoji--clown_face" style="height:23px;width:auto;vertical-align:middle" title=":clown_face:" alt="🤡" /></p>
<p dir="auto">In the meanwhile, here is one it told me:</p>
<p dir="auto">Why did the chicken cross the playground? <img src="https://forum.cloudron.io/assets/plugins/nodebb-plugin-emoji/emoji/android/1f414.png?v=8b6d81684d0" class="not-responsive emoji emoji-android emoji--chicken" style="height:23px;width:auto;vertical-align:middle" title=":chicken:" alt="🐔" /></p>
<p dir="auto">To get to the other slide! <img src="https://forum.cloudron.io/assets/plugins/nodebb-plugin-emoji/emoji/android/1f923.png?v=8b6d81684d0" class="not-responsive emoji emoji-android emoji--rolling_on_the_floor_laughing" style="height:23px;width:auto;vertical-align:middle" title=":rolling_on_the_floor_laughing:" alt="🤣" /></p>
<p dir="auto">It is pretty "low-VRAM" humour!</p>
]]></description><link>https://forum.cloudron.io/post/126113</link><guid isPermaLink="true">https://forum.cloudron.io/post/126113</guid><dc:creator><![CDATA[LoudLemur]]></dc:creator><pubDate>Thu, 25 Jun 2026 15:06:04 GMT</pubDate></item><item><title><![CDATA[Reply to Old CPU &#x2F; No GPU &#x2F; Ollama Language Model? on Thu, 25 Jun 2026 08:50:28 GMT]]></title><description><![CDATA[<p dir="auto">Sadly there's currently no substitute for RAM or VRAM.<br />
My Mac silicon chip laptop does an ok-ish job with 24Gb RAM (integrated CPU/GPU memory model)<br />
But mostly I just accept defeat and use Ollama Cloud models (or Venice T2EE cloud models).</p>
]]></description><link>https://forum.cloudron.io/post/126094</link><guid isPermaLink="true">https://forum.cloudron.io/post/126094</guid><dc:creator><![CDATA[timconsidine]]></dc:creator><pubDate>Thu, 25 Jun 2026 08:50:28 GMT</pubDate></item><item><title><![CDATA[Reply to Old CPU &#x2F; No GPU &#x2F; Ollama Language Model? on Thu, 25 Jun 2026 08:29:48 GMT]]></title><description><![CDATA[<p dir="auto">I tried something like this in my 14 year old CPU <img src="https://forum.cloudron.io/assets/plugins/nodebb-plugin-emoji/emoji/android/1f642.png?v=8b6d81684d0" class="not-responsive emoji emoji-android emoji--slightly_smiling_face" style="height:23px;width:auto;vertical-align:middle" title=":-)" alt="🙂" /> It's still writing out the joke . I also have only 16GB RAM to give.</p>
]]></description><link>https://forum.cloudron.io/post/126085</link><guid isPermaLink="true">https://forum.cloudron.io/post/126085</guid><dc:creator><![CDATA[joseph]]></dc:creator><pubDate>Thu, 25 Jun 2026 08:29:48 GMT</pubDate></item></channel></rss>