SOLVED CPU usage breakdown?
-
I keep having this drop_caches process eating all my CPU
I tried to kill this process but no matter the way I choose to terminate/kill it it just keeps going, something is occupying the system,it seems to me, outside of the docker environnement.
-
@nebulon running apps are not the problem, what I see is high CPU usage at the / root level of the box
-
Ok this is interesting, according to https://linux-mm.org/Drop_Caches this means the system tries to free up memory pages. And this essentially from time to time.
From Cloudron side we do not actively do this at all, so this is likely Ubuntu behavior. If this takes up a lot of cpu and for longer time periods, this may be related to the system I/O performance as such.I have to read up further though, for example is this is swap related and then it will indeed be painful if the swap is on some slow disk.
-
After last reboot,the process is now gone and CPU is more stable, quite crazy how this process completely freeze the cloudron box, Swap is being used at 338M over 4GB available and seem to increase very slowly, CPU is now around 50% more or less, still with Highs of 89%
i guess it will be better when on Ubuntu 20.x ? and moving to a new VPS or VDS soon probably
-
Not sure if Ubuntu 20.04 will change anything here, this seems to be a pretty normal thing since a long time for linux distros.
Also this may not indicate that you need more main memory or so, it might simply mean that the process of discarding and finding those resources takes long on your system. Thus my concern about system I/O speeds.
Which VPS provider are you currently using and is this an SSD system?
-
@nebulon This is the VPS specs I'm using : https://contabo.com/?show=configurator&vserver_id=237
8 CPU cores
30 GB RAM (guaranteed)
800 GB disk space (100% SSD)
100% SSD disk space -
This looks ok and good enough to me. I am always a bit unsure about disk I/O measurements and also if this is the root cause here at all, but anyways maybe you can do some sanity check with
hdparm -t /dev/vda1
(replace /dev/vda1 with you main disk partition)It should be somewhere above 500MB/s at least on provider with SSD in my experience.
-
FYI, CPU usage is now stable and around 13%, more or less since Contabo did, and I quote their email, a "technical adjustement".
You can clearly see on that monthly CPU chart, how it evolved (granted it's early days, but still):
Not only this, but the dashboard now loads nearly instantly whereas beforehand, it could take up to 30 seconds.
Taking into account previous random issues with cheaper Contabo VPS in the past, I tend to think the issue was from Contabo from the start.
-
@ruihildt
nice to see it resolved outside of Cloudron.If you want a similarly sized and less expensive system, I can share an invite link for ssd nodes that's been working well for us.
-
@robi Yes please.
I also think next time I encounter a stability issue, I'll explore the provider angle sooner, maybe by replicating it elsewhere the same stack.