Cloudron instance scaling issues after a few hours / couple of days, apps responsive but showing a permanent "Starting..." status
-
@james said in Cloudron instance scaling issues after a few hours / couple of days, apps responsive but showing a permanent "Starting..." status:
I have reviewed the data and could find anything really suspicious.
"couldn't find....." ?
How do we go from there? Happy to enable remote SSH support if helpful.
Many thanks,
-
@uwcrbc please send a mail with your Cloudron ID / dashbaord domain info to support@cloudron.io and enable remote ssh support for us, so we can take a closer look to get this resolved.
-
@uwcrbc please send a mail with your Cloudron ID / dashbaord domain info to support@cloudron.io and enable remote ssh support for us, so we can take a closer look to get this resolved.
-
We got the info and are looking into this. The findings so far are, that the calcom app is constantly restarting and causing the server to become very slow. Not sure why this seems to have cascading effects, since the resources are plenty, but depending on the server provider, the virtual CPUs might get throttled if one core is used 100% over a longer period of time. Will keep this updated on the findings.
-
We got the info and are looking into this. The findings so far are, that the calcom app is constantly restarting and causing the server to become very slow. Not sure why this seems to have cascading effects, since the resources are plenty, but depending on the server provider, the virtual CPUs might get throttled if one core is used 100% over a longer period of time. Will keep this updated on the findings.
-
Also: interestingly the moment the server started playing up is indeed the moment that I had roughly linked to our latest cal.com deployment.
I say latest because this Cloudron server had a previous cal.com app instance that we used as a test a while back - not sure if relevant but maybe it can help inform the situation. -
-
-
@girish Thanks for this - After looking into the box.log:
- post
systemctl restart box
I do see indeedbox:apphealthmonitor app health: xx running / 0 stopped / 0 unresponsive
entry type every 10 seconds or so. - pre
systemctl restart box
(when we experience the issue) I do not see much of thebox:apphealthmonitor app health:xx
entries. Rather, I do have a few rarebox:apphealthmonitor setHealth: <<CONTAINER_UID>> (<<URL>>) waiting for 1192.461 to update health
entries
Hopefully it helps?
- post
-
@uwcrbc I think there is a bug that the apphealthmonitor is getting stuck (for some reason). We have seen this happen but very rarely and not reproducibly . If you can give us access, can you send me a mail at support@cloudron.io ? I can debug this further .
-
I think I have another instance of this bug with the health monitor.
I have installed an app that does not have health checks and it shows as Not Responding in the dashboard even though it works fine.
However, it seems that because of that app and the health monitor getting stuck, any new apps that get installed or upgraded also fail their health checks and remain in Starting... mode in the dashboard.
I have rebooted the server and all the apps come up, other than the one mentioned above, then after this updated apps show the Starting... message.
P.S.
It would be really nice to add thebutton for the 'cloudron' service, like we have for all other services.