Graphite keeps crashing OOM
-
@rmdes can you ensure that you run the latest Cloudron version 6.2.7 ? There were fixes for graphite in the last one.
-
I don't have any visible python twisted package error anymore, but it seems the two last OOM event output this error : https://paste.armada.digital/enijufixep.coffeescript
-
I'm also on 6.2.7 but graphite has just started crashing with OOM in the last couple of days (it's never crashed for me before then).
Also agree with @robi that the notifications within Cloudron really ought to include the time it happened (which I can only tell by seeing what time the email notification arrived).
-
Just had another Graphite OOM crash.
Seems strange seeing it's mostly just me using my Cloudron atm and I'm not really doing anything on it.
What does Graphite actually do?
-
Just had another Graphite OOM crash.
Seems strange seeing it's mostly just me using my Cloudron atm and I'm not really doing anything on it.
What does Graphite actually do?
@jdaviescoates that service (graphite+collectd) collects the data used in the graphs, like memory usage over time. Given that it causes issues from time to time and also we don't really utilize it well, we are thinking of maybe collecting the data on our own and ditch graphite.
-
@jdaviescoates that service (graphite+collectd) collects the data used in the graphs, like memory usage over time. Given that it causes issues from time to time and also we don't really utilize it well, we are thinking of maybe collecting the data on our own and ditch graphite.
Thanks
@nebulon said in Graphite keeps crashing OOM:
collecting the data on our own
What would that look like?
-
Thanks
@nebulon said in Graphite keeps crashing OOM:
collecting the data on our own
What would that look like?
-
@jdaviescoates we don't know yet
-
Graphite OOM, again.
-
Graphite OOM, again.
@jdaviescoates how much memory as the limit is set in your case? Also does the server itself have enough free memory to allocate? The settings in Cloudron are only the upper limit, but it may still get killed with oom if there is none available system-wide
-
@jdaviescoates how much memory as the limit is set in your case? Also does the server itself have enough free memory to allocate? The settings in Cloudron are only the upper limit, but it may still get killed with oom if there is none available system-wide
@nebulon it was at whatever the default is (256MB?) I've now upped it to 512MB to see if that stops it. Plenty of spare RAM on the machine.
-
@jdaviescoates how much memory as the limit is set in your case? Also does the server itself have enough free memory to allocate? The settings in Cloudron are only the upper limit, but it may still get killed with oom if there is none available system-wide
-
All this does not sound right then. Do you see anything suspicious in the graphite logs as such? Like frequent restarts of something or so?
-
@nebulon This is the only errors I find in the log, beside the restarts :
https://paste.armada.digital/xanopucuqu.sql -
@nebulon my graphite service has 1.60GB available, still OOM several times a day..
the machine where cloudron is running has 30GB available, on average 15 Gb is being used leaving half of the available memory free.