Cloudron makes it easy to run web apps like WordPress, Nextcloud, GitLab on your server. Find out more or install now.


Skip to content
  • Categories
  • Recent
  • Tags
  • Popular
  • Bookmarks
  • Search
Skins
  • Light
  • Brite
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (No Skin)
  • No Skin
Collapse
Brand Logo

Cloudron Forum

Apps | Demo | Docs | Install
  1. Cloudron Forum
  2. Support
  3. Cloudron instance scaling issues after a few hours / couple of days, apps responsive but showing a permanent "Starting..." status

Cloudron instance scaling issues after a few hours / couple of days, apps responsive but showing a permanent "Starting..." status

Scheduled Pinned Locked Moved Solved Support
performancestabilityboxplatform
29 Posts 6 Posters 1.1k Views 6 Watching
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • girishG Offline
    girishG Offline
    girish
    Staff
    wrote on last edited by
    #18

    This is probably happening because the apphealthmonitor is getting stuck (for some reason) and thus unable to update the health of apps. @uwcrbc in the box.log , do you see a line printed every 10 seconds like box:apphealthmonitor app health: xx ?

    1 Reply Last reply
    0
    • U Offline
      U Offline
      uwcrbc
      wrote on last edited by
      #19

      @girish Thanks for this - After looking into the box.log:

      • post systemctl restart box I do see indeed box:apphealthmonitor app health: xx running / 0 stopped / 0 unresponsive entry type every 10 seconds or so.
      • pre systemctl restart box (when we experience the issue) I do not see much of the box:apphealthmonitor app health:xx entries. Rather, I do have a few rare box:apphealthmonitor setHealth: <<CONTAINER_UID>> (<<URL>>) waiting for 1192.461 to update health entries

      Hopefully it helps?

      1 Reply Last reply
      0
      • girishG Offline
        girishG Offline
        girish
        Staff
        wrote on last edited by
        #20

        @uwcrbc I think there is a bug that the apphealthmonitor is getting stuck (for some reason). We have seen this happen but very rarely and not reproducibly . If you can give us access, can you send me a mail at support@cloudron.io ? I can debug this further .

        1 Reply Last reply
        1
        • N Offline
          N Offline
          Neiluj
          wrote on last edited by
          #21
          This post is deleted!
          1 Reply Last reply
          0
          • U Offline
            U Offline
            uwcrbc
            wrote on last edited by
            #22

            @girish Thanks for this. Access given / mail sent. Appreciate the help.

            1 Reply Last reply
            0
            • robiR Offline
              robiR Offline
              robi
              wrote last edited by
              #23

              I think I have another instance of this bug with the health monitor.

              I have installed an app that does not have health checks and it shows as Not Responding in the dashboard even though it works fine.

              However, it seems that because of that app and the health monitor getting stuck, any new apps that get installed or upgraded also fail their health checks and remain in Starting... mode in the dashboard.

              I have rebooted the server and all the apps come up, other than the one mentioned above, then after this updated apps show the Starting... message.

              P.S.
              It would be really nice to add the 🔁 button for the 'cloudron' service, like we have for all other services.

              Conscious tech

              girishG 1 Reply Last reply
              0
              • girishG Offline
                girishG Offline
                girish
                Staff
                wrote last edited by girish
                #24

                My home server got into a situation like this just now. It seems it's because the eventlog got flooded with many entries.

                +----------+
                | count(*) |
                +----------+
                |   563547 |
                +----------+
                
                mysql> SELECT action, COUNT(*) AS count
                    -> FROM eventlog
                    -> GROUP BY action
                    -> ORDER BY count DESC;
                +---------------------------+--------+
                | action                    | count  |
                +---------------------------+--------+
                | app.up                    | 446592 |
                | app.down                  | 106588 |
                | backup.cleanup.finish     |   3664 |
                | app.update.finish         |   1354 |
                | app.update                |   1354 |
                | backup.finish             |    920 |
                | backup.start              |    920 |
                | cloudron.update.finish    |    847 |
                | cloudron.update           |    833 |
                | cloudron.start            |    110 |
                | dyndns.update             |     78 |
                

                Those app up/down eventlogs are out of hand!

                I nuked them manually:

                mysql> DELETE from eventlog WHERE action='app.up';
                Query OK, 446782 rows affected (8 min 1,59 sec)
                
                mysql> DELETE from eventlog WHERE action='app.down';
                Query OK, 106588 rows affected (9 min 16,27 sec)
                
                
                

                That took a whopping 17min just to delete entries!

                1 Reply Last reply
                3
                • robiR Offline
                  robiR Offline
                  robi
                  wrote last edited by
                  #25

                  Oh wow, thanks for finding that Girish!

                  Are you pushing fixes?
                  Should we do the same manually?
                  Does box need a restart after or will apphealthmonitor just work better w/o half a million app.up entries?

                  Conscious tech

                  1 Reply Last reply
                  1
                  • girishG Offline
                    girishG Offline
                    girish
                    Staff
                    wrote last edited by
                    #26

                    I pushed fixes for this now.

                    The issue is if the mail container is down or mail is not working, then all the app up/down events accumulate (it's trying to send an email when app goes up/down). Since the fix is a bit involved, a workaround might be to disable app up/down email notifications in the Notification view (I haven't tried this though, but would have fixed my problem atleast).

                    robiR 1 Reply Last reply
                    0
                    • girishG girish

                      I pushed fixes for this now.

                      The issue is if the mail container is down or mail is not working, then all the app up/down events accumulate (it's trying to send an email when app goes up/down). Since the fix is a bit involved, a workaround might be to disable app up/down email notifications in the Notification view (I haven't tried this though, but would have fixed my problem atleast).

                      robiR Offline
                      robiR Offline
                      robi
                      wrote last edited by
                      #27

                      @girish I have all those notification settings disabled, so nothing to do with email for me.

                      Want to look around more?

                      Conscious tech

                      1 Reply Last reply
                      0
                      • robiR robi

                        I think I have another instance of this bug with the health monitor.

                        I have installed an app that does not have health checks and it shows as Not Responding in the dashboard even though it works fine.

                        However, it seems that because of that app and the health monitor getting stuck, any new apps that get installed or upgraded also fail their health checks and remain in Starting... mode in the dashboard.

                        I have rebooted the server and all the apps come up, other than the one mentioned above, then after this updated apps show the Starting... message.

                        P.S.
                        It would be really nice to add the 🔁 button for the 'cloudron' service, like we have for all other services.

                        girishG Offline
                        girishG Offline
                        girish
                        Staff
                        wrote last edited by
                        #28

                        @robi said in Cloudron instance scaling issues after a few hours / couple of days, apps responsive but showing a permanent "Starting..." status:

                        I have installed an app that does not have health checks and it shows as Not Responding in the dashboard even though it works fine.

                        Does this mean you have a custom app which does not properly to healthCheckUrl?

                        robiR 1 Reply Last reply
                        0
                        • girishG girish

                          @robi said in Cloudron instance scaling issues after a few hours / couple of days, apps responsive but showing a permanent "Starting..." status:

                          I have installed an app that does not have health checks and it shows as Not Responding in the dashboard even though it works fine.

                          Does this mean you have a custom app which does not properly to healthCheckUrl?

                          robiR Offline
                          robiR Offline
                          robi
                          wrote last edited by
                          #29

                          @girish It's a LAMP app with apache configured to point to another directory.

                          Conscious tech

                          1 Reply Last reply
                          0
                          Reply
                          • Reply as topic
                          Log in to reply
                          • Oldest to Newest
                          • Newest to Oldest
                          • Most Votes


                          • Login

                          • Don't have an account? Register

                          • Login or register to search.
                          • First post
                            Last post
                          0
                          • Categories
                          • Recent
                          • Tags
                          • Popular
                          • Bookmarks
                          • Search