    UNSOLVED Backup Stuck until manually stopped.

    Support
    backups
    • nebulon
      nebulon Staff last edited by

      A bit more of the earlier logs would be helpful here. Given that the process got a SIGKILL, the system may simply have been running out of memory, in which case the kernel killed that process to free some up. Usually dmesg or journalctl --system would give more indication here.
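As a rough illustration of the check suggested above, the following Python sketch scans kernel log text for OOM-killer reports. The function names `find_oom_kills` and `read_kernel_log` are made up for this example, and the regular expression assumes the usual "Out of memory: Killed process ..." wording, which can vary between kernel versions.

```python
import re
import subprocess

# Assumed wording of the kernel's OOM report line, for example
# "Out of memory: Killed process 1234 (node) ..." (older kernels print "Kill process").
OOM_RE = re.compile(r"Out of memory: Kill(?:ed)? process (\d+) \(([^)]+)\)")


def find_oom_kills(log_text):
    """Return a list of (pid, process_name) for each OOM kill found in log_text."""
    return [(int(pid), name) for pid, name in OOM_RE.findall(log_text)]


def read_kernel_log():
    """Return the kernel ring buffer as text; this often requires root."""
    result = subprocess.run(["dmesg"], capture_output=True, text=True, check=True)
    return result.stdout
```

Calling `find_oom_kills(read_kernel_log())` on the server would list any processes the kernel killed for memory; an empty list only means no matching line is still in the ring buffer.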

      • marcusquinn
        marcusquinn @nebulon last edited by

        @nebulon emailed to support@cloudron.io - thanks 🙂

        • robi
          robi last edited by

          Ditto, emailed in the logs.

          • robi
            robi last edited by

            There was nothing useful in the main log.

            There is a different set of backup task logs in the Cloudron Backups menu at the bottom of the page.

            This was requested for the relevant backup task and sent in.

            Pro Tip: The URL of the backups log can be modified to specify the task to search for.

            • girish
              girish Staff last edited by

              @robi I was just looking into the logs. It has

              2020-10-31T21:00:18.640Z box:database Connection 548 error: Connection lost: The server closed the connection. PROTOCOL_CONNECTION_LOST
              2020-10-31T21:00:18.854Z box:database Connection 543 error: Connection lost: The server closed the connection. PROTOCOL_CONNECTION_LOST
              2020-10-31T21:00:19.146Z box:database Connection 544 error: Connection lost: The server closed the connection. PROTOCOL_CONNECTION_LOST
              2020-10-31T21:00:19.419Z box:database Connection 545 error: Connection lost: The server closed the connection. PROTOCOL_CONNECTION_LOST
              2020-10-31T21:00:19.797Z box:shell backup-snapshot/app_0800bb6e-f3a5-48fa-994b-1bf0d0d6a67f (stdout): 2020-10-31T21:00:19.797Z box:database Connection 549 error: Connection lost: The server closed the connection. PROTOCOL_CONNECTION_LOST
              2020-10-31T21:00:19.800Z box:shell backup-snapshot/app_0800bb6e-f3a5-48fa-994b-1bf0d0d6a67f (stdout): 2020-10-31T21:00:19.800Z box:database Connection 550 error: Connection lost: The server closed the connection. PROTOCOL_CONNECTION_LOST

              This suggests that MySQL went down. Do you see out-of-memory errors in the dmesg output?

              • girish
                girish Staff last edited by

                Also, can you provide the output of free -m? That should give a good idea of how much memory is available on the server.
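For anyone who wants to read that output programmatically, here is a small Python sketch that turns `free -m` text into a dictionary. The helper name `parse_free` is invented for this example, and it assumes the newer procps layout with a header row and an `available` column; older versions print an extra "-/+ buffers/cache" row that this sketch does not handle.

```python
def parse_free(output):
    """Parse `free -m` text into {row_name: {column_name: megabytes}}."""
    lines = [line for line in output.splitlines() if line.strip()]
    columns = lines[0].split()  # header row: total, used, free, ...
    rows = {}
    for line in lines[1:]:
        name, *values = line.split()
        # The Swap row has fewer values than there are columns; zip stops early.
        rows[name.rstrip(":")] = dict(zip(columns, map(int, values)))
    return rows
```

With the parsed result, `rows["Mem"]["available"]` is the figure that best answers "how much memory can still be used", since it accounts for reclaimable cache.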

                • girish
                  girish Staff @marcusquinn last edited by

                  @marcusquinn said in Backup Stuck until manually stopped.:

                  @nebulon emailed to support@cloudron.io - thanks

                  In your case, the backup got Killed because of lack of memory. I have bumped up the memory for the backup process under Backups -> Configure and started a new backup now. Let's see.

                  • marcusquinn
                    marcusquinn @girish last edited by

                    @girish Danke 🙂

                    • robi
                      robi @girish last edited by

                      @girish

                      No memory errors in dmesg and 6.5 GB of free memory. The server graphs show MySQL using 15% of the 2.5 GB allocated to it, so it's not that.

                      My guess is that we should look at the DB health during that time?

                      What can we do about recovering from the connection error and continuing the backup, or erroring out sooner, so that we avoid the deadlock and let the next backup run?

                      • girish
                        girish Staff @girish last edited by

                        @girish said in Backup Stuck until manually stopped.:

                        In your case, the backup got Killed because of lack of memory. I have bumped up the memory for the backup process under Backups -> Configure and started a new backup now. Let's see.

                        To post a final update on this: the issue was that one app had 2 million files and another had a million files. It looks like Node runs out of heap memory when dealing with such a large number of files (specifically in the tar module we use). I don't know if this is easily fixable.
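To spot apps with file counts in this range before a backup runs, a lazy directory walk is enough. The Python sketch below is only an illustration: `count_files` is a made-up helper, and it is not part of Cloudron or of the tar module mentioned above. It uses `os.scandir` so that it never builds a list of every path.

```python
import os


def count_files(root):
    """Count regular files under root without collecting their paths."""
    total = 0
    pending = [root]  # directories still to visit
    while pending:
        current = pending.pop()
        with os.scandir(current) as entries:
            for entry in entries:
                if entry.is_dir(follow_symlinks=False):
                    pending.append(entry.path)
                elif entry.is_file(follow_symlinks=False):
                    total += 1
    return total
```

Memory use here grows with the number of directories waiting to be visited rather than with the number of files, so it should stay small even for a tree with millions of files.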

                        • girish
                          girish Staff @robi last edited by

                          @robi said in Backup Stuck until manually stopped.:

                          What can we do about recovering from the connection error and continuing the backup or erroring out sooner so we avoid the deadlock and let the next backup run?

                          I think there is a bug. It seems when the database connection goes down, the backup process gets stuck. Ideally, it should just fail at that point. I am looking into reproducing this.
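The "fail at that point" behaviour can be sketched in a few lines of Python. This is not Cloudron's code: the `Backup` class, the `on_connection_lost` hook, and the `BackupAborted` exception are all hypothetical names, and the sketch assumes the database layer can notify the backup when a connection error such as PROTOCOL_CONNECTION_LOST occurs.

```python
import threading


class BackupAborted(Exception):
    """Raised when the backup stops early because the database went away."""


class Backup:
    def __init__(self, steps):
        self.steps = steps                # callables, one per unit of work
        self.db_lost = threading.Event()  # set when the connection is lost
        self.reason = None

    def on_connection_lost(self, reason):
        # Hypothetical hook: the database layer would call this on an error.
        self.reason = reason
        self.db_lost.set()

    def run(self):
        """Run each step in order, aborting as soon as the database is gone."""
        completed = 0
        for step in self.steps:
            if self.db_lost.is_set():
                raise BackupAborted(f"database connection lost: {self.reason}")
            step()
            completed += 1
        return completed
```

Checking the flag between steps lets the task end with an error so the next scheduled backup can start. A step that is itself blocked on I/O would additionally need its own timeout, which this sketch does not cover.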

                          • robi
                            robi @girish last edited by

                            @girish I have reports that this just happened again.
