Cloudron makes it easy to run web apps like WordPress, Nextcloud, GitLab on your server. Find out more or install now.


Skip to content
  • Categories
  • Recent
  • Tags
  • Popular
  • Bookmarks
  • Search
Skins
  • Light
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (No Skin)
  • No Skin
Collapse
Brand Logo

Cloudron Forum

Apps | Demo | Docs | Install
  1. Cloudron Forum
  2. Feature Requests
  3. Disk space should never bring a whole server down

Disk space should never bring a whole server down

Scheduled Pinned Locked Moved Feature Requests
50 Posts 15 Posters 7.4k Views 15 Watching
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • marcusquinnM Offline
    marcusquinnM Offline
    marcusquinn
    wrote on last edited by
    #7
    # systemctl status box
    ● box.service - Cloudron Admin
       Loaded: loaded (/etc/systemd/system/box.service; enabled; vendor preset: enabled)
       Active: activating (auto-restart) (Result: exit-code) since Sat 2021-03-06 23:22:48 UTC; 88ms ago
      Process: 311 ExecStart=/home/yellowtent/box/box.js (code=exited, status=1/FAILURE)
     Main PID: 311 (code=exited, status=1/FAILURE)
    
    # systemctl status nginx
    ● nginx.service - nginx - high performance web server
       Loaded: loaded (/lib/systemd/system/nginx.service; enabled; vendor preset: enabled)
      Drop-In: /etc/systemd/system/nginx.service.d
               └─cloudron.conf
       Active: active (running) since Sat 2021-03-06 23:09:24 UTC; 14min ago
         Docs: http://nginx.org/en/docs/
      Process: 1431 ExecStart=/usr/sbin/nginx -c /etc/nginx/nginx.conf (code=exited, status=0/SUCCESS)
     Main PID: 1634 (nginx)
        Tasks: 17 (limit: 4915)
       CGroup: /system.slice/nginx.service
               ├─1634 nginx: master process /usr/sbin/nginx -c /etc/nginx/nginx.conf
               ├─1638 nginx: worker process
               ├─1639 nginx: worker process
               ├─1641 nginx: worker process
               ├─1642 nginx: worker process
               ├─1645 nginx: worker process
               ├─1646 nginx: worker process
               ├─1647 nginx: worker process
               ├─1648 nginx: worker process
               ├─1649 nginx: worker process
               ├─1650 nginx: worker process
               ├─1651 nginx: worker process
               ├─1652 nginx: worker process
               ├─1653 nginx: worker process
               ├─1654 nginx: worker process
               ├─1655 nginx: worker process
               └─1656 nginx: worker process
    
    Mar 06 23:09:23 cloudron01 systemd[1]: Starting nginx - high performance web server...
    Mar 06 23:09:24 cloudron01 systemd[1]: Started nginx - high performance web server.
    

    Sorry, I have to work evenings and weekends, it's the only time I can concentrate on the deep work without all the emails & messages interruptions of the week days.

    Web Design https://www.evergreen.je
    Development https://brandlight.org
    Life https://marcusquinn.com

    1 Reply Last reply
    0
    • marcusquinnM Offline
      marcusquinnM Offline
      marcusquinn
      wrote on last edited by
      #8
      # systemctl status unbound
      ● unbound.service - Unbound DNS Resolver
         Loaded: loaded (/etc/systemd/system/unbound.service; enabled; vendor preset: enabled)
         Active: active (running) since Sat 2021-03-06 23:26:38 UTC; 8s ago
       Main PID: 20802 (unbound)
          Tasks: 1 (limit: 4915)
         CGroup: /system.slice/unbound.service
                 └─20802 /usr/sbin/unbound -d
      
      Mar 06 23:26:38 cloudron01 systemd[1]: Started Unbound DNS Resolver.
      Mar 06 23:26:38 cloudron01 unbound[20802]: [20802:0] notice: init module 0: subnet
      Mar 06 23:26:38 cloudron01 unbound[20802]: [20802:0] notice: init module 1: validator
      Mar 06 23:26:38 cloudron01 unbound[20802]: [20802:0] notice: init module 2: iterator
      Mar 06 23:26:38 cloudron01 unbound[20802]: [20802:0] info: start of service (unbound 1.6.7).
      Mar 06 23:26:39 cloudron01 unbound[20802]: [20802:0] error: could not fflush(/var/lib/unbound/root.key): No space left on device
      Mar 06 23:26:39 cloudron01 unbound[20802]: [20802:0] error: could not fflush(/var/lib/unbound/root.key): No space left on device
      

      Web Design https://www.evergreen.je
      Development https://brandlight.org
      Life https://marcusquinn.com

      1 Reply Last reply
      0
      • marcusquinnM Offline
        marcusquinnM Offline
        marcusquinn
        wrote on last edited by
        #9

        In just cannot get my head around how the disk can be allowed to fill to the point of a total system failure.

        Slowdown, sure I understand - but it's a total fail and I can't see why this isn't preventable.

        Is it really all one has to to bring a Cloudron down is load it up with data?

        There's a bunch of Apps that allow for uploads, it really wouldn't take much effort to flood those with a few GB.

        Web Design https://www.evergreen.je
        Development https://brandlight.org
        Life https://marcusquinn.com

        1 Reply Last reply
        0
        • nebulonN nebulon

          Last time we checked it was actually not easy to preserve space for the system as such, we would need support from the filesystem here.

          Either way after clearing up some space, did you follow https://docs.cloudron.io/troubleshooting/#recovery-after-disk-full ?

          marcusquinnM Offline
          marcusquinnM Offline
          marcusquinn
          wrote on last edited by
          #10

          @nebulon @girish App have Memory & CPU allocations - any reason they can't have disk- space allocations too?

          I'd rather a single app hits a wall than an entire server.

          It seems all one would have to do to bring a whole Cloudron down this way would be send a lot of email attachments to the point of disk saturation.

          Maybe I'm wrong and it's something else - but feel free to delete this post and move to email if it's a reproducible risk.

          Web Design https://www.evergreen.je
          Development https://brandlight.org
          Life https://marcusquinn.com

          girishG 1 Reply Last reply
          0
          • marcusquinnM marcusquinn

            @nebulon @girish App have Memory & CPU allocations - any reason they can't have disk- space allocations too?

            I'd rather a single app hits a wall than an entire server.

            It seems all one would have to do to bring a whole Cloudron down this way would be send a lot of email attachments to the point of disk saturation.

            Maybe I'm wrong and it's something else - but feel free to delete this post and move to email if it's a reproducible risk.

            girishG Offline
            girishG Offline
            girish
            Staff
            wrote on last edited by
            #11

            @marcusquinn said in Disk space should never bring a whole server down:

            @nebulon @girish App have Memory & CPU allocations - any reason they can't have disk- space allocations too?

            Yes, the memory & CPU allocations are features of the linux kernel cgroups. However, disk space allocation is not part of them.

            I guess the issue atleast to handle right now is that for some reason disk space is full. Running docker image prune -a sometimes frees up some disk space. Can you try that? Alternately, if you drop me a mail on support , I can look into the server.

            marcusquinnM 1 Reply Last reply
            1
            • girishG girish

              @marcusquinn said in Disk space should never bring a whole server down:

              @nebulon @girish App have Memory & CPU allocations - any reason they can't have disk- space allocations too?

              Yes, the memory & CPU allocations are features of the linux kernel cgroups. However, disk space allocation is not part of them.

              I guess the issue atleast to handle right now is that for some reason disk space is full. Running docker image prune -a sometimes frees up some disk space. Can you try that? Alternately, if you drop me a mail on support , I can look into the server.

              marcusquinnM Offline
              marcusquinnM Offline
              marcusquinn
              wrote on last edited by
              #12

              @girish said in Disk space should never bring a whole server down:

              docker image prune -a

              OK, thanks, tried that: "Total reclaimed space: 816.4MB"

              Still no respondio though. Have emailed support@ but 2am here and an early start, so back online in 8h or so, by which it'll be your 2am, and appreciate it's Saturday, so just grateful for pointers and hoping I might have some other requests for assistance waking up soon too.

              Web Design https://www.evergreen.je
              Development https://brandlight.org
              Life https://marcusquinn.com

              1 Reply Last reply
              0
              • robiR Offline
                robiR Offline
                robi
                wrote on last edited by
                #13

                Can we add a disk space email alert, and event where after some critical threshold /tmp is cleaned up & docker images pruned by Cloudron.

                Completely avoidable with a bit of this..

                Conscious tech

                marcusquinnM girishG 2 Replies Last reply
                4
                • robiR robi

                  Can we add a disk space email alert, and event where after some critical threshold /tmp is cleaned up & docker images pruned by Cloudron.

                  Completely avoidable with a bit of this..

                  marcusquinnM Offline
                  marcusquinnM Offline
                  marcusquinn
                  wrote on last edited by
                  #14

                  I'm wondering if maybe Cloudron should have its own volume by default.

                  A quick search in the subject but kinda tired now:

                  • https://www.reddit.com/r/docker/comments/loleal/how_to_limit_disk_space_for_a_docker_container/
                  • https://guide.blazemeter.com/hc/en-us/articles/115003812129-Overcoming-Container-Storage-Limitation-Overcoming-Container-Storage-Limitation#:~:text=In the current Docker version,be left in the container
                  • https://stackoverflow.com/questions/38542426/docker-container-specific-disk-quota

                  Web Design https://www.evergreen.je
                  Development https://brandlight.org
                  Life https://marcusquinn.com

                  girishG 1 Reply Last reply
                  0
                  • marcusquinnM marcusquinn

                    I'm wondering if maybe Cloudron should have its own volume by default.

                    A quick search in the subject but kinda tired now:

                    • https://www.reddit.com/r/docker/comments/loleal/how_to_limit_disk_space_for_a_docker_container/
                    • https://guide.blazemeter.com/hc/en-us/articles/115003812129-Overcoming-Container-Storage-Limitation-Overcoming-Container-Storage-Limitation#:~:text=In the current Docker version,be left in the container
                    • https://stackoverflow.com/questions/38542426/docker-container-specific-disk-quota
                    girishG Offline
                    girishG Offline
                    girish
                    Staff
                    wrote on last edited by
                    #15

                    @marcusquinn Managed to bring it up by truncating many logs. Should be coming up in a bit, hold on.

                    marcusquinnM d19dotcaD 2 Replies Last reply
                    2
                    • girishG girish

                      @marcusquinn Managed to bring it up by truncating many logs. Should be coming up in a bit, hold on.

                      marcusquinnM Offline
                      marcusquinnM Offline
                      marcusquinn
                      wrote on last edited by
                      #16

                      @girish Ahhhh - thank you kindly!

                      I have an unused 1TB volume mounted, although I'm not sure how much of the remaining free space is used in the Move function, as I guess that was killing it when I triggered to move the 16GB Jira App data to it?

                      Web Design https://www.evergreen.je
                      Development https://brandlight.org
                      Life https://marcusquinn.com

                      girishG 1 Reply Last reply
                      0
                      • girishG girish

                        @marcusquinn Managed to bring it up by truncating many logs. Should be coming up in a bit, hold on.

                        d19dotcaD Offline
                        d19dotcaD Offline
                        d19dotca
                        wrote on last edited by d19dotca
                        #17

                        @girish said in Disk space should never bring a whole server down:

                        Managed to bring it up by truncating many logs

                        Is this perhaps related to the issue I reported a little while back too, regarding the logrotate not running properly under certain circumstances?

                        --
                        Dustin Dauncey
                        www.d19.ca

                        marcusquinnM 1 Reply Last reply
                        1
                        • d19dotcaD d19dotca

                          @girish said in Disk space should never bring a whole server down:

                          Managed to bring it up by truncating many logs

                          Is this perhaps related to the issue I reported a little while back too, regarding the logrotate not running properly under certain circumstances?

                          marcusquinnM Offline
                          marcusquinnM Offline
                          marcusquinn
                          wrote on last edited by
                          #18

                          @d19dotca I remembered that mention, although fading brain never found or got to looking at that. I kinda think this situation is a bit too easy to get into and hard to get out of once its Terminal only.

                          Web Design https://www.evergreen.je
                          Development https://brandlight.org
                          Life https://marcusquinn.com

                          1 Reply Last reply
                          0
                          • marcusquinnM Offline
                            marcusquinnM Offline
                            marcusquinn
                            wrote on last edited by
                            #19

                            Going to trigger a move on Confluence to the mounted volume, it's 4.5GB with 7.5GB free space now on the main volume - so hopefully that's enough working space but I have to zzz, problems where I know I don't immediately know how to solve are kinda exhausting.

                            Web Design https://www.evergreen.je
                            Development https://brandlight.org
                            Life https://marcusquinn.com

                            1 Reply Last reply
                            0
                            • marcusquinnM marcusquinn

                              @girish Ahhhh - thank you kindly!

                              I have an unused 1TB volume mounted, although I'm not sure how much of the remaining free space is used in the Move function, as I guess that was killing it when I triggered to move the 16GB Jira App data to it?

                              girishG Offline
                              girishG Offline
                              girish
                              Staff
                              wrote on last edited by
                              #20

                              @marcusquinn looks like things are back up! There is ~7GB left, so hopefully that should hold up for sometime.

                              1 Reply Last reply
                              1
                              • girishG Offline
                                girishG Offline
                                girish
                                Staff
                                wrote on last edited by
                                #21

                                I am looking into some clues on what can be done to mitigate this, will report back. BTW, for the volume suggestion, this is possible. In fact, we used to do this very long ago with each app having it's own btrfs partition. Usually, people start with a simple VPS. This means that for this to work out of the box one has to create a loopback file system which is very slow. Also, when I logged in to your server, it was mysql that was down which was not happy with lack of disk.

                                I am wondering if the solution involves suggesting the user to make a specific kind of setup if they want to protect themselves against this kind of issue. That is totally doable (for example, suggest user to move platformdata and boxdata to a separate volume/disk post installation)

                                1 Reply Last reply
                                2
                                • robiR robi

                                  Can we add a disk space email alert, and event where after some critical threshold /tmp is cleaned up & docker images pruned by Cloudron.

                                  Completely avoidable with a bit of this..

                                  girishG Offline
                                  girishG Offline
                                  girish
                                  Staff
                                  wrote on last edited by
                                  #22

                                  @robi We actually have a disk space alert, in fact, it's there right now in the dashboard.

                                  57ac2165-226d-4f31-aa1e-a22cf131df92-image.png

                                  But the above is not super useful because it's just checking space in a cronjob. This cronjob is quite conservative because we don't want to keep spinning the disk too much. I am not aware of a way to get a "signal" from the server when disk space limits are hit. If a server fills up too fast between cron runs, the whole thing is useless...

                                  robiR mehdiM 2 Replies Last reply
                                  2
                                  • marcusquinnM Offline
                                    marcusquinnM Offline
                                    marcusquinn
                                    wrote on last edited by marcusquinn
                                    #23

                                    I've triggered some bigger app data moves to the mounted 1TB volume but it seems to have chewed through 3GB of the remaining free space on the main volume already and I'm back to "Cloudron is offline. Reconnecting". Probably just making hasty tiredness errors now.

                                    Web Design https://www.evergreen.je
                                    Development https://brandlight.org
                                    Life https://marcusquinn.com

                                    girishG scookeS 2 Replies Last reply
                                    0
                                    • marcusquinnM marcusquinn

                                      I've triggered some bigger app data moves to the mounted 1TB volume but it seems to have chewed through 3GB of the remaining free space on the main volume already and I'm back to "Cloudron is offline. Reconnecting". Probably just making hasty tiredness errors now.

                                      girishG Offline
                                      girishG Offline
                                      girish
                                      Staff
                                      wrote on last edited by
                                      #24

                                      @marcusquinn maybe it's best to move them by hand first. Can you send me the apps you want to move by email and I can move it by hand since this seems to keep hitting a wall. ie. free space -> try to free space -> run out of space and start over...

                                      1 Reply Last reply
                                      2
                                      • girishG girish

                                        @robi We actually have a disk space alert, in fact, it's there right now in the dashboard.

                                        57ac2165-226d-4f31-aa1e-a22cf131df92-image.png

                                        But the above is not super useful because it's just checking space in a cronjob. This cronjob is quite conservative because we don't want to keep spinning the disk too much. I am not aware of a way to get a "signal" from the server when disk space limits are hit. If a server fills up too fast between cron runs, the whole thing is useless...

                                        robiR Offline
                                        robiR Offline
                                        robi
                                        wrote on last edited by
                                        #25

                                        @girish yes, but does it email you when approaching the threshold?

                                        threshold setting? (twice a day should be plenty)

                                        action setting checkboxes? (maybe a custom one too?)

                                        heck, even deleting an non critical app would be fine since it's restorable from backup.

                                        Conscious tech

                                        1 Reply Last reply
                                        0
                                        • marcusquinnM marcusquinn

                                          I've triggered some bigger app data moves to the mounted 1TB volume but it seems to have chewed through 3GB of the remaining free space on the main volume already and I'm back to "Cloudron is offline. Reconnecting". Probably just making hasty tiredness errors now.

                                          scookeS Offline
                                          scookeS Offline
                                          scooke
                                          wrote on last edited by
                                          #26

                                          @marcusquinn Hang in there @marcusquinn. Bonne courage.

                                          A life lived in fear is a life half-lived

                                          1 Reply Last reply
                                          2
                                          Reply
                                          • Reply as topic
                                          Log in to reply
                                          • Oldest to Newest
                                          • Newest to Oldest
                                          • Most Votes


                                          • Login

                                          • Don't have an account? Register

                                          • Login or register to search.
                                          • First post
                                            Last post
                                          0
                                          • Categories
                                          • Recent
                                          • Tags
                                          • Popular
                                          • Bookmarks
                                          • Search