Cloudron Forum


Auto-update to 8.3 - various apps down - database issue

Support · Solved
Tags: update, postgresql, pgvector
27 Posts 12 Posters 1.5k Views 13 Watching
    #1 timconsidine (App Dev), last edited by girish

    Not very happy that 8.3 rolled itself out when it doesn’t seem stable.
    And it had to happen when I am in the middle of projects, which means I don’t have time to research fixes.
    So, seeking to leverage the power of the community (not being lazy, of course): is there a standard fix path for issues like "Error setting up postgresql. Status code: 500 message: the database system is in recovery mode"?
    I have 10 apps down, some of which are core productivity apps like paperless and gitlab.

      #2 timconsidine (App Dev)

      Seems that :

      • some can be fixed with a simple Retry Task
      • some with Retry Task then Restart app/container
      • some stay as not responding despite those steps

      Will have to investigate tomorrow

        #3 timconsidine (App Dev), last edited by timconsidine

        Update : after manually doing retry task and restart on all apps with error (sometimes multiple times), and restarting postgres and the relevant redis, I could resolve all except :

        • keycloak : a new installation so uninstalled and will re-install later
        • onlyoffice : don’t use much so uninstalled and trying new installation

        So going to close this
        But worrying it happened.

        UPDATE : don’t see how to close - can @staff please do so 👍

          #4 fbartels (App Dev)

          https://forum.cloudron.io/topic/13412/8.3.0-postgres-upgrade-failure

          I ran into the same issue. For me all postgres databases were empty and the fix was to restore these specific apps from a backup from before the upgrade.

            #5 robi

            One of mine upgraded with no issues. 🤷

            Conscious tech

              #6 imc67 (translator)

              Last night my big production Cloudron (one of 3) was updated, but I had noticed the update beforehand, so after reading about the issues I increased postgres memory a lot and made sure all apps were updated first. Luckily it all went well, but it would have been a disaster if this one had had longer downtime.


                #7 robi

                @imc67 @staff perhaps that's a new feature Cloudron upgrades should have, especially when upgrading major database versions and schema updates which may need a bump in the memory requirements temporarily.

                Conscious tech

                • J joseph marked this topic as a question on
                • J joseph has marked this topic as solved on
                  #8 shrey

                  My Cloudron instance auto-upgraded last night and now more than 50 apps are down!

                  Retry > Restart isn't working either, for a lot of them.

                    #9 shrey, last edited by shrey

                    @staff

                    This is a rather incomprehensible blunder on the part of Cloudron! The postgres databases have all been nuked!

                    I'm having to manually restore almost all of the affected apps, which is a very time-intensive task, not to mention the several hours of unscheduled downtime these services have been under.

                      #10 d19dotca, last edited by d19dotca

                      This is strange to read, as I updated to 8.3.0 over the weekend without any issues (other than a couple of SpamAssassin rules I needed to update because of deprecations in the latest version of SA used in Cloudron 8.3.0). Definitely no outages other than a slightly delayed startup while the databases migrated.

                      For what it’s worth, I had 4 GB capacity allocated to the PostgreSQL database service, so maybe that helped?

                      --
                      Dustin Dauncey
                      www.d19.ca


                        #11 shrey

                        @d19dotca I, too, have 4 GB allocated to the Postgres service. Besides that, the resource graphs didn't go anywhere near the threshold during the update process.


                          #12 d19dotca

                          @shrey Interesting. I wonder what the magic number for memory might be, then, if that was the root cause. It likely depends on each server's environment too; it's probably something like X times the current/average use. My PG usage typically runs around 512 MB at most, often less. So 4 GB was several times more than my daily usage.

                          On a side note regarding the graphs: in my experience they are not too valuable, I think because they (correct me if I'm wrong) only update the usage every 5 minutes. So if the service hits its max memory usage and restarts within those 5 minutes, for example, it won't necessarily be recorded. I may be wrong, but that's my general experience, and I tend to take the graphs with a grain of salt.

                          --
                          Dustin Dauncey
                          www.d19.ca
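
The point above about graph resolution can be illustrated with a small simulation. This is only a sketch: the 5-minute interval is the poster's guess rather than a confirmed Cloudron setting, and the memory trace, the spike, and the `sample` helper are all made up for illustration.

```python
# Hypothetical illustration: a coarse periodic sampler can miss a short memory spike.
# The 5-minute interval is the poster's guess, not a confirmed Cloudron value.

def sample(trace, interval):
    """Return every `interval`-th reading, as a periodic sampler would see it."""
    return trace[::interval]

# One reading per second for 10 minutes at a 512 MB baseline (made-up numbers).
trace = [512] * 600

# A 60-second spike to 4096 MB starting at t=100s, e.g. just before a restart.
for t in range(100, 160):
    trace[t] = 4096

true_peak = max(trace)                   # what actually happened
graphed_peak = max(sample(trace, 300))   # what a 5-minute graph would record

print(f"true peak: {true_peak} MB, graphed peak: {graphed_peak} MB")
```

With these invented numbers the sampler only reads the trace at t=0s and t=300s, so the spike between them never shows up in the sampled data.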

                            #13 nebulon (Staff)

                            @shrey do you have any errors in the postgres service logs or do you see similar/same errors in app logs? Did the postgres service start fine after the platform update?


                              #14 girish (Staff)

                              @shrey said in Auto-update to 8.3 - various apps down - database issue:

                              The postgres databases have all been nuked!

                              This is how the upgrade is carried out. The databases are exported, a fresh postgres is started, and then they are all reimported. During this process postgres does have unlimited memory.

                              However, for reasons we have yet to figure out, on some servers the reimport seems to fail because postgres is somehow busy. So far, we don't have logs showing why this fails. The fix is as you found out: just do the reimport by restoring the apps (the upgrade automates this, but the failure makes the end user do it manually).
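
As a rough illustration of the failure point described above (a reimport hitting a server that is temporarily busy), here is a generic retry-with-backoff sketch. It is not Cloudron's upgrade code, and since the cause of the failure is unknown, it is not established that retrying would have helped. `DatabaseBusyError`, `reimport_with_retry`, and `flaky_import` are hypothetical names, and the importer is a fake that simply fails twice.

```python
import time


class DatabaseBusyError(Exception):
    """Hypothetical error standing in for 'the database system is in recovery mode'."""


def reimport_with_retry(import_fn, db_name, attempts=5, base_delay=1.0, sleep=time.sleep):
    """Call import_fn(db_name), retrying with exponential backoff while the server is busy."""
    for attempt in range(1, attempts + 1):
        try:
            return import_fn(db_name)
        except DatabaseBusyError:
            if attempt == attempts:
                raise  # out of attempts: the app would need a manual restore from backup
            sleep(base_delay * 2 ** (attempt - 1))


# Fake importer for demonstration only: fails twice, then succeeds.
calls = {"n": 0}


def flaky_import(db_name):
    calls["n"] += 1
    if calls["n"] <= 2:
        raise DatabaseBusyError("the database system is in recovery mode")
    return f"{db_name} imported"


# Record the delays instead of actually sleeping, so the example runs instantly.
delays = []
result = reimport_with_retry(flaky_import, "app_db", sleep=delays.append)
print(result, delays)
```

Passing `sleep=delays.append` is just a convenience for the demonstration; real code would leave the default `time.sleep` in place.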


                                #15 shrey

                                @girish said in Auto-update to 8.3 - various apps down - database issue:

                                but the failure makes the end user do this manually

                                Well, this manual process cost me several hours of downtime of 'production' services, as well as another couple of hours for restoring them (a lot of backup files are really big, e.g. immich) 🫤


                                  #16 girish (Staff)

                                  @shrey yes, there is clearly some bug somewhere unfortunately 😞

                                    #17 CptPlastic

                                    SHAME ON YOU! I just woke up to a mess of cloudron servers being all messed up over this 8.3 update. To make matters worse almost every single database had to be restored. Some of this caused data loss because the apps on some accounts are used for logging 24/7. I learned a valuable lesson. Turn off auto update. 😠


                                      #18 CptPlastic

                                      @girish Man this is bad


                                        #19 girish (Staff)

                                        @CptPlastic if the Cloudron is still in an error state, we would like to take a look at it. Can you email us at support@cloudron.io?

                                          #20 nebulon (Staff)

                                          Not sure if strong words help the cause; it is not like we introduce bugs or slack on testing on purpose.

                                          I wonder where the data loss comes in, though. There should be only a small timeframe between the app backup and the app going down, and no data can get changed or added while the app is down.
