Cloudron makes it easy to run web apps like WordPress, Nextcloud, GitLab on your server. Find out more or install now.


Skip to content
  • Categories
  • Recent
  • Tags
  • Popular
  • Bookmarks
  • Search
Skins
  • Light
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (No Skin)
  • No Skin
Collapse
Brand Logo

Cloudron Forum

Apps | Demo | Docs | Install
  1. Cloudron Forum
  2. Support
  3. Auto-update to 8.3 - various apps down - database issue

Auto-update to 8.3 - various apps down - database issue

Scheduled Pinned Locked Moved Solved Support
updatepostgresqlpgvector
27 Posts 12 Posters 1.5k Views 13 Watching
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • girishG girish

    @shrey said in Auto-update to 8.3 - various apps down - database issue:

    The postgres databases have all been nuked!

    This is how the upgrade is carried out. The databases are exported, a new postgres is started from fresh and then they are all reimported. During this process postgres does have unlimited memory.

    However, for reasons, we are yet to figure out, on some servers, it seems the reimport fails because postgres is somehow busy. So far, we haven't logs as to why this fails. The fix is as you found out - just do the reimport by restoring the apps (the upgrade automated this but the failure makes the end user do this manually).

    S Online
    S Online
    shrey
    wrote on last edited by
    #15

    @girish said in Auto-update to 8.3 - various apps down - database issue:

    but the failure makes the end user do this manually

    Well, this manual process costed me several hours of downtime of 'production' services, as well as another couple of hours for restoring them (at lot of backup files are really big, e.g. immich) 🫤

    girishG 1 Reply Last reply
    0
    • S shrey

      @girish said in Auto-update to 8.3 - various apps down - database issue:

      but the failure makes the end user do this manually

      Well, this manual process costed me several hours of downtime of 'production' services, as well as another couple of hours for restoring them (at lot of backup files are really big, e.g. immich) 🫤

      girishG Offline
      girishG Offline
      girish
      Staff
      wrote on last edited by
      #16

      @shrey yes, there is clearly some bug somewhere unfortunately 😞

      CptPlasticC 1 Reply Last reply
      0
      • CptPlasticC Offline
        CptPlasticC Offline
        CptPlastic
        wrote on last edited by
        #17

        SHAME ON YOU! I just woke up to a mess of cloudron servers being all messed up over this 8.3 update. To make matters worse almost every single database had to be restored. Some of this caused data loss because the apps on some accounts are used for logging 24/7. I learned a valuable lesson. Turn off auto update. 😠

        1 Reply Last reply
        0
        • girishG girish

          @shrey yes, there is clearly some bug somewhere unfortunately 😞

          CptPlasticC Offline
          CptPlasticC Offline
          CptPlastic
          wrote on last edited by
          #18

          @girish Man this is bad

          girishG 1 Reply Last reply
          0
          • CptPlasticC CptPlastic

            @girish Man this is bad

            girishG Offline
            girishG Offline
            girish
            Staff
            wrote on last edited by
            #19

            @CptPlastic if the cloudron is still in error state, would like to take a look at it. can you email us at support@cloudron.io ?

            1 Reply Last reply
            1
            • nebulonN Offline
              nebulonN Offline
              nebulon
              Staff
              wrote on last edited by
              #20

              Not sure if strong words help the cause, it is not like we introduce bugs or slack on testing on purpose.

              I wonder where the data loss comes in though, there should be only a small timeframe between app backup and app being down (so no data can get changed/added) while the app was down.

              1 Reply Last reply
              4
              • timconsidineT Offline
                timconsidineT Offline
                timconsidine
                App Dev
                wrote on last edited by timconsidine
                #21

                I share the feelings of alarm but we do need to keep it in context - it’s IT - it always goes wrong 😂 - we are amazed and pleasantly, nay deliriously, happy when it works smoothly, as mostly Cloudron does.

                Apart from root cause stuff and understanding why it is smooth for some and (what feels in the moment) near-disaster for others, the interesting point for me is : when is a backup a backup ? When can it be relied on ?

                I couldn’t restore to the pre-update backups, needed to go further back, and then manually add lost stuff. In my case it wasn’t nice, but it wasn’t that hard, but maybe I was lucky, judging by other reports.

                But that’s probably an unanswerable question.
                I wonder if snapshots offer more than backups in this scenario. But I guess that’s outside Cloudron realm.

                Maybe biggest takeaway from all this : don’t make big updates of core services auto-update. Give the user notice, and a chance to make a snapshot or some other cautious approach, and leave them to decide when to run it.

                1 Reply Last reply
                6
                • M Offline
                  M Offline
                  mazarian
                  wrote on last edited by
                  #22

                  Add me to the list of people experiencing the same issue. Hoping for a solution soon! While some apps were able to be restored from backup, I have multiple Chatwoot instances that will not restore.

                  1 Reply Last reply
                  0
                  • J Offline
                    J Offline
                    joseph
                    Staff
                    wrote on last edited by
                    #23

                    @mazarian what is the error when you try to restore?

                    M 1 Reply Last reply
                    0
                    • J joseph

                      @mazarian what is the error when you try to restore?

                      M Offline
                      M Offline
                      mazarian
                      wrote on last edited by
                      #24

                      @joseph thank you for your reply! This is the error:

                      An error occurred during the restore operation: Addons Error: Unexpected response code or HTTP error when piping /home/yellowtent/appsdata/fa89594f-7176-4a81-9c25-1686af1e50da/postgresqldump to http://172.18.30.2:3000/databases/dbfa89594f71764a819c251686af1e50da/restore?access_token=XXXX&username=userXXXX: status 500 complete false
                      
                      1 Reply Last reply
                      0
                      • girishG Offline
                        girishG Offline
                        girish
                        Staff
                        wrote on last edited by girish
                        #25

                        @mazarian thanks for the access, we got to the bottom of the issue!

                        The issue is that the pgvector extension is crashing on some servers. Every time an app like immich or chatwoot attempts to use this extensions, the entire database crashes 😕 This then makes the database go into recovery mode.

                        You can read more at:
                        https://github.com/pgvector/pgvector/issues/143
                        https://github.com/pgvector/pgvector/issues/752
                        https://github.com/pgvector/pgvector/issues/389

                        The database container has been updated now . https://git.cloudron.io/platform/box/-/commit/d2de2c7093e72bdcd3c5e6ea9f8d5dc88a595b77 . We will make a 8.3.1 with the fix.

                        M 1 Reply Last reply
                        5
                        • girishG girish

                          @mazarian thanks for the access, we got to the bottom of the issue!

                          The issue is that the pgvector extension is crashing on some servers. Every time an app like immich or chatwoot attempts to use this extensions, the entire database crashes 😕 This then makes the database go into recovery mode.

                          You can read more at:
                          https://github.com/pgvector/pgvector/issues/143
                          https://github.com/pgvector/pgvector/issues/752
                          https://github.com/pgvector/pgvector/issues/389

                          The database container has been updated now . https://git.cloudron.io/platform/box/-/commit/d2de2c7093e72bdcd3c5e6ea9f8d5dc88a595b77 . We will make a 8.3.1 with the fix.

                          M Offline
                          M Offline
                          mazarian
                          wrote on last edited by
                          #26

                          @girish thank you guys for everything you do and have done to have a stable platform! It's nice to see that even if issues come up, you guys are there for support - moreso than some of the large companies I buy tons of equipment from! You guys are awesome!

                          1 Reply Last reply
                          5
                          • C Offline
                            C Offline
                            crazybrad
                            wrote on last edited by
                            #27

                            @timconsidine You are so right. The best day for Team Cloudron is one in which there are no updates. I give @girish @nebulon @joseph an enormous amount of credit for the job they do. As someone who rolls out ~ monthly updates to users, every time that happens I pray that me and my team didn't miss something important. But it happens to all of us, despite good processes and best of intentions. But 8.3 will be replaced shortly with 8.3.1 and then 8.4 and then 9.0. And this difficult day will be replaced in everyone's memory by the great things that are yet to come!

                            1 Reply Last reply
                            6
                            • J joseph forked this topic on
                            • J joseph forked this topic on
                            Reply
                            • Reply as topic
                            Log in to reply
                            • Oldest to Newest
                            • Newest to Oldest
                            • Most Votes


                            • Login

                            • Don't have an account? Register

                            • Login or register to search.
                            • First post
                              Last post
                            0
                            • Categories
                            • Recent
                            • Tags
                            • Popular
                            • Bookmarks
                            • Search