Cloudron Forum

De-duplicating e-mails

    ekevu123
    wrote on last edited by joseph
    #1

    Unfortunately, I still have an issue left after restoring my server today.

    When I restored my Cloudron, e-mails were only partially restored. I then used imapsync to finish the job, which it did. However, I am fairly certain it didn't recognise the e-mails that were already in the account and copied them again. All my e-mails now appear to be there, but I have about 20GB of extra data that will slow down every future backup unless I clean it up.

    What I have already tried:

    • using --delete2duplicates --useheader "Message-Id" in imapsync
    • imapdedump (a python tool I found for that purpose)

    They deleted a few duplicates, but nothing significant. One user account still has 80% more data than before, and I don't understand what else could be causing that.

    Ideally, I want to avoid restoring from backup again, because the e-mails that came in over the course of the day would then be lost.
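
Editor's sketch, not part of the original post: header-based de-duplication of the kind attempted above groups messages by their Message-Id value and treats every group with more than one member as duplicates. The minimal Python example below illustrates that idea on in-memory sample messages. The function names and sample data are hypothetical, and this is not how imapsync itself is implemented. It also shows one limitation of the approach: a message without a Message-Id header cannot be matched this way.

```python
from collections import defaultdict
from email.parser import BytesHeaderParser


def group_by_message_id(raw_messages):
    """Map each Message-Id value to the indexes of the messages carrying it."""
    parser = BytesHeaderParser()
    groups = defaultdict(list)
    for index, raw in enumerate(raw_messages):
        headers = parser.parsebytes(raw)
        message_id = headers.get("Message-Id")  # header lookup is case-insensitive
        if message_id is None:
            # Nothing to compare: header-based matching cannot see this message.
            continue
        groups[message_id.strip()].append(index)
    return groups


def duplicates(groups):
    """Keep only the Message-Id values that occur more than once."""
    return {mid: idx for mid, idx in groups.items() if len(idx) > 1}


# Hypothetical sample data: two copies of one message, one unique message,
# and one message that has no Message-Id header at all.
sample = [
    b"Message-Id: <a@example.com>\r\nSubject: Hello\r\n\r\nBody\r\n",
    b"Message-Id: <a@example.com>\r\nSubject: Hello\r\n\r\nBody\r\n",
    b"Message-Id: <b@example.com>\r\nSubject: Other\r\n\r\nBody\r\n",
    b"Subject: No id\r\n\r\nBody\r\n",
]
found = duplicates(group_by_message_id(sample))
print(found)
```

Messages that were skipped for lacking the header would need a different comparison key, for example a hash of the full message.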

      necrevistonnezr
      wrote on last edited by
      #2

      Is it possible that emails were added by mail clients as they started syncing again after the restore? And maybe these email clients didn’t recognize the duplicates?

        ekevu123
        wrote on last edited by
        #3

        Could be! But how should I resolve that? --delete2duplicates actually found a few thousand e-mails, and everything seems to be there, but I still have about 20GB more than before.

          jdaviescoates
          wrote on last edited by
          #4

          @ekevu123 said in De-duplicating e-mails:

          still have about 20GB more than before.

          Based on what? Note that Cloudron graphs are not live.


            ekevu123
            wrote on last edited by
            #5

            I compared the old server before the move with the new server after the move, and I refreshed the graph after the process was done.

              joseph
              Staff
              wrote on last edited by
              #6

              I haven't dealt with such a situation before. Would it make sense to do a file listing and compare which files are extra? IIRC, the mail storage format (Maildir) saves each e-mail in a separate file.
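
Editor's sketch, not part of the original post: one way to act on the file-listing idea is to hash the contents of every file under a mail directory and report the files that are byte-for-byte identical. It hashes contents rather than comparing names because a copy made over IMAP is likely stored under a new file name. The minimal Python example below assumes one message per file; the directory layout in the demo is made up, and `root` is a placeholder for wherever the mail data actually lives. Copies whose headers were altered during the sync will not be caught by this comparison.

```python
import hashlib
import os
import tempfile
from collections import defaultdict


def hash_file(path, chunk_size=1 << 20):
    """Return the SHA-256 hex digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def find_identical_files(root):
    """Return groups (sorted lists of paths) of files with identical contents."""
    by_hash = defaultdict(list)
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            by_hash[hash_file(path)].append(path)
    return [sorted(paths) for paths in by_hash.values() if len(paths) > 1]


def wasted_bytes(groups):
    """Bytes that would be freed by keeping one file per group."""
    return sum(os.path.getsize(path) for group in groups for path in group[1:])


# Demo on a throwaway directory; the names and contents are hypothetical.
with tempfile.TemporaryDirectory() as root:
    cur = os.path.join(root, "cur")
    os.makedirs(cur)
    for name, body in [("msg1", b"same"), ("msg2", b"same"), ("msg3", b"other")]:
        with open(os.path.join(cur, name), "wb") as handle:
            handle.write(body)
    groups = find_identical_files(root)
    demo_names = [[os.path.basename(path) for path in group] for group in groups]
    demo_wasted = wasted_bytes(groups)

print(demo_names, demo_wasted)
```

The sketch only reports candidates and deletes nothing, which seems the safer choice on a live mail store.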

              • J joseph marked this topic as a question on
                ekevu123
                wrote on last edited by
                #7

                Maybe. But I think I'll just accept the extra GB for now and deal with issues if and when they arise. So far, no e-mails appear to be missing.

                • J joseph has marked this topic as solved on
