Cloudron makes it easy to run web apps like WordPress, Nextcloud, GitLab on your server. Find out more or install now.


Skip to content
  • Categories
  • Recent
  • Tags
  • Popular
  • Bookmarks
  • Search
Skins
  • Light
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (No Skin)
  • No Skin
Collapse
Brand Logo

Cloudron Forum

Apps | Demo | Docs | Install
  1. Cloudron Forum
  2. Support
  3. De-duplicating e-mails

De-duplicating e-mails

Scheduled Pinned Locked Moved Solved Support
email
7 Posts 4 Posters 425 Views 4 Watching
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • E Offline
    E Offline
    ekevu123
    wrote on last edited by joseph
    #1

    Unfortunately, I still have an issue left after restoring my server today.

    When I restored my cloudron, e-mails were only partially restored. I then used imapsync to finish the job, which it did - however, I am relatively certain it didn't recognise the emails that were already in the account and copied them again. Now I have a mix of e-mails, they are apparently all there, but I have 20GB more of those that will in the future slow down any backup if I don't do anything about it.

    What I have already tried:

    • using delete2duplicates --useheader "Message-Id" in imapsync
    • imapdedump (a python tool I found for that purpose)

    They deleted a few duplicates, but nothing significant. I still wonder, one user account has 80% more data, I don't really understand what else could be the reason.

    I would ideally want to avoid restoring from backup now, because over the course of the day, there were already e-mails coming in that would be lost then.

    1 Reply Last reply
    0
    • necrevistonnezrN Offline
      necrevistonnezrN Offline
      necrevistonnezr
      wrote on last edited by
      #2

      Is it possible that emails were added by mail clients as they started syncing again after the restore? And maybe these email clients didn’t recognize the duplicates?

      1 Reply Last reply
      0
      • E Offline
        E Offline
        ekevu123
        wrote on last edited by
        #3

        Could be! But how should I resolve that? delete2duplicates has actually found a few thousand e-mails, and everything seems to be there, but I still have about 20GB more than before.

        jdaviescoatesJ 1 Reply Last reply
        0
        • E ekevu123

          Could be! But how should I resolve that? delete2duplicates has actually found a few thousand e-mails, and everything seems to be there, but I still have about 20GB more than before.

          jdaviescoatesJ Offline
          jdaviescoatesJ Offline
          jdaviescoates
          wrote on last edited by
          #4

          @ekevu123 said in De-duplicating e-mails:

          still have about 20GB more than before.

          based on what? Note that Cloudron graphs are not live.

          I use Cloudron with Gandi & Hetzner

          1 Reply Last reply
          0
          • E Offline
            E Offline
            ekevu123
            wrote on last edited by
            #5

            Old server before moving, vs. new server after moving, and I updated the graph after the process was done

            1 Reply Last reply
            0
            • J Online
              J Online
              joseph
              Staff
              wrote on last edited by
              #6

              Haven't dealt with such situations.. Would it make sense to do a file listing and compare what files are extra? IIRC, mailbox format saves emails in separate files.

              1 Reply Last reply
              0
              • J joseph marked this topic as a question on
              • E Offline
                E Offline
                ekevu123
                wrote on last edited by
                #7

                Maybe. But I think I'll just take the extra GB into account and see dynamically if and when issues arise. So far, nothing has been missed.

                1 Reply Last reply
                1
                • J joseph has marked this topic as solved on
                Reply
                • Reply as topic
                Log in to reply
                • Oldest to Newest
                • Newest to Oldest
                • Most Votes


                • Login

                • Don't have an account? Register

                • Login or register to search.
                • First post
                  Last post
                0
                • Categories
                • Recent
                • Tags
                • Popular
                • Bookmarks
                • Search