Cloudron makes it easy to run web apps like WordPress, Nextcloud, GitLab on your server. Find out more or install now.


Skip to content
  • Categories
  • Recent
  • Tags
  • Popular
  • Bookmarks
  • Search
Skins
  • Light
  • Brite
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (No Skin)
  • No Skin
Collapse
Brand Logo

Cloudron Forum

Apps | Demo | Docs | Install
  1. Cloudron Forum
  2. Uptime Kuma
  3. Regular short getaddrinfo EAI_AGAIN outages

Regular short getaddrinfo EAI_AGAIN outages

Scheduled Pinned Locked Moved Uptime Kuma
dns
27 Posts 9 Posters 23.1k Views 9 Watching
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • jdaviescoatesJ jdaviescoates

    I recently enabled email notifications on my Uptime Kuma and I've noticed there are regular short outages. Something to do with getaddrinfo EAI_AGAIN

    Screenshot from 2023-03-09 09-08-30.png

    Any idea what's causing this and how to resolve it?

    Apparently

    EAI_AGAIN is a DNS lookup timed out error, means it is a network connectivity error or proxy related error.

    According to https://stackoverflow.com/questions/40182121/whats-the-cause-of-the-error-getaddrinfo-eai-again

    Oh, and here it says:

    EAI_AGAIN means the DNS server replied that it cannot currently fulfill the request. (If you want the hairy details, the RCODE field in the response is set to 2, SERVFAIL.)

    There is no single solution because it entirely depends on why the DNS server sends that back. Maybe it's overloaded, maybe the network is down, maybe it got the same reply from its upstream server.

    In general, the best you can do is wait a while and try again. Hope that helps.

    I wonder what the specific issue is in my case (perhaps just network issues with Netcup?) 🤷

    jdaviescoatesJ Offline
    jdaviescoatesJ Offline
    jdaviescoates
    wrote on last edited by
    #2

    @jdaviescoates said in Regular short getaddrinfo EAI_AGAIN outages:

    perhaps just network issues with Netcup?

    I've just created a load of monitors to watch apps I've got on some Hetzner Could VPS servers too. I wonder if I'll see the same issues on there or not...

    I use Cloudron with Gandi & Hetzner

    1 Reply Last reply
    0
    • girishG Offline
      girishG Offline
      girish
      Staff
      wrote on last edited by
      #3

      Do you have other domains by any chance? And does this happen only for the coop domain ? Could be a TLD issue that their authoritative servers are flaky.

      jdaviescoatesJ 1 Reply Last reply
      1
      • girishG girish

        Do you have other domains by any chance? And does this happen only for the coop domain ? Could be a TLD issue that their authoritative servers are flaky.

        jdaviescoatesJ Offline
        jdaviescoatesJ Offline
        jdaviescoates
        wrote on last edited by
        #4

        @girish said in Regular short getaddrinfo EAI_AGAIN outages:

        Do you have other domains by any chance?

        Yes, lots.

        @girish said in Regular short getaddrinfo EAI_AGAIN outages:

        And does this happen only for the coop domain ?

        Not sure yet, but shall keep an eye on it.

        @girish said in Regular short getaddrinfo EAI_AGAIN outages:

        Could be a TLD issue that their authoritative servers are flaky.

        Sounds plausible.

        I use Cloudron with Gandi & Hetzner

        1 Reply Last reply
        0
        • RoundHouse1924R Offline
          RoundHouse1924R Offline
          RoundHouse1924
          wrote on last edited by
          #5

          This is an Uptime Kuma quirk that I solved by upping "Retries" on each affected monitor to at least 2. Just one of my 25 monitors needed retries upping to 5 for a fix.

          Hope this helps!

          jdaviescoatesJ 1 Reply Last reply
          3
          • RoundHouse1924R RoundHouse1924

            This is an Uptime Kuma quirk that I solved by upping "Retries" on each affected monitor to at least 2. Just one of my 25 monitors needed retries upping to 5 for a fix.

            Hope this helps!

            jdaviescoatesJ Offline
            jdaviescoatesJ Offline
            jdaviescoates
            wrote on last edited by jdaviescoates
            #6

            @RoundHouse1924 Thanks, although surely that could just mean that when it tried once it was broken but by the time it retried it was working again? Although I guess the time between retries is tiny?

            I use Cloudron with Gandi & Hetzner

            1 Reply Last reply
            1
            • girishG girish moved this topic from Support on
            • girishG Offline
              girishG Offline
              girish
              Staff
              wrote on last edited by
              #7

              I went and checked our instances' logs. I found sporadic ENOTFOUND but very spaced out (like months apart).

              1 Reply Last reply
              1
              • M Offline
                M Offline
                mazarian
                wrote on last edited by
                #8

                I know this is an old thread, but I'm also having this issue and curious if anyone has found a solution other than upping retries. Or perhaps there's another service similar to Uptime Kuma that doesn't have these issues?

                1 Reply Last reply
                1
                • girishG Offline
                  girishG Offline
                  girish
                  Staff
                  wrote on last edited by
                  #9

                  I don't see this in our instance atleast - https://status.cloudron.io . Which TLD are you using?

                  1 Reply Last reply
                  0
                  • M Offline
                    M Offline
                    mazarian
                    wrote on last edited by
                    #10

                    Hey thanks for getting back! My TLD is .LA.

                    1 Reply Last reply
                    0
                    • girishG Offline
                      girishG Offline
                      girish
                      Staff
                      wrote on last edited by girish
                      #11

                      @mazarian as a way to debug this further, maybe you can add a www.cloudron.io or some other domain (io or com) , to your Uptime Kuma. This well help us narrow if this is a general DNS/network issue or specific to your TLD.

                      1 Reply Last reply
                      1
                      • M Offline
                        M Offline
                        mazarian
                        wrote on last edited by
                        #12

                        That's a fantastic idea! I will add it. I ended up migrating it off Cloudron for the time being because I have come to depend on UK for work and all the notifications were bogging me down.

                        I will restart the old instance to test and will report back what I find out.

                        1 Reply Last reply
                        2
                        • jrl-abstract27J Offline
                          jrl-abstract27J Offline
                          jrl-abstract27
                          wrote on last edited by jrl-abstract27
                          #13

                          I do have the same kind of shortages with uptime kuma. I did add Cloudron to see. Any idea of what is happening ?

                          I have a dedicated server with hetzner.

                          CleanShot 2024-04-29 at 19.55.51@2x.png

                          London Web Agency : https://abstract27.com/

                          1 Reply Last reply
                          1
                          • girishG Offline
                            girishG Offline
                            girish
                            Staff
                            wrote on last edited by
                            #14

                            @jrl-abstract27 this is to do with the local DNS server (unbound) not resolving . In Cloudron 8 (the next release), we are removing unbound altogether and it will use your network's resolver via systemd-resolved. Maybe this issues gets sorted out with that.

                            F 1 Reply Last reply
                            1
                            • jrl-abstract27J Offline
                              jrl-abstract27J Offline
                              jrl-abstract27
                              wrote on last edited by
                              #15

                              thanks @girish

                              London Web Agency : https://abstract27.com/

                              1 Reply Last reply
                              0
                              • girishG girish

                                @jrl-abstract27 this is to do with the local DNS server (unbound) not resolving . In Cloudron 8 (the next release), we are removing unbound altogether and it will use your network's resolver via systemd-resolved. Maybe this issues gets sorted out with that.

                                F Offline
                                F Offline
                                factord
                                wrote on last edited by
                                #16

                                @girish

                                @girish said in Regular short getaddrinfo EAI_AGAIN outages:

                                @jrl-abstract27 this is to do with the local DNS server (unbound) not resolving . In Cloudron 8 (the next release), we are removing unbound altogether and it will use your network's resolver via systemd-resolved. Maybe this issues gets sorted out with that.

                                Hi, i just upgraded to Cloudron 8.0.3, rebooted server and i see Unbound still appears in the services pages, is that normal? does that mean uptime kuma still uses it?

                                1 Reply Last reply
                                2
                                • nebulonN Offline
                                  nebulonN Offline
                                  nebulon
                                  Staff
                                  wrote on last edited by
                                  #17

                                  unbound is still used in Cloudron, but its usage is drastically reduced now. It is used for directly querying the nameservers to check if DNS records are already in-sync to avoid hitting NXDOMAIN for newly installed apps as well as for email DNS record lookup.

                                  The rest now uses whatever the default setup, of the environment the server is running in, is.

                                  1 Reply Last reply
                                  2
                                  • girishG Offline
                                    girishG Offline
                                    girish
                                    Staff
                                    wrote on last edited by
                                    #18

                                    To conclude what @nebulon said, for Uptime Kuma this means that it uses the system DNS (and not unbound).

                                    F 1 Reply Last reply
                                    3
                                    • girishG girish

                                      To conclude what @nebulon said, for Uptime Kuma this means that it uses the system DNS (and not unbound).

                                      F Offline
                                      F Offline
                                      factord
                                      wrote on last edited by
                                      #19

                                      @girish ouch, i still have the EAI_AGAIN error, i suppose i have to check with uptime kuma then. Any suggestion? maybe our datacenter dns is overloaded and we should use google dns or something like that?

                                      1 Reply Last reply
                                      1
                                      • girishG Offline
                                        girishG Offline
                                        girish
                                        Staff
                                        wrote on last edited by girish
                                        #20

                                        @factord right, so this means this is either a local DNS issue or uptime kuma issue. Quick idea is to just set up the server's /etc/resolv.conf with Google DNS maybe and check if it mitigates the issue.

                                        1 Reply Last reply
                                        1
                                        • T Offline
                                          T Offline
                                          thoresson
                                          wrote on last edited by
                                          #21

                                          I've used Uptime Kuna without any problem since April. But for the last 24+ hours I suddenly have started to receive tons of these:

                                          CleanShot 2024-09-24 at 20.58.12@2x.png

                                          The affected hosts are on .se, and one .social. Some of them are hosted at the same box I run Cloudron on, some on others.

                                          I see that it's not the same error message as in the OP, but similar enough to be related?

                                          According to my logs, my Cloudron was updated to 8.0.4 on August 28, and 8.0.6 yesterday morning. I don't know for sure, but from the graphs it looks like the problem started as soon as that update had been installed. This is what the week graph looks like for or the monitored services with problems:

                                          CleanShot 2024-09-24 at 21.15.53@2x.png

                                          1 Reply Last reply
                                          2
                                          Reply
                                          • Reply as topic
                                          Log in to reply
                                          • Oldest to Newest
                                          • Newest to Oldest
                                          • Most Votes


                                          • Login

                                          • Don't have an account? Register

                                          • Login or register to search.
                                          • First post
                                            Last post
                                          0
                                          • Categories
                                          • Recent
                                          • Tags
                                          • Popular
                                          • Bookmarks
                                          • Search