• Dave@lemmy.nz
      link
      fedilink
      English
      arrow-up
      6
      ·
      9 hours ago

      But how does this happen? Surely Google has the ability to make highly available systems that are resistant to power going out at one of the three locations (as per the article).

      • jmcs@discuss.tchncs.de
        link
        fedilink
        English
        arrow-up
        4
        ·
        edit-2
        9 hours ago

        That doesn’t help if they have software that assumes it can reach all sites. I remember a few years ago AWS had a EC2 outage in eu-central-1 because of 1 of the Availability Zones went down and the service that allocates instances threw a 500 when it failed to get that AZ’s capacity instead of just allocating the instances to the other 2 AZs.

        • Dave@lemmy.nz
          link
          fedilink
          English
          arrow-up
          4
          arrow-down
          1
          ·
          8 hours ago

          I get how it’s possible, but this is Google. Surely they have decades of experience at keeping a website up no matter what happens!

          • Evil_incarnate
            link
            fedilink
            English
            arrow-up
            11
            ·
            6 hours ago

            Companies are made up of people. Companies save money by firing the most expensive people, the most experienced. The ones left have a lot less experience.

          • 31ank@ani.social
            link
            fedilink
            English
            arrow-up
            4
            ·
            edit-2
            6 hours ago

            You could also say its AWS. They also should have the experience, but mistakes happen