Cloud services not always 100% up, Outages today in major providers

    Cloud services not always 100% up, Outages today in major providers

    Cloud services not always 100% up, Outages today in major providers


    Hi folks
    While a lot here like using some Cloud services these things aren't immune from outages -- even the *Mighty Amazon* was out for part of today in quite a large part of the world.

    A major outage has affected a number of major websites including Amazon, Reddit and Twitch.

    The UK government website - gov.uk - was also down as were the Financial Times, the Guardian and the New York Times.

    Cloud computing provider Fastly, which underpins a lot of major websites, said it was behind the problems.

    The firm said there were issues with its global content delivery network (CDN) and was implementing a fix.

    In a statement, it said: "We identified a service configuration that triggered disruption across our POPs (points of presence) globally and have disabled that configuration.

    A POP allows content to be sent from globally distributed servers that are close to the end use.

    "Our global network is coming back online."
    Read more Websites begin to work again after major breakage - BBC News

    Cheers
    jimbo
    jimbo45's Avatar Posted By:

  1. pparks1's Avatar
    Posts : 1,859
    Windows 10 Pro
       #1

    It's very true, no cloud provider is available 100% of the time. It's predominately the reason why in the industry we try to utilize multiple cloud providers to ensure less chance of a systems outage. This also protects against internet outages as well, which happen from time to time.

    Looking at cloud providers as a whole, while outages do occur from time to time....rarely, if ever, is any data actually lost. If you compare that with companies who do things internally, how often do many of them have 100% uptime or 0 loss of data.. For example, when has a company ever found that a system failed and their backups didn't work and they lost data....this happens with some regularity. Or if a system fails midday, they have to revert to a backup from a previous point in time and they lost X number of hours of data.
      My Computers

  2. jimbo45's Avatar
    Posts : 10,818
    Windows / Linux : Arch Linux
    Thread Starter
       #2

    Hi there

    @pparks1

    Multiple providers definitely help but one has to then consider data security -- if backups can be restored from various sites and providers that also opens up a whole new can of worms as regards security etc.

    It just seems as some of those journos suggest that the more we depend on these sorts of services - outages from whatever reason and however small are going to cause massive disruption in how business etc is conducted today and we need to be mindful of the consequencies if we don't have other emergency systems in place to continue commerce while the main service is being repaired / resumed.

    Perhaps these sorts of lessons are best learnt these days before we are 100% reliant on these sorts of providers so we can have systems in place in the case of these things going down either via wilful hacking or just Software glitches (or both).

    We've probably been too complacent on "The Resiliency" of the Cloud" - which so far hasn't been that bad actually - but the more you rely on it without some alternative strategy in place if part of it fails then the worse the effects will be when the inevitable happens.

    Data Loss isn't the only problem here -- if a business for example relies on real time transactions -- e.g FOREX (Foreign Exchange) or other types of transactions --particularly in Stocks etc - Data Loss isn't the main issue -- restoring these systems back say a few hours won't lose data but the amount of trading losses could run into trillions of dollars since the actual online real time trading would not have been captured since the last backup.

    Cheers
    jimbo
      My Computer

  3. pparks1's Avatar
    Posts : 1,859
    Windows 10 Pro
       #3

    jimbo45 said:
    Multiple providers definitely help but one has to then consider data security -- if backups can be restored from various sites and providers that also opens up a whole new can of worms as regards security etc.
    Absolutely true, hence the reason that as an enterprise that relies heavily on AWS, my organization encrypts everything. EBS volumes on EC2 instances are encrypted at rest, databases themselves are encrypted at rest, everything is https and encrypted in transit, etc.

    jimbo45 said:
    It just seems as some of those journos suggest that the more we depend on these sorts of services - outages from whatever reason and however small are going to cause massive disruption in how business etc is conducted today and we need to be mindful of the consequencies if we don't have other emergency systems in place to continue commerce while the main service is being repaired / resumed.
    Yeah, the cloud is a totally different mindset.

    In the old days, we racked and stacked servers and we troubleshot problems on the servers and we backed up the data on the server so that we could restore it. We maintained systems for years and had extensive documentation. When a server went down, the techs were paged and we logged in to fix the issue.

    Now, we build everything using infrastructure as code. Server are deployed via this code, and the data on the server itself is not critical. When a server goes down, the instance is simply terminated, and a brand new image is spun up to take the place of the broken server. We don't often troubleshoot a problem on a node anymore, we just check logging in an enterprise tool like Splunk. We can easily roll out new code multiple times per day, and if a problem exists, we simply roll the change back out. And 90% of the time, we don't use VM's anymore either, it's all containers and Kubernetes.

    jimbo45 said:
    We've probably been too complacent on "The Resiliency" of the Cloud" - which so far hasn't been that bad actually - but the more you rely on it without some alternative strategy in place if part of it fails then the worse the effects will be when the inevitable happens.
    Yes, that's why the cloud providers replicate your data to multiple data centers, and have geographical presence in multiple countries all over the globe. And also the reason why organizations like to have multi-cloud presence too to protect against that random outage.
      My Computers


 

  Related Discussions
Our Sites
Site Links
About Us
Windows 10 Forums is an independent web site and has not been authorized, sponsored, or otherwise approved by Microsoft Corporation. "Windows 10" and related materials are trademarks of Microsoft Corp.

Designer Media Ltd
All times are GMT -5. The time now is 18:29.
Find Us




Windows 10 Forums