Welcome!

Government Cloud Authors: Elizabeth White, Pat Romanski, Dana Gardner, Liz McMillan, Gopala Krishna Behara

Related Topics: @CloudExpo, Microservices Expo, Government Cloud

@CloudExpo: Blog Feed Post

Storm Clouds on the Horizon?

With more time and a calmer approach, however, most enterprises will realize that no cloud implementation is fail-proof

The recent Amazon cloud outage at its Northern Virginia data center will raise concerns among CIOs looking to the “public cloud” to improve IT service delivery and reduce day-to-day operating costs.  Industry claims of superior “up-time” performance, reliability and massive redundancy must now be revisited and re-evaluated.  In these early days immediately following the outage, the prevailing sentiment likely will be of shaken confidence in “public cloud” services accompanied by much finger-pointing amid claims of breached service level agreements (SLA’s) and misrepresentation of the cloud’s resiliency.  With more time and a calmer approach, however, most enterprises will realize that no cloud implementation is fail-proof and that shifting significant portions of their enterprises to the cloud should not be undertaken without adequate contigency plans and risk mitigation.

The outage appeared to be limited to a single “availability zone” in only one region.  AWS customers that heeded Amazon’s advice to spread their services across multiple availability zones to ensure resiliency felt little impact. Those that chose not to pay the extra cost associated with multiple zones may be regretting that decision now.  Amazon claims its web services are now operating normally for most customers and the company said it will post a detailed report on what went wrong last week.

The after-action report is going to be interesting but we’re already lots of lessons learned spring up.  The key points seem to be:

  • Cloud outages may be rare but they can happen.   Enterprises need to architect their cloud services for failure by spreading mission-critical, customer-facing services across multiple zones, physical separate data centers and/or multiple cloud providers.  Putting 100% of your IT service eggs in one “cloud” basket is risky.
  • Service level agreements are important and will become even more so.  Gartner’s Lydia Leong provides a great recap of what went wrong and insight into Amazon’s standard EC2 SLAs.

Amazon’s SLA for EC2 is 99.95% for multi-availability zone deployments.  That means you should expect that you can have about 4.5 hours of total region downtime each year without Amazon violating their SLA.  Note, by the way that this outage does not actually violate their SLA.  Their SLA defines unavailability as a lack of external connectivity to EC2 instances, coupled with the inability to provision working instances.  In this case, EC2 was just fine by that definition.  It was Elastic Block Store (EBS) and Relational Database Service (RDS) which weren’t, and neither of those services have SLAs.

It seems likely that some of Amazon’s customers, those most affected by the outage, likely will seek to renegotiate SLA’s to ensure they have more “teeth” and offer compensation for lost revenue.

  • Plan for disaster and how to recovery from it–even in the cloud.  The outage highlighted the fact that some of Amazon’s customers did not have a disaster recovery strategy.  A number of  solution providers  now cover cloud disaster recovery and most can recover physical or virtual machines in a cloud within minutes.  But if you don’t have plan for it, it won’t happen.
  • Cloud deployments are still cheaper that traditional data centers.  For most organizations, the cost of deploying in the cloud remains about 10 times cheaper than building your own data center or even private cloud.
  • Hybrid IT Service Delivery Can Help Reduce Cloud Exposure.  Most mature enterprises move to the cloud in stages resulting in a hybrid portfolio of IT services provisioned from a mix of public cloud, private cloud and traditional IT infrastructure.  While these hybrid service models can be challenge to manage effectively, they do offer some built in resiliency in case any one segment fails.

This post sponsored by the Enterprise CIO Forum and HP.

Read the original blog entry...

More Stories By Bob Gourley

Bob Gourley writes on enterprise IT. He is a founder of Crucial Point and publisher of CTOvision.com

IoT & Smart Cities Stories
Digital Transformation and Disruption, Amazon Style - What You Can Learn. Chris Kocher is a co-founder of Grey Heron, a management and strategic marketing consulting firm. He has 25+ years in both strategic and hands-on operating experience helping executives and investors build revenues and shareholder value. He has consulted with over 130 companies on innovating with new business models, product strategies and monetization. Chris has held management positions at HP and Symantec in addition to ...
DXWorldEXPO LLC announced today that Big Data Federation to Exhibit at the 22nd International CloudEXPO, colocated with DevOpsSUMMIT and DXWorldEXPO, November 12-13, 2018 in New York City. Big Data Federation, Inc. develops and applies artificial intelligence to predict financial and economic events that matter. The company uncovers patterns and precise drivers of performance and outcomes with the aid of machine-learning algorithms, big data, and fundamental analysis. Their products are deployed...
Dynatrace is an application performance management software company with products for the information technology departments and digital business owners of medium and large businesses. Building the Future of Monitoring with Artificial Intelligence. Today we can collect lots and lots of performance data. We build beautiful dashboards and even have fancy query languages to access and transform the data. Still performance data is a secret language only a couple of people understand. The more busine...
The challenges of aggregating data from consumer-oriented devices, such as wearable technologies and smart thermostats, are fairly well-understood. However, there are a new set of challenges for IoT devices that generate megabytes or gigabytes of data per second. Certainly, the infrastructure will have to change, as those volumes of data will likely overwhelm the available bandwidth for aggregating the data into a central repository. Ochandarena discusses a whole new way to think about your next...
CloudEXPO | DevOpsSUMMIT | DXWorldEXPO are the world's most influential, independent events where Cloud Computing was coined and where technology buyers and vendors meet to experience and discuss the big picture of Digital Transformation and all of the strategies, tactics, and tools they need to realize their goals. Sponsors of DXWorldEXPO | CloudEXPO benefit from unmatched branding, profile building and lead generation opportunities.
All in Mobile is a place where we continually maximize their impact by fostering understanding, empathy, insights, creativity and joy. They believe that a truly useful and desirable mobile app doesn't need the brightest idea or the most advanced technology. A great product begins with understanding people. It's easy to think that customers will love your app, but can you justify it? They make sure your final app is something that users truly want and need. The only way to do this is by ...
Cell networks have the advantage of long-range communications, reaching an estimated 90% of the world. But cell networks such as 2G, 3G and LTE consume lots of power and were designed for connecting people. They are not optimized for low- or battery-powered devices or for IoT applications with infrequently transmitted data. Cell IoT modules that support narrow-band IoT and 4G cell networks will enable cell connectivity, device management, and app enablement for low-power wide-area network IoT. B...
The hierarchical architecture that distributes "compute" within the network specially at the edge can enable new services by harnessing emerging technologies. But Edge-Compute comes at increased cost that needs to be managed and potentially augmented by creative architecture solutions as there will always a catching-up with the capacity demands. Processing power in smartphones has enhanced YoY and there is increasingly spare compute capacity that can be potentially pooled. Uber has successfully ...
SYS-CON Events announced today that CrowdReviews.com has been named “Media Sponsor” of SYS-CON's 22nd International Cloud Expo, which will take place on June 5–7, 2018, at the Javits Center in New York City, NY. CrowdReviews.com is a transparent online platform for determining which products and services are the best based on the opinion of the crowd. The crowd consists of Internet users that have experienced products and services first-hand and have an interest in letting other potential buye...
When talking IoT we often focus on the devices, the sensors, the hardware itself. The new smart appliances, the new smart or self-driving cars (which are amalgamations of many ‘things'). When we are looking at the world of IoT, we should take a step back, look at the big picture. What value are these devices providing. IoT is not about the devices, its about the data consumed and generated. The devices are tools, mechanisms, conduits. This paper discusses the considerations when dealing with the...