Amazon's outage in third day: debate over cloud computing's future begins

As Amazon’s web services outage passed its third day, the debate on the future of cloud computing is underway. The outage is costing web sites such as Reddit and Quora considerable losses as users turn elsewhere to get their social media needs met.

Amazon’s Elastic Compute Cloud service hosts thousands of major web sites that rely on it to serve pages to users. And users rely on these services to store their personal accounts and data remotely. So when the EC 2 service goes down, so do the web sites, and that means users can’t log in to access their data. It’s a big hiccup for an industry that is supposed to grow to $55 billion by 2014, according to market researcher IDC.

The duration of the outage has surprised many, since Amazon has a lot of backup computing infrastructure. If Amazon can’t safeguard the cloud, how can we rely on it? So the debate begins on the future of cloud computing and what to do to make users and companies put their trust in cloud vendors such as Amazon.

The good thing about the cloud is that it protects users when their own home computers crash and lose data. But the rotten part about the crash of the cloud is that millions upon millions of users become helpless, and any recovery of the data is beyond their control. Some sites spend the money to run mirror sites on other cloud vendors, so the sites can remain functional even if one cloud vendor goes down. But that’s an expensive measure that many web sites haven’t taken.

“This is a wake-up call for cloud computing,” Matthew Eastwood, an analyst for the research firm IDC, told the New York Times. “It will force a conversation in the industry.”

Corporations will have to decide what computer operations to put on a cloud operated by external vendors and how much they should keep inside their own internal data centers. They will also have to figure out the right policies for backup and recovery services. And they will have to decide whether to allocate more money to backup data centers in multiple locations.

That discussion, he said, will most likely center on what data and computer operations to send off to the cloud and what to keep inside the corporate walls. Netflix uses Amazon, but it hasn’t gone offline because it fully uses Amazon’s redundant cloud backup infrastructure. For most startups, those are luxuries that are too expensive, despite the risks.

Amazon Web Services is supposed to offload all of those worries. It has the expertise, the scale, and the access to massive amounts of cheap computing power. Amazon clients that are still having trouble include Foursquare, Quora, Reddit and BigDoor.  The New York Times said the current problem originated in one of Amazon’s data centers in Northern Virginia. BigDoor, which offers gamification services to big companies, went down because its backup and recovery services were limited to Amazon’s data center in Virginia. BigDoor restored service by Friday evening, but its web site was still down. Amazon has been cryptic about the cause and has only said that matters are improving but the problem is still not resolved.

George Reese of O’Reilly wrote that Amazon’s setback is a learning experience and is, for the most part, a shining example of how the cloud works properly. That seems like a severe attempt to see the glass as half full. He argued that the web site developers should have planned for this kind of outage and taken advantage of Amazon’s full backup capabilities.

Eventually, the cloud will become like a utility. You can get as much computing power as you want with the flip of a switch and you won’t have to worry about outages as much over time. But we’re clearly not there yet.

  • http://profiles.yahoo.com/u/V7LGTBIAONDXFOC4A7LRH6LD7U George

    The outage is pretty bad on a number of levels and you do raise a number of important questions. But I want to question this statement:”The outage is costing web sites such as Reddit and Quora considerable losses as users turn elsewhere to get their social media needs met.”Do you have any hard data to support such a statement? Easter is always one of the slowest traffic periods on the net meaning people really are not checking out their social streams, etc., so, in a way, Amazon and its customers were quite lucky this occurred during the Easter weekend. In addition, the two apps you mentioned are not exactly must haves for their customers such that they will immediately go looking for alternatives. One of Quora's benefits is the wealth of info it already contains and that “feature” does not disappear because of this outage. Before putting forth such a blatant and powerful statement, be prepared to back it up. Journalism 101 – otherwise, people will go in search of other writers ;) .

  • http://www.ashleypearson.net Ashley Pearson

    Owned?

  • http://twitter.com/analyticbridge Vincent Granville

    I checked Quora this morning. No sure if the outage is still going on, but Quora was working, fine, a bit slow, but just like usual.

  • http://twitter.com/edenciso Eduardo Enciso

    This Amazon outage has been the best event to happen to the cloud going forward, as many companies will be forced to rethink their cloud strategies to include more robust risk management and disaster recovery policies in their deployment roadmaps.At the same time, it is a wake-up call to public cloud service providers, so they will need to effectively convince customers that this won't become a potential issue for adoption, by integrating into their offerings provisioning and fail-over management capabilities along with more transparent SLAs.I'm convinced the future for delivery models will be an icreasing and dominant adoption for hybrid cloud offerings as it will be almost impossible to overcome concerns about service disruption, data privacy and security.

  • http://twitter.com/FCathala Fabrice Cathala

    Agree on the positive side of the wake-up call for the cloud industry, not so much on the move to hybrid cloud as a HA/DRP alternative.Moving to Hybrid is not fixing the issue, it's changing the problem and will only be better until another “unforeseen circumstance” generate a massive clash. There is no bug free nor error free environment whatever choices are made. We know that the customers impacted didn't “architecture for the cloud” which is going to be an issue in Hybrid Cloud environment.I would personally favour an improvement in openness and two-ways communication between the service provider and its customer. The whole system must work as a single team from the IaaS NOC to the IT manager on the customer side and it seems that wasn't the case. Some customers were ready to swap to “DRP mode” but didn't get a good enough ETA assessment from Amazon.Cheers,Fabrice

  • runningcode

    Amazon's EC2 service does not stand for Elastic Cloud 2. there was no first version. It stands for elastic cloud compute. Your link in your article to Amazon's web page demonstrates this.

  • http://onlineguitarcoaching.com Will Kriski

    Maybe the cloud should be what you said early on, a place to backup your home computer files. In other words it's not the main source of storage. when you introduce a 3rd party system you also introduce dependencies on that system. Or maybe it will never fail again so we don't have to worry.

  • http://www.facebook.com/people/Mullet-Reeve/100000516994402 Mullet Reeve

    Having hard data isn't a requirement to PROVIDE the hard data, especially when it was shared to you in confidence or on condition that it NOT get released.If you want proof, then show us where Reddit and/or Quora dispute the statement. Surely if they were NOT experiencing losses they would take issue with it and deny it, yes?Do the math and stop being pedantic.

  • http://www.facebook.com/people/Mullet-Reeve/100000516994402 Mullet Reeve

    Having hard data isn't a requirement to PROVIDE the hard data, especially when it was shared to you in confidence or on condition that it NOT get released.If you want proof, then show us where Reddit and/or Quora dispute the statement. Surely if they were NOT experiencing losses they would take issue with it and deny it, yes?Do the math and stop being pedantic.

  • http://virtualizationcloudcomputing.com/cloud-computing-future/ Cloud Computing Future | Computer Blog | Computer Tips and Tricks | Everything you need to know

    [...] this occurred during the Easter weekend. Agree on the positive side of the wake-up call for the Cloud industry, not so much on the move to hybrid Cloud as a HA/DRP [...]

  • http://recentnews.eu/5-ways-technology-is-actually-getting-worse-yahoo-news/ 5 ways technology is actually getting worse (Yahoo! News) | Recent News

    [...] it directly in the hands of a company you trust. If the service flickers or goes offline — like Amazon’s 4 day-long server outage this April — you’ll be up a creek without your [...]

  • http://newsperuse.com/5-ways-technology-is-actually-getting-worse-yahoo-news.html 5 ways technology is actually getting worse (Yahoo! News) | Breaking News Today

    [...] it directly in the hands of a company you trust. If the service flickers or goes offline — like Amazon’s 4 day-long server outage this April — you’ll be up a creek without your [...]

  • http://www.popularscience.eu/5-ways-technology-is-actually-getting-worse-yahoo-news.html/ 5 ways technology is actually getting worse (Yahoo! News) | Popular Science

    [...] it directly in the hands of a company you trust. If the service flickers or goes offline — like Amazon’s 4 day-long server outage this April — you’ll be up a creek without your [...]

  • http://technewsguru417.wordpress.com/2011/08/10/the-tech-life-how-much-wow-can-it-be/ The Tech Life, How Much Wow Can It Be? « Technewsguru417's Blog

    [...] it directly in the hands of a company you trust. If the service flickers or goes offline — like Amazon’s 4 day-long server outage this April — you’ll be up a creek without your [...]

  • http://donnachaoflahertymultimedia.wordpress.com/2011/10/11/kevin-kellys-video/ Kevin Kelly’s Video. « donnachaoflahertymultimedia

    [...] Having watched the video I thought the idea of a cloud computer was very fascinating. The fact that computers would “touch the cloud” as Kevin said instead of having hardrives seems amazing. Being able to simply buy more power online or buy renting software online instead of physically installing the software on your hardrive would be both cost effective for the selling company and the buying customer. Data would be much safer if it was saved in the “cloud” then on a physical hardrive  from water damage or electrical damage for example. However companies such as Amazon who are thought to be at the for-front of cloud computing have had there problems as seen in the article below. Although I do believe cloud computing is the future i think there are still some problems to be ironed out and some public opinions that need to be changed. http://venturebeat.com/2011/04/23/amazons-outage-in-third-day-debate-over-cloud-computings-future-be… [...]

  • http://www.experts-bto.com/2012/04/27/5-ways-technology-is-actually-getting-worse/ 5 ways technology is actually getting worse | Experts Bto

    [...] it directly in the hands of a company you trust. If the service flickers or goes offline — like Amazon’s 4 day-long server outage this April — you’ll be up a creek without your [...]

  • http://www.molecularstudios.com/mobile-news/5-ways-technology-is-actually-getting-worse/ 5 ways technology is actually getting worse | molecular studios

    [...] it directly in the hands of a company you trust. If the service flickers or goes offline — like Amazon’s 4 day-long server outage this April — you’ll be up a creek without your [...]

  • http://www.opensourceportal.org/5-ways-technology-is-actually-getting-worse/ 5 ways technology is actually getting worse – Open Source Portal | Open Source Portal

    [...] it directly in the hands of a company you trust. If the service flickers or goes offline — like Amazon’s 4 day-long server outage this April — you’ll be up a creek without your [...]

  • http://techn4all.in/?p=3717 5 ways technology is actually getting worse | latest techn

    [...] it directly in the hands of a company you trust. If the service flickers or goes offline — like Amazon’s 4 day-long server outage this April — you’ll be up a creek without your [...]

  • http://www.addfree.eu/5-ways-technology-is-actually-getting-worse/ 5 ways technology is actually getting worse | AddFree.eu

    [...] the service flickers or goes offline — like Amazon's 4 day-long server outage this Apr — you'll be up a rivulet though [...]

  • http://cloud-computing-economics.com/business-benefits-applications/amazons-ec2-cloud-computing-service-fourth-day/ Amazon’s EC2 cloud computing service down the fourth day | Cloud Computing Economics

    [...] I stumble upon this article by Dean Takahashi with the title, “Amazon’s outage in third day: debate over cloud computing’s future begins”. Dropbox’s outage is part of a huge outage of Amazon’s EC2 cloud service due to [...]

  • http://www.articlesandproducts.com/traffic-generation/make-money-with-amazon-on-true-auto/ making money with amazon

    making money with amazon…

    Amazon’s outage in third day: debate over cloud computing’s future begins | VentureBeat…

  • http://venturebeat.com/2012/06/29/amazon-outage-netflix-instagram-pinterest/ Amazon cloud outage takes down Netflix, Instagram, Pinterest, & more | VentureBeat

    [...] just-introduced Compute Engine. Some of Amazon’s biggest major outages occurred in April and August of last [...]

  • http://www.simplyboundless.com/2012/06/amazon-cloud-outage-takes-down-netflix-instagram-pinterest-more/ Amazon cloud outage takes down Netflix, Instagram, Pinterest, & more | Simply Boundless Entertainment

    [...] just-introduced Compute Engine. Some of Amazon’s biggest major outages occurred in April and August of last [...]

  • http://www.notipra.com/2012/06/30/amazon-cloud-outage-takes-down-netflix-instagram-pinterest-more/ Amazon cloud outage takes down Netflix, Instagram, Pinterest, & more | NotiPra.Com – La Noticia Del Momento!!

    [...] just-introduced Compute Engine. Some of Amazon’s biggest EC2 outages occurred in April and August of last [...]

blog comments powered by Disqus