Glitches in Amazon Web Services’ North Virginia facility caused problems Sunday for Netflix, Tinder and other major websites.
Sourced through Scoop.it from: www.pcworld.com
Netflix, Tinder and other major websites were affected for a time Sunday by glitches in Amazon Web Services’ Northern Virginia facility, offering a cautionary lesson to other companies that rely on the cloud service for mission-critical capabilities. The problem manifested itself primarily in the form of higher-than-normal error rates. Sites affected reportedly also included IMDb and Amazon’s Instant Video and Books websites. At the heart of the snafu were issues with AWS’s DynamoDB database, but it spread to include other services such as EC2, the mobile-focused Cognito service and the CloudWatch monitoring service, according to the AWS Service Health Dashboard. “The root cause began with a portion of our metadata service within DynamoDB,” AWS explained in a dashboard update posted at 4:52 a.m. PDT on Sunday. “This is an internal sub-service which manages table and partition information. Our recovery efforts are now focused on restoring metadata operations. We will be throttling APIs as we work on recovery.” Indeed, “the fix was applied quickly, AWS owned it and recovery started almost immediately,” agreed Dave Bartoletti, a principal analyst with Forrester. “In my experience, AWS can handle one or two of these a year without significantly scaring customers.”
More than anything, he added, “it’s a wake-up call to design your cloud apps for failure.”