AWS builds a DNS backstop to allow changes when its notoriously flaky US East region wobbles
The cause of major internet outages is often the domain name system (DNS) and/or problems at Amazon Web Services’ US East region. The cloud giant has now made a change that will make its own role in such outages less painful.
As explained in a Wednesday post, AWS customers told the cloud colossus “they need additional DNS resilience capabilities to meet their business continuity requirements and regulatory compliance obligations.”
“Organizations in regulated industries like banking, FinTech, and SaaS want the confidence that they will be able to make DNS changes even during unexpected regional disruptions, allowing them to quickly provision standby cloud resources or redirect traffic when needed,” the post adds.
AWS’s response to those needs is a feature “designed to provide a 60-minute recovery time objective (RTO) during service disruptions in the US East.”
Elsewhere in the post, AWS says the feature targets “DNS changes that customers can make within 60 minutes of a service disruption in the US East.”
That still leaves plenty of time for problems at the cloudy region to create a big blast radius of outages and service interruptions – and potential for worse interruptions if AWS doesn’t meet its RTO.
The mere fact that AWS has created this service speaks to the long history of problems at US East, including the DynamoDB debacle on October 20th, a VM problem a few days later, plus significant outages in 2021 and 2023.
AWS, of course, knew about those problems, and as far back as 2022 analyst firm Gartner warned its customers that US East represents a weak point in the Amazonian cloud that impairs its ability to handle crises.
Yet last year, AWS told The Register that the scale of US East is not less reliable than its other regions, but operates at such colossal scale that it stresses cloud services more sternly than its smaller installations.
And here we are, less than six weeks after an especially bad problem at US East earned AWS criticism, and the cloud giant has found a way to increase resilience. ®