Data Center Outages and USDC Technology’s Effort in Service Leveraging

Date:

In September 2018, we did witness a severe outage of a data center in Vietnam for nearly 2.5 hours. This event utterly bewildered internet users and raised considerable doubt in data center operation. As more workloads find their way to the cloud and demand increases, it would not be surprising for cloud customers to experience more outages. Yet this has not proved to be the case—at least not directly. According to the Uptime Institute’s 2022 annual outage analysis, over the past three years, public cloud outages are occurring at about the same historical rate. In Uptime’s 2022 Data Center Resiliency Survey, 80% of data center managers and operators said they had experienced an outage within the past three years.

Causes of Outage have not much changed

Analyzing human error — with a view to preventing it — has always been challenging for data center operators. The cause of a failure can lie in how well a process was taught, how tired, well-trained, or resourced the staff is, or whether the equipment itself was unnecessarily difficult to operate.

Most common causes of major human error-related outages
There are also definitional questions: If a machine fails due to a software error at the factory, is that human error? In addition, human error can play a role in outages that are attributed to other causes. For these reasons, Uptime Institute tends to research human error in a different way to other causes — we view it as one causal factor in outages, and rarely a single or root cause. Using this methodology, Uptime estimates (based on 25 years of data) that human error plays some role in about two-thirds of all outages.
In Uptime’s recent surveys on resiliency, we have tried to understand the makeup of some of these human error-related failures. As the figure above shows, human error-related outages are most commonly caused either by staff failing to follow procedures (even where they have been agreed and codified) or because the procedures themselves are faulty.
Historically, a loss of power is the No. 1 cause of significant outages. But most of those outages, too, were caused by human error. Over the last 25 years, electrical failures “have accounted for 80% of all IT load losses in data centers,” the report said.

Networking- and connectivity-related issues represent another leading cause of significant outages. These outages are driven primarily by the fact that IT architectures and application topologies are becoming more complex by the day. Organizations today increasingly rely on a mix of on-premises hardware and services, multiple cloud providers, and third-party APIs running in containers and virtual machines.

Even though all of the technologies involved were functioning as designed, when combined into a system, they created a situation that led to an outage.

Increase in Recovery Time & Cost

As with human error, according to Miles, increasing recovery times also affect the cloud reliability equation.

Agile programming methodologies, DevOps, and automated continuous integration and deployment (CI/CD) pipeline all push updates from development into production more quickly. This can create situations where IT administrators (who are tasked with the mundane business of keeping lights on) and developers don’t talk to one another about important production-related issues. This lack of communication means IT is not up to date on all of the changes that are taking place in the production environments it is charged with maintaining. When outages happen, it can take IT much longer to figure out what the root causes are and what to do about them.

Thoroughly understanding the above concerns of customers, USDC Technology always appreciates the meticulousness in each single phase of data center construction and operation. We provide an integrated design-build-operate process in our consultancy services. Our accredited team will verify and validate the design that is consistent and meets the requirements following the standards, such as Uptime Tier, ANSI/TIA-942, and TCVN9250,…hence reducing unnecessary iterative process in submitting the acceptable design by the mentioned standards. Furthermore, we will definitely enable you to own a standardized construction with the most optimized expenses in building up and operation.

——————————
About USDC Technology
Universal Smart Data Center Technology was established by people with a broad vision of Information and Communication Technology. They are a professional and leading company in technology construction for Smart Data Center services in Vietnam and the region. Its commitment to taking total ownership of projects has resulted in an enviable client portfolio, featuring some of the most renowned brands. Its mission is to deliver society the most optimal products and services by applying the latest technologies.
 ——————————
USDC Technology Data Center
Launched 11/2020, USDC Technology Data Center was built on class Tier III. Located in Sai Gon Hitech Park, District 9, Ho Chi Minh City, Vietnam (so-called Vietnam’s Silicon Valley). The data center is available in connection with all large networks, located in a strategic site to cover the East of Ho Chi Minh city. Our world-class data centers provide full-scale services. At USDC Technology, nothing is of greater importance to us than keeping your applications online and your data secure.
——————————
News Contact:
Universal Smart Data Center Technology
Phone: (+84) 28 73080708
Email: info@usdc.vn

 

LEAVE A REPLY

Please enter your comment!
Please enter your name here

Share post:

Popular