In Datacenter Performance, The Only Constant Is Change

03/10/2020
by Dmitry Duplyakin, et al.

All computing infrastructure, whether bare-metal or virtualized, suffers from performance variability. This variability has many sources: some are transient, such as noisy neighbors; others are longer-lasting but arrive suddenly, such as hardware changes or wear, changes in the underlying hypervisor stack, or even undocumented interactions between the policies of the computing resource provider and the active workloads. As a result, performance measurements obtained on clouds, HPC facilities, and datacenter environments more generally are almost guaranteed to exhibit performance regimes that evolve over time, which leads to undesirable nonstationarities in application performance. In this paper, we analyze the performance of the bare-metal hardware available on the CloudLab testbed, focusing on quantifying the evolving performance regimes using changepoint detection. We describe our findings, backed by a dataset of nearly 6.9M benchmark results collected from over 1600 machines over a period of 2 years and 9 months. These findings yield a comprehensive characterization of real-world performance variability patterns in one computing facility and a methodology for studying such patterns on other infrastructures, and they contribute to a better understanding of performance variability in general.
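
To make the idea of changepoint detection concrete, here is a minimal Python sketch. It is not the authors' method, which the abstract does not specify; it assumes a simple mean-shift model and uses binary segmentation with a squared-error cost. The function names, the `penalty` threshold, the `min_size` parameter, and the synthetic series are all hypothetical choices made for illustration.

```python
from statistics import fmean


def sse(values):
    """Sum of squared deviations from the mean (cost of treating values as one regime)."""
    if not values:
        return 0.0
    m = fmean(values)
    return sum((v - m) ** 2 for v in values)


def best_split(values, min_size=3):
    """Return (index, gain) for the single split that most reduces the total cost."""
    total = sse(values)
    best_idx, best_gain = None, 0.0
    for i in range(min_size, len(values) - min_size + 1):
        gain = total - sse(values[:i]) - sse(values[i:])
        if gain > best_gain:
            best_idx, best_gain = i, gain
    return best_idx, best_gain


def detect_changepoints(values, penalty, min_size=3):
    """Binary segmentation: keep splitting while the cost reduction exceeds penalty."""
    idx, gain = best_split(values, min_size)
    if idx is None or gain <= penalty:
        return []
    left = detect_changepoints(values[:idx], penalty, min_size)
    right = detect_changepoints(values[idx:], penalty, min_size)
    # Indices found in the right half are relative to idx, so shift them back.
    return left + [idx] + [idx + r for r in right]


# Synthetic benchmark scores (made up): a regime near 100, then a sudden drop to near 90.
series = [100.0, 101.0, 99.0, 100.0, 101.0, 99.0, 100.0, 101.0, 99.0, 100.0,
          90.0, 91.0, 89.0, 90.0, 91.0, 89.0, 90.0, 91.0, 89.0, 90.0]
print(detect_changepoints(series, penalty=50.0))  # prints [10]
```

The `penalty` controls sensitivity: a lower value reports smaller shifts as new regimes, while a higher value reports only large, sustained changes. Real benchmark data would likely need a more robust cost function and a principled way to choose the penalty.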

research
12/14/2020

Prediction of High-Performance Computing Input/Output Variability and Its Application to Optimization for System Configurations

Performance variability is an important measure for a reliable high perf...
research
06/05/2022

BenchFaaS: Benchmarking Serverless Functions in an Edge Computing Network Testbed

The serverless computing model has evolved as one of the key solutions i...
research
10/27/2022

Noise in the Clouds: Influence of Network Performance Variability on Application Scalability

Cloud computing represents an appealing opportunity for cost-effective d...
research
12/13/2018

Continuous evaluation of the performance of cloud infrastructure for scientific applications

Cloud computing recently developed into a viable alternative to on-premi...
research
12/28/2022

Web-based volunteer distributed computing for handling time-critical urgent workloads

Urgent computing workloads are time critical, unpredictable, and highly ...
research
09/21/2023

A Multi-faceted Analysis of the Performance Variability of Virtual Machines

Cloud computing and virtualization solutions allow one to rent the virtu...
research
06/19/2019

Collecting and Presenting Reproducible Intranode Stencil Performance: INSPECT

Stencil algorithms have been receiving considerable interest in HPC rese...
