BatchLens: A Visualization Approach for Analyzing Batch Jobs in Cloud Systems

12/31/2021
by   Shaolun Ruan, et al.
0

Cloud systems are becoming increasingly powerful and complex. It is highly challenging to identify anomalous execution behaviors and pinpoint problems by examining the overwhelming intermediate results/states in complex application workflows. Domain scientists urgently need a friendly and functional interface to understand the quality of the computing services and the performance of their applications in real time. To meet these needs, we explore data generated by job schedulers and investigate general performance metrics (e.g., utilization of CPU, memory and disk I/O). Specifically, we propose an interactive visual analytics approach, BatchLens, to provide both providers and users of cloud service with an intuitive and effective way to explore the status of system batch jobs and help them conduct root-cause analysis of anomalous behaviors in batch jobs. We demonstrate the effectiveness of BatchLens through a case study on the public Alibaba bench workload trace datasets.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/24/2021

Towards Accommodating Real-time Jobs on HPC Platforms

Increasing data volumes in scientific experiments necessitate the use of...
research
07/30/2019

CloudDet: Interactive Visual Analysis of Anomalous Performances in Cloud Computing Systems

Detecting and analyzing potential anomalous performances in cloud comput...
research
08/05/2020

Best of Both Worlds: High Performance Interactive and Batch Launching

Rapid launch of thousands of jobs is essential for effective interactive...
research
08/04/2023

A Deep Dive into the Google Cluster Workload Traces: Analyzing the Application Failure Characteristics and User Behaviors

Large-scale cloud data centers have gained popularity due to their high ...
research
04/11/2019

A Processor-Sharing model for the Performance of Virtualized Network Functions

The parallel execution of requests in a Cloud Computing platform, as for...
research
02/17/2023

CarbonScaler: Leveraging Cloud Workload Elasticity for Optimizing Carbon-Efficiency

Cloud platforms are increasingly emphasizing sustainable operations in o...
research
04/16/2018

Chronos: A Unifying Optimization Framework for Speculative Execution of Deadline-critical MapReduce Jobs

Meeting desired application deadlines in cloud processing systems such a...

Please sign up or login with your details

Forgot password? Click here to reset