Machine Learning Framework for Performance Anomaly in OpenMP Multi-Threaded Systems

11/03/2020
by Weidong Wang, et al.

Some OpenMP multi-threaded applications increasingly suffer from performance anomalies owing to shared resource contention as well as software- and hardware-related problems. Such anomalies can result in failures and inefficiencies and are among the main challenges in system resiliency. To minimize their impact, one must quickly and accurately detect and diagnose the anomalies that cause the failures. However, it is difficult to identify anomalies in the dynamic and noisy data collected by OpenMP multi-threaded monitoring infrastructures. This paper presents a novel machine learning framework for detecting performance anomalies in OpenMP multi-threaded systems. To evaluate the framework, we use the NAS Parallel Benchmarks (NPB), the EPCC OpenMP micro-benchmark suite, and the Jacobi benchmark. The experimental results demonstrate that the framework successfully identifies 90.3% of the anomalies injected into OpenMP multi-threaded applications.

research
11/03/2020

Heartbeat Diagnosis of Performance Anomaly in OpenMP Multi-Threaded Systems

This paper presents a novel heartbeat diagnosis regarding performance an...
research
11/12/2022

Unsupervised Anomaly Appraisal of Cleft Faces Using a StyleGAN2-based Model Adaptation Technique

This paper presents a novel machine learning framework to consistently d...
research
05/21/2018

Identifying OSPF Anomalies Using Recurrence Quantification Analysis

Open Shortest Path First (OSPF) is one of the most widely used routing p...
research
12/15/2022

A Comprehensive Study on Off-path SmartNIC

SmartNIC has recently emerged as an attractive device to accelerate dist...
research
03/09/2021

Learning Dependencies in Distributed Cloud Applications to Identify and Localize Anomalies

Operation and maintenance of large distributed cloud applications can qu...
research
03/12/2021

Reptile: Aggregation-level Explanations for Hierarchical Data

Recent query explanation systems help users understand anomalies in aggr...
research
04/25/2020

Urban Anomaly Analytics: Description, Detection, and Prediction

Urban anomalies may result in loss of life or property if not handled pr...
