Localizing Faults in Cloud Systems

03/01/2018
by   Leonardo Mariani, et al.
0

By leveraging large clusters of commodity hardware, the Cloud offers great opportunities to optimize the operative costs of software systems, but impacts significantly on the reliability of software applications. The lack of control of applications over Cloud execution environments largely limits the applicability of state-of-the-art approaches that address reliability issues by relying on heavyweight training with injected faults. In this paper, we propose (LOUD, a lightweight fault localization approach that relies on positive training only, and can thus operate within the constraints of Cloud systems. LOUD relies on machine learning and graph theory. It trains machine learning models with correct executions only, and compensates the inaccuracy that derives from training with positive samples, by elaborating the outcome of machine learning techniques with graph theory algorithms. The experimental results reported in this paper confirm that LOUD can localize faults with high precision, by relying only on a lightweight positive training.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/23/2021

Fault Localization in Cloud using Centrality Measures

Fault localization is an imperative method in fault tolerance in a distr...
research
09/05/2021

Tolerating Adversarial Attacks and Byzantine Faults in Distributed Machine Learning

Adversarial attacks attempt to disrupt the training, retraining and util...
research
11/21/2019

Predicting Failures in Multi-Tier Distributed Systems

Many applications are implemented as multi-tier software systems, and ar...
research
07/27/2020

A Machine Learning Approach to Online Fault Classification in HPC Systems

As High-Performance Computing (HPC) systems strive towards the exascale ...
research
11/25/2019

Distortion and Faults in Machine Learning Software

Machine learning software, deep neural networks (DNN) software in partic...
research
09/06/2021

Towards API Testing Across Cloud and Edge

API economy is driving the digital transformation of business applicatio...
research
08/13/2023

Optimizing the cloud? Don't train models. Build oracles!

We propose cloud oracles, an alternative to machine learning for online ...

Please sign up or login with your details

Forgot password? Click here to reset