DeepAI
Log In Sign Up

Localizing Faults in Cloud Systems

03/01/2018
by   Leonardo Mariani, et al.
0

By leveraging large clusters of commodity hardware, the Cloud offers great opportunities to optimize the operative costs of software systems, but impacts significantly on the reliability of software applications. The lack of control of applications over Cloud execution environments largely limits the applicability of state-of-the-art approaches that address reliability issues by relying on heavyweight training with injected faults. In this paper, we propose (LOUD, a lightweight fault localization approach that relies on positive training only, and can thus operate within the constraints of Cloud systems. LOUD relies on machine learning and graph theory. It trains machine learning models with correct executions only, and compensates the inaccuracy that derives from training with positive samples, by elaborating the outcome of machine learning techniques with graph theory algorithms. The experimental results reported in this paper confirm that LOUD can localize faults with high precision, by relying only on a lightweight positive training.

READ FULL TEXT

page 1

page 2

page 3

page 4

09/23/2021

Fault Localization in Cloud using Centrality Measures

Fault localization is an imperative method in fault tolerance in a distr...
09/05/2021

Tolerating Adversarial Attacks and Byzantine Faults in Distributed Machine Learning

Adversarial attacks attempt to disrupt the training, retraining and util...
11/21/2019

Predicting Failures in Multi-Tier Distributed Systems

Many applications are implemented as multi-tier software systems, and ar...
11/25/2019

Distortion and Faults in Machine Learning Software

Machine learning software, deep neural networks (DNN) software in partic...
10/20/2017

Hardened Paxos Through Consistency Validation

Due to the emergent adoption of distributed systems when building applic...
09/06/2021

Towards API Testing Across Cloud and Edge

API economy is driving the digital transformation of business applicatio...
04/11/2019

On Byzantine Fault Tolerance in Multi-Master Kubernertes Clusters

Docker container virtualization technology is being widely adopted in cl...