Data Integrity Error Localization in Networked Systems with Missing Data

07/05/2022
by   Yufeng Xin, et al.
0

Most recent network failure diagnosis systems focused on data center networks where complex measurement systems can be deployed to derive routing information and ensure network coverage in order to achieve accurate and fast fault localization. In this paper, we target wide-area networks that support data-intensive distributed applications. We first present a new multi-output prediction model that directly maps the application level observations to localize the system component failures. In reality, this application-centric approach may face the missing data challenge as some input (feature) data to the inference models may be missing due to incomplete or lost measurements in wide area networks. We show that the presented prediction model naturally allows the multivariate imputation to recover the missing data. We evaluate multiple imputation algorithms and show that the prediction performance can be improved significantly in a large-scale network. As far as we know, this is the first study on the missing data issue and applying imputation techniques in network failure localization.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/30/2020

Handling Missing Data with Graph Representation Learning

Machine learning with missing data has been approached in two different ...
research
08/03/2023

Diffusion-based Time Series Data Imputation for Microsoft 365

Reliability is extremely important for large-scale cloud systems like Mi...
research
01/19/2017

Random Forest Missing Data Algorithms

Random forest (RF) missing data algorithms are an attractive approach fo...
research
01/02/2020

Using Data Imputation for Signal Separation in High Contrast Imaging

To characterize circumstellar systems in high contrast imaging, the fund...
research
01/28/2019

CollaGAN : Collaborative GAN for Missing Image Data Imputation

In many applications requiring multiple inputs to obtain a desired outpu...
research
04/18/2021

Multi-objective Feature Selection with Missing Data in Classification

Feature selection (FS) is an important research topic in machine learnin...

Please sign up or login with your details

Forgot password? Click here to reset