Explainable Data Imputation using Constraints

05/10/2022
by   Sandeep Hans, et al.
0

Data values in a dataset can be missing or anomalous due to mishandling or human error. Analysing data with missing values can create bias and affect the inferences. Several analysis methods, such as principle components analysis or singular value decomposition, require complete data. Many approaches impute numeric data and some do not consider dependency of attributes on other attributes, while some require human intervention and domain knowledge. We present a new algorithm for data imputation based on different data type values and their association constraints in data, which are not handled currently by any system. We show experimental results using different metrics comparing our algorithm with state of the art imputation techniques. Our algorithm not only imputes the missing values but also generates human readable explanations describing the significance of attributes used for every imputation.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/26/2021

FCMI: Feature Correlation based Missing Data Imputation

Processed data are insightful, and crude data are obtuse. A serious thre...
research
04/06/2020

Establishing strong imputation performance of a denoising autoencoder in a wide range of missing data problems

Dealing with missing data in data analysis is inevitable. Although power...
research
04/28/2023

Counterfactual Explanation with Missing Values

Counterfactual Explanation (CE) is a post-hoc explanation method that pr...
research
04/30/2018

Imputation of mixed data with multilevel singular value decomposition

Statistical analysis of large data sets offers new opportunities to bett...
research
12/23/2022

The Consistency of Probabilistic Databases with Independent Cells

A probabilistic database with attribute-level uncertainty consists of re...
research
09/21/2021

Fairness without Imputation: A Decision Tree Approach for Fair Prediction with Missing Values

We investigate the fairness concerns of training a machine learning mode...
research
10/04/2021

Internal Data Imputation in Data Warehouse Dimensions

Missing values occur commonly in the multidimensional data warehouses. T...

Please sign up or login with your details

Forgot password? Click here to reset