FCMI: Feature Correlation based Missing Data Imputation

06/26/2021
by   Prateek Mishra, et al.
0

Processed data are insightful, and crude data are obtuse. A serious threat to data reliability is missing values. Such data leads to inaccurate analysis and wrong predictions. We propose an efficient technique to impute the missing value in the dataset based on correlation called FCMI (Feature Correlation based Missing Data Imputation). We have considered the correlation of the attributes of the dataset, and that is our central idea. Our proposed algorithm picks the highly correlated attributes of the dataset and uses these attributes to build a regression model whose parameters are optimized such that the correlation of the dataset is maintained. Experiments conducted on both classification and regression datasets show that the proposed imputation technique outperforms existing imputation algorithms.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/10/2023

Correlation visualization under missing values: a comparison between imputation and direct parameter estimation methods

Correlation matrix visualization is essential for understanding the rela...
research
11/17/2015

Optimized Linear Imputation

Often in real-world datasets, especially in high dimensional data, some ...
research
05/10/2022

Explainable Data Imputation using Constraints

Data values in a dataset can be missing or anomalous due to mishandling ...
research
10/26/2022

Imputation of missing values in multi-view data

When missing values occur in multi-view data, all features in a view are...
research
11/05/2022

Towards a methodology for addressing missingness in datasets, with an application to demographic health datasets

Missing data is a common concern in health datasets, and its impact on g...
research
06/10/2022

Provable Guarantees for Sparsity Recovery with Deterministic Missing Data Patterns

We study the problem of consistently recovering the sparsity pattern of ...
research
08/29/2022

A Missing Value Filling Model Based on Feature Fusion Enhanced Autoencoder

With the advent of the big data era, the data quality problem is becomin...

Please sign up or login with your details

Forgot password? Click here to reset