A Missing Value Filling Model Based on Feature Fusion Enhanced Autoencoder

08/29/2022
by   Xinyao Liu, et al.
0

With the advent of the big data era, the data quality problem is becoming more and more crucial. Among many factors, data with missing values is one primary issue, and thus developing effective imputation models is a key topic in the research community. Recently, a major research direction is to employ neural network models such as selforganizing mappings or automatic encoders for filling missing values. However, these classical methods can hardly discover correlation features and common features simultaneously among data attributes. Especially,it is a very typical problem for classical autoencoders that they often learn invalid constant mappings, thus dramatically hurting the filling performance. To solve the above problems, we propose and develop a missing-value-filling model based on a feature-fusion-enhanced autoencoder. We first design and incorporate into an autoencoder a hidden layer that consists of de-tracking neurons and radial basis function neurons, which can enhance the ability to learn correlated features and common features. Besides, we develop a missing value filling strategy based on dynamic clustering (MVDC) that is incorporated into an iterative optimization process. This design can enhance the multi-dimensional feature fusion ability and thus improves the dynamic collaborative missing-value-filling performance. The effectiveness of our model is validated by experimental comparisons to many missing-value-filling methods that are tested on seven datasets with different missing rates.

READ FULL TEXT
research
12/23/2020

IFGAN: Missing Value Imputation using Feature-specific Generative Adversarial Networks

Missing value imputation is a challenging and well-researched topic in d...
research
06/26/2021

FCMI: Feature Correlation based Missing Data Imputation

Processed data are insightful, and crude data are obtuse. A serious thre...
research
02/26/2019

Optimal Clustering with Missing Values

Missing values frequently arise in modern biomedical studies due to vari...
research
07/22/2013

Performance comparison of State-of-the-art Missing Value Imputation Algorithms on Some Bench mark Datasets

Decision making from data involves identifying a set of attributes that ...
research
02/17/2022

A Machine Learning Approach for Automated Filling of Data Entry Forms

Users frequently interact with software systems through data entry forms...
research
08/05/2018

Missing Value Imputation Based on Deep Generative Models

Missing values widely exist in many real-world datasets, which hinders t...
research
05/11/2020

SCAT: Second Chance Autoencoder for Textual Data

We present a k-competitive learning approach for textual autoencoders na...

Please sign up or login with your details

Forgot password? Click here to reset