Missing Value Imputation With Unsupervised Backpropagation

12/19/2013
by Michael S. Gashler, et al.

Many data mining and data analysis techniques operate on dense matrices or complete tables of data. Real-world datasets, however, often contain unknown values. Even many classification algorithms that are designed to operate with missing values still lose accuracy when values are missing. One approach to handling missing values is to fill in (impute) them. In this paper, we present a technique for unsupervised learning called Unsupervised Backpropagation (UBP), which trains a multi-layer perceptron to fit the manifold sampled by a set of observed point-vectors. We evaluate UBP on the task of imputing missing values in datasets, and show that UBP predicts missing values with significantly lower sum-squared error than other collaborative filtering and imputation techniques. We also demonstrate with 24 datasets and 9 supervised learning algorithms that classification accuracy is usually higher when randomly withheld values are imputed using UBP rather than with other methods.
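
The core idea in the abstract, a multi-layer perceptron trained to reproduce observed values from learned latent inputs, can be illustrated with a small NumPy sketch. This is not the authors' implementation: the abstract does not give the training schedule, so the sketch assumes a single joint phase of full-batch gradient descent. The function name `ubp_impute` and its parameters (`latent_dim`, `hidden`, `epochs`, `lr`, `seed`) are hypothetical names chosen for illustration. Missing entries are assumed to be marked with NaN.

```python
import numpy as np


def ubp_impute(X, latent_dim=2, hidden=8, epochs=500, lr=0.1, seed=0):
    """Fill NaN entries of X by jointly fitting latent inputs and a small MLP.

    Simplified sketch (an assumption, not the paper's procedure): one joint
    training phase with full-batch gradient descent.
    """
    rng = np.random.default_rng(seed)
    X = np.asarray(X, dtype=float)
    observed = ~np.isnan(X)              # True where a value is known
    target = np.where(observed, X, 0.0)  # placeholder zeros in the gaps
    n, d = X.shape

    # One learnable latent vector per row, plus the MLP parameters.
    V = 0.1 * rng.standard_normal((n, latent_dim))
    W1 = 0.1 * rng.standard_normal((latent_dim, hidden))
    b1 = np.zeros(hidden)
    W2 = 0.1 * rng.standard_normal((hidden, d))
    b2 = np.zeros(d)

    losses = []
    n_obs = max(int(observed.sum()), 1)
    for _ in range(epochs):
        # Forward pass: latent vectors in, reconstructed rows out.
        H = np.tanh(V @ W1 + b1)
        Y = H @ W2 + b2
        # Error only on observed entries; missing ones contribute nothing.
        E = np.where(observed, Y - target, 0.0)
        losses.append(0.5 * float((E ** 2).sum()) / n_obs)

        # Backward pass through the network and into the latent inputs.
        dZ = (E @ W2.T) * (1.0 - H ** 2)
        grad_V = dZ @ W1.T
        W2 -= lr * (H.T @ E) / n
        b2 -= lr * E.mean(axis=0)
        W1 -= lr * (V.T @ dZ) / n
        b1 -= lr * dZ.mean(axis=0)
        V -= lr * grad_V

    # Keep known values; use network predictions only for the gaps.
    Y = np.tanh(V @ W1 + b1) @ W2 + b2
    return np.where(observed, X, Y), losses
```

Because the error is masked, gradients flow only from observed entries, yet every row still receives a full predicted vector, which is what supplies the imputed values. The returned `losses` list tracks the mean squared error on observed entries per epoch and can be used to check that training is converging.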


research
04/21/2009

Using Association Rules for Better Treatment of Missing Values

The quality of training data for knowledge discovery in databases (KDD) ...
research
02/28/2022

Missing Value Estimation using Clustering and Deep Learning within Multiple Imputation Framework

Missing values in tabular data restrict the use and performance of machi...
research
07/22/2013

Performance comparison of State-of-the-art Missing Value Imputation Algorithms on Some Bench mark Datasets

Decision making from data involves identifying a set of attributes that ...
research
04/21/2009

Introducing Partial Matching Approach in Association Rules for Better Treatment of Missing Values

Handling missing values in training datasets for constructing learning m...
research
03/21/2015

Fast Imbalanced Classification of Healthcare Data with Missing Values

In medical domain, data features often contain missing values. This can ...
research
06/30/2020

Tomographic Auto-Encoder: Unsupervised Bayesian Recovery of Corrupted Data

We propose a new probabilistic method for unsupervised recovery of corru...
research
02/08/2016

Multi-view Kernel Completion

In this paper, we introduce the first method that (1) can complete kerne...