Imputing missing values with unsupervised random trees

11/15/2019
by   David Cortes, et al.

This work proposes a non-iterative strategy for missing-value imputation that is guided by similarity between observations. Instead of explicitly computing distances or nearest neighbors, it assigns observations to overlapping buckets through recursive semi-random hyperplane cuts, and within those buckets it computes weighted averages that serve as the imputations for each variable. The quality of these imputations is often not as good as that of chained equations, but the proposed technique is much faster, can impute new data without recalculating anything, and scales easily to large and high-dimensional datasets. When other methods are not feasible, it provides a significant boost over simple mean/median imputation in regression and classification metrics obtained with the imputed values.
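The bucket idea can be illustrated with a minimal Python sketch. This is not the paper's implementation: the function names (`build_tree`, `fit_trees`, `bucket_stats`, `impute`), the two-column hyperplanes, the uniform choice of cut point, the half-and-half weight split for rows whose projection is unknown, and the count-based bucket weights are all assumptions made for illustration. Rows with a missing value in a split column are sent down both branches, which is what makes the buckets overlap.

```python
import numpy as np

def build_tree(X, rows, rng, depth=0, max_depth=6, min_size=8):
    """Recursively split `rows` with random hyperplane cuts (illustrative only)."""
    sub = X[rows]
    # Per-column count and mean of the observed values in this bucket.
    counts = (~np.isnan(sub)).sum(axis=0)
    sums = np.nansum(sub, axis=0)
    means = np.divide(sums, counts, out=np.full(X.shape[1], np.nan), where=counts > 0)
    node = {"means": means, "counts": counts}
    if depth >= max_depth or len(rows) < min_size:
        return node
    # Assumed design: a random hyperplane over at most two random columns.
    cols = rng.choice(X.shape[1], size=min(2, X.shape[1]), replace=False)
    coefs = rng.normal(size=cols.size)
    proj = sub[:, cols] @ coefs          # NaN when a needed column is missing
    known = ~np.isnan(proj)
    if known.sum() < 2 or proj[known].min() == proj[known].max():
        return node
    cut = rng.uniform(proj[known].min(), proj[known].max())
    known_rows, known_proj = rows[known], proj[known]
    both = rows[~known]                  # unknown side: goes to both children
    left = np.concatenate([known_rows[known_proj <= cut], both])
    right = np.concatenate([known_rows[known_proj > cut], both])
    node.update(cols=cols, coefs=coefs, cut=cut,
                left=build_tree(X, left, rng, depth + 1, max_depth, min_size),
                right=build_tree(X, right, rng, depth + 1, max_depth, min_size))
    return node

def fit_trees(X, n_trees=50, seed=0):
    rng = np.random.default_rng(seed)
    return [build_tree(X, np.arange(X.shape[0]), rng) for _ in range(n_trees)]

def bucket_stats(node, x, weight=1.0):
    """Yield (weight, means, counts) for every terminal bucket that row x reaches."""
    if "cut" not in node:
        yield weight, node["means"], node["counts"]
        return
    proj = x[node["cols"]] @ node["coefs"]
    if np.isnan(proj):
        # Assumed rule: split the row's weight evenly across both branches.
        yield from bucket_stats(node["left"], x, weight * 0.5)
        yield from bucket_stats(node["right"], x, weight * 0.5)
    elif proj <= node["cut"]:
        yield from bucket_stats(node["left"], x, weight)
    else:
        yield from bucket_stats(node["right"], x, weight)

def impute(X, trees):
    """Fill NaNs with weighted averages of bucket means; works on unseen rows too."""
    X_out = X.copy()
    fallback = trees[0]["means"]         # root bucket = training column means
    for i in range(X.shape[0]):
        missing = np.isnan(X[i])
        if not missing.any():
            continue
        num = np.zeros(X.shape[1])
        den = np.zeros(X.shape[1])
        for tree in trees:
            for w, means, counts in bucket_stats(tree, X[i]):
                ww = w * counts          # assumed weight: more observed values, more say
                num += np.where(counts > 0, ww * means, 0.0)
                den += ww
        est = np.divide(num, den, out=fallback.copy(), where=den > 0)
        X_out[i, missing] = est[missing]
    return X_out

# Small synthetic demo: four correlated columns with about 15% of entries removed.
rng = np.random.default_rng(1)
base = rng.normal(size=(300, 1))
X_full = np.hstack([base + 0.1 * rng.normal(size=(300, 1)) for _ in range(4)])
X = X_full.copy()
X[rng.random(X.shape) < 0.15] = np.nan
trees = fit_trees(X)
X_imp = impute(X, trees)
```

Because the trees are built once and then only traversed, `impute` can be called on new rows with the same `trees`, which mirrors the abstract's point that nothing needs to be recalculated for new data.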


research
01/02/2023

Chains of Autoreplicative Random Forests for missing value imputation in high-dimensional datasets

Missing values are a common problem in data science and machine learning...
research
06/28/2022

No imputation without representation

By filling in missing values in datasets, imputation allows these datase...
research
08/13/2016

An approach to dealing with missing values in heterogeneous data using k-nearest neighbors

Techniques such as clusterization, neural networks and decision making u...
research
01/14/2022

Imputing Missing Observations with Time Sliced Synthetic Minority Oversampling Technique

We present a simple yet novel time series imputation technique with the ...
research
09/07/2018

Expanding tidy data principles to facilitate missing data exploration, visualization and assessment of imputations

Despite the large body of research on missing value distributions and im...
research
01/24/2022

Imputing Missing Values in the Occupational Requirements Survey

The U.S. Bureau of Labor Statistics allows public access to much of the ...
