HyperImpute: Generalized Iterative Imputation with Automatic Model Selection

06/15/2022
by   Daniel Jarrett, et al.
11

Consider the problem of imputing missing values in a dataset. One the one hand, conventional approaches using iterative imputation benefit from the simplicity and customizability of learning conditional distributions directly, but suffer from the practical requirement for appropriate model specification of each and every variable. On the other hand, recent methods using deep generative modeling benefit from the capacity and efficiency of learning with neural network function approximators, but are often difficult to optimize and rely on stronger data assumptions. In this work, we study an approach that marries the advantages of both: We propose *HyperImpute*, a generalized iterative imputation framework for adaptively and automatically configuring column-wise models and their hyperparameters. Practically, we provide a concrete implementation with out-of-the-box learners, optimizers, simulators, and extensible interfaces. Empirically, we investigate this framework via comprehensive experiments and sensitivities on a variety of public datasets, and demonstrate its ability to generate accurate imputations relative to a strong suite of benchmarks. Contrary to recent work, we believe our findings constitute a strong defense of the iterative imputation paradigm.

READ FULL TEXT

page 6

page 7

page 16

page 17

page 18

page 20

research
12/06/2022

Data Imputation with Iterative Graph Reconstruction

Effective data imputation demands rich latent “structure" discovery capa...
research
10/31/2022

Diffusion models for missing value imputation in tabular data

Missing value imputation in machine learning is the task of estimating t...
research
06/03/2021

Semi-supervised Conditional Density Estimation for Imputation and Classification of Incomplete Instances

Incomplete instances with various missing attributes in many real-world ...
research
04/23/2020

Influence of parallel computing strategies of iterative imputation of missing data: a case study on missForest

Machine learning iterative imputation methods have been well accepted by...
research
01/12/2018

Multiple Imputation: A Review of Practical and Theoretical Findings

Multiple imputation is a straightforward method for handling missing dat...
research
10/22/2021

Missing the Point: Non-Convergence in Iterative Imputation Algorithms

Iterative imputation is a popular tool to accommodate missing data. Whil...

Please sign up or login with your details

Forgot password? Click here to reset