EMFlow: Data Imputation in Latent Space via EM and Deep Flow Models

06/09/2021
by   Qi Ma, et al.
0

High dimensional incomplete data can be found in a wide range of systems. Due to the fact that most of the data mining techniques and machine learning algorithms require complete observations, data imputation is vital for down-stream analysis. In this work, we introduce an imputation approach, called EMFlow, that performs imputation in an latent space via an online version of Expectation-Maximization (EM) algorithm and connects the latent space and the data space via the normalizing flow (NF). The inference of EMFlow is iterative, involving updating the parameters of online EM and NF alternatively. Extensive experimental results on multivariate and image datasets show that the proposed EMFlow has superior performance to competing methods in terms of both imputation quality and convergence speed.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/27/2020

MCFlow: Monte Carlo Flow Models for Data Imputation

We consider the topic of data imputation, a foundational task in machine...
research
11/14/2022

Dealing with missing data using attention and latent space regularization

Most practical data science problems encounter missing data. A wide vari...
research
02/01/2018

Bootstrapping and Multiple Imputation Ensemble Approaches for Missing Data

Presence of missing values in a dataset can adversely affect the perform...
research
10/02/2018

Feature Selection Approach with Missing Values Conducted for Statistical Learning: A Case Study of Entrepreneurship Survival Dataset

In this article, we investigate the features which enhanced discriminate...
research
07/24/2019

The Virtual Patch Clamp: Imputing C. elegans Membrane Potentials from Calcium Imaging

We develop a stochastic whole-brain and body simulator of the nematode r...
research
09/09/2022

Boosting Sensitivity of Large-scale Online Experimentation via Dropout Buyer Imputation

Metrics provide strong evidence to support hypotheses in online experime...
research
12/09/2020

Hard and Soft EM in Bayesian Network Learning from Incomplete Data

Incomplete data are a common feature in many domains, from clinical tria...

Please sign up or login with your details

Forgot password? Click here to reset