
DPER: Efficient Parameter Estimation for Randomly Missing Data

by Thu Nguyen, et al.

The missing data problem has been studied extensively over the last few decades and arises in areas such as statistics and bioinformatics. Although many methods have been developed to tackle it, most of them are imputation techniques that require multiple iterations through the data before converging. Such approaches may also introduce extra bias and noise into the estimated parameters. In this work, we propose novel algorithms to find the maximum likelihood estimates (MLEs) for a one-class or multiple-class randomly missing data set under some mild assumptions. Because the computation is direct and involves no imputation, our algorithms do not require multiple iterations through the data, and thus promise to be less time-consuming than other methods while maintaining superior estimation performance. We validate these claims with empirical results on data sets of different sizes and release all code in a GitHub repository as a contribution to the research community working on this problem.
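To make the idea of estimating parameters directly from the observed entries, without imputation, more concrete, here is a minimal Python sketch. It is not the DPER algorithm: it uses the simpler available-case (pairwise-deletion) estimator of the mean and covariance, swapped in only for illustration. The function name `pairwise_mean_cov` and the toy data are hypothetical, and missing values are assumed to be encoded as NaN.

```python
import numpy as np


def pairwise_mean_cov(X):
    """Available-case mean and covariance for data with NaN entries.

    Illustration only: this is pairwise deletion, NOT the DPER estimator.
    Each mean uses every observed value in its column; each covariance
    entry uses only the rows where both columns are observed.
    """
    X = np.asarray(X, dtype=float)
    observed = ~np.isnan(X)
    mean = np.nanmean(X, axis=0)  # column means over observed values
    p = X.shape[1]
    cov = np.full((p, p), np.nan)
    for i in range(p):
        for j in range(i, p):
            both = observed[:, i] & observed[:, j]
            if not both.any():
                continue  # no jointly observed rows: leave entry as NaN
            di = X[both, i] - mean[i]
            dj = X[both, j] - mean[j]
            cov[i, j] = cov[j, i] = np.mean(di * dj)
    return mean, cov


# Hypothetical toy data with two randomly missing entries.
X = np.array([[1.0, 2.0],
              [3.0, np.nan],
              [5.0, 6.0],
              [np.nan, 4.0]])
mean, cov = pairwise_mean_cov(X)
```

No value is ever filled in here, so the estimate needs no iteration to convergence. One known drawback of this simple estimator is that the resulting matrix is not guaranteed to be positive semidefinite.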




On a simultaneous parameter inference and missing data imputation for nonstationary autoregressive models

This work addresses the problem of missing data in time-series analysis ...

Principle Components Analysis based frameworks for efficient missing data imputation algorithms

Missing data is a commonly occurring problem in practice, and imputation...

Random Forest Missing Data Algorithms

Random forest (RF) missing data algorithms are an attractive approach fo...

Inference with Imputed Data: The Allure of Making Stuff Up

Incomplete observability of data generates an identification problem. Th...

Proposition of a Theoretical Model for Missing Data Imputation using Deep Learning and Evolutionary Algorithms

In the last couple of decades, there has been major advancements in the ...

Rough Sets Computations to Impute Missing Data

Many techniques for handling missing data have been proposed in the lite...

Multi-objective Feature Selection with Missing Data in Classification

Feature selection (FS) is an important research topic in machine learnin...