Full-semiparametric-likelihood-based inference for non-ignorable missing data

by Yukun Liu, et al.

During the past few decades, missing-data problems have been studied extensively, with a focus on the ignorable missing case, where the missing probability depends only on observable quantities. By contrast, research into non-ignorable missing-data problems is quite limited. The main difficulty in solving such problems is that the missing probability and the regression likelihood function are entangled in the likelihood, and the model parameters may not be identifiable even under strong parametric model assumptions. In this paper we discuss a semiparametric model for non-ignorable missing data and propose a maximum full semiparametric likelihood estimation method, which efficiently combines the parametric conditional likelihood with the marginal nonparametric biased sampling likelihood. The extra marginal likelihood contribution not only yields an efficiency gain but also identifies the underlying model parameters without additional assumptions. We further show that the proposed estimators of the underlying parameters and the response mean are semiparametrically efficient. Extensive simulations and a real data analysis demonstrate the advantage of the proposed method over competing methods.
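The distinction between ignorable and non-ignorable missingness can be illustrated with a small simulation. The sketch below is not the estimator proposed in the paper. It is a hypothetical toy setup, and the data-generating model, the logistic missingness mechanism, the coefficients, and the function names `simulate` and `regression_mean` are all assumptions made for illustration. It estimates the response mean with a regression fitted to complete cases only. This works when the missing probability depends only on the observed covariate, and it becomes biased once the missing probability also depends on the response itself.

```python
import math
import random


def regression_mean(xs_all, observed_pairs):
    """Fit y = a + b*x by least squares on complete cases, then average
    the fitted values over ALL covariates (observed or not)."""
    n_obs = len(observed_pairs)
    mean_x = sum(x for x, _ in observed_pairs) / n_obs
    mean_y = sum(y for _, y in observed_pairs) / n_obs
    sxx = sum((x - mean_x) ** 2 for x, _ in observed_pairs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in observed_pairs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return sum(intercept + slope * x for x in xs_all) / len(xs_all)


def simulate(gamma, n=20000, seed=0):
    """Return (full-sample mean of y, complete-case regression estimate).

    Assumed toy model (not from the paper):
      x ~ N(0, 1),  y = 1 + x + N(0, 1) noise,
      P(y observed | x, y) = expit(0.5 + 0.5 * x + gamma * y).
    gamma == 0: missingness depends only on the observed x (ignorable).
    gamma != 0: missingness depends on y itself (non-ignorable).
    """
    rng = random.Random(seed)
    xs_all, ys_all, observed_pairs = [], [], []
    for _ in range(n):
        x = rng.gauss(0.0, 1.0)
        y = 1.0 + x + rng.gauss(0.0, 1.0)
        p_observed = 1.0 / (1.0 + math.exp(-(0.5 + 0.5 * x + gamma * y)))
        xs_all.append(x)
        ys_all.append(y)
        if rng.random() < p_observed:
            observed_pairs.append((x, y))
    full_mean = sum(ys_all) / n  # unavailable in practice; used as a benchmark
    return full_mean, regression_mean(xs_all, observed_pairs)


if __name__ == "__main__":
    for gamma in (0.0, 1.0):
        full_mean, estimate = simulate(gamma)
        print(f"gamma={gamma}: full mean={full_mean:.3f}, "
              f"complete-case regression estimate={estimate:.3f}")
```

With `gamma = 0` the two numbers should agree closely. With `gamma = 1` the complete-case regression overestimates the mean, because larger responses are more likely to be observed even after conditioning on the covariate. This is the kind of entanglement between the missing probability and the regression model that the abstract describes.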



