The Effect of Sample Size and Missingness on Inference with Missing Data

12/17/2021
by   Julian Morimoto, et al.
0

When are inferences (whether Direct-Likelihood, Bayesian, or Frequentist) obtained from partial data valid? This paper answers this question by offering a new asymptotic theory about inference with missing data that is more general than existing theories. By using more powerful tools from real analysis and probability theory than those used in previous research, it proves that as the sample size increases and the extent of missingness decreases, the average-loglikelihood function generated by partial data and that ignores the missingness mechanism will almost surely converge uniformly to that which would have been generated by complete data; and if the data are Missing at Random, this convergence depends only on sample size. Thus, inferences from partial data, such as posterior modes, uncertainty estimates, confidence intervals, likelihood ratios, test statistics, and indeed, all quantities or features derived from the partial-data loglikelihood function, will be consistently estimated. They will approximate their complete-data analogues. This adds to previous research which has only proved the consistency and asymptotic normality of the posterior mode, and developed separate theories for Direct-Likelihood, Bayesian, and Frequentist inference. Practical implications of this result are discussed, and the theory is verified using a previous study of International Human Rights Law.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/24/2022

Estimating Viral Genetic Linkage Rates in the Presence of Missing Data

Although the interest in the the use of social and information networks ...
research
10/12/2017

Inference for partial correlation when data are missing not at random

We introduce uncertainty regions to perform inference on partial correla...
research
05/02/2019

Phase transition in PCA with missing data: Reduced signal-to-noise ratio, not sample size!

How does missing data affect our ability to learn signal structures? It ...
research
03/31/2023

Second Term Improvement to Generalised Linear Mixed Model Asymptotics

A recent article on generalised linear mixed model asymptotics, Jiang et...
research
09/26/2013

Estimating Undirected Graphs Under Weak Assumptions

We consider the problem of providing nonparametric confidence guarantees...
research
09/04/2023

Challenges of the inconsistency regime: Novel debiasing methods for missing data models

We study semi-parametric estimation of the population mean when data is ...
research
09/23/2022

Posterior Probabilities: Nonmonotonicity, Asymptotic Rates, Log-Concavity, and Turán's Inequality

In the standard Bayesian framework data are assumed to be generated by a...

Please sign up or login with your details

Forgot password? Click here to reset