Estimating Viral Genetic Linkage Rates in the Presence of Missing Data

03/24/2022
by   Tyler Vu, et al.
0

Although the interest in the the use of social and information networks has grown, most inferences on networks assume the data collected represents the complete. However, when ignoring missing data, even when missing completely at random, this results in bias for estimators regarding inference network related parameters. In this paper, we focus on constructing estimators for the probability that a randomly selected node has node has at least one edge under the assumption that nodes are missing completely at random along with their corresponding edges. In addition, issues also arise in obtaining asymptotic properties for such estimators, because linkage indicators across nodes are correlated preventing the direct application of the Central Limit Theorem and Law of Large Numbers. Using a subsampling approach, we present an improved estimator for our parameter of interest that accommodates for missing data. Utilizing the theory U-statistics, we derive consistency and asymptotic normality of the proposed estimator. This approach decreases the bias in estimating our parameter of interest. We illustrate our approach using the HIV viral strains from a large cluster-randomized trial of a combination HIV prevention intervention – the Botswana Combination Prevention Project (BCPP).

READ FULL TEXT
research
07/05/2022

Handling Nonmonotone Missing Data with Available Complete-Case Missing Value Assumption

Nonmonotone missing data is a common problem in scientific studies. The ...
research
03/28/2019

Consistency and Asymptotic Normality of Stochastic Block Models Estimators from Sampled Data

Statistical analysis of network is an active research area and the liter...
research
04/09/2023

Convergent estimators of variance of a spatial mean in the presence of missing observations

In the geosciences, a recurring problem is one of estimating spatial mea...
research
12/17/2021

The Effect of Sample Size and Missingness on Inference with Missing Data

When are inferences (whether Direct-Likelihood, Bayesian, or Frequentist...
research
10/30/2018

Semiparametric response model with nonignorable nonresponse

How to deal with nonignorable response is often a challenging problem en...
research
11/21/2017

Partially Observed Functional Data: The Case of Systematically Missing Parts

By using a detour via the fundamental theorem of calculus, we propose a ...
research
11/30/2021

Nonparametric Methods for Complex Multivariate Data: Asymptotics and Small Sample Approximations

Quality of Life (QOL) outcomes are important in the management of chroni...

Please sign up or login with your details

Forgot password? Click here to reset