Analysis of Two-Phase Studies using Generalized Method of Moments

10/26/2019
by   Prosenjit Kundu, et al.
0

Two-phase design can reduce the cost of epidemiological studies by limiting the ascertainment of expensive covariates or/and exposures to an efficiently selected subset (phase-II) of a larger (phase-I) study. Efficient analysis of the resulting dataset combining disparate information from phase-I and phase-II, however, can be complex. Most of the existing methods including semiparametric maximum-likelihood estimator, require the information in phase-I to be summarized into a fixed number of strata. In this paper, we describe a novel method for analysis of two-phase studies where information from phase-I is summarized by parameters associated with a reduced logistic regression model of the disease outcome on available covariates. We then setup estimating equations for parameters associated with the desired extended logistic regression model, based on information on the reduced model parameters from phase-I and complete data available at phase-II after accounting for non-random sampling design at phase-II. We use the generalized method of moments to solve overly identified estimating equations and develop the resulting asymptotic theory for the proposed estimator. Simulation studies show that the use of reduced parametric models, as opposed to summarizing data into strata, can lead to more efficient utilization of phase-I data. An application of the proposed method is illustrated using the US National Wilms Tumor study data.

READ FULL TEXT

page 8

page 9

research
03/03/2023

Estimation of logistic regression parameters for complex survey data: a real data based simulation study

In complex survey data, each sampled observation has assigned a sampling...
research
10/07/2021

High Dimensional Logistic Regression Under Network Dependence

Logistic regression is one of the most fundamental methods for modeling ...
research
06/02/2021

Combining case-control studies for identifiability and efficiency improvement in logistic regression

Can two separate case-control studies, one about Hepatitis disease and t...
research
12/19/2022

Improving Estimation Efficiency for Two-Phase, Outcome-Dependent Sampling Studies

Two-phase outcome dependent sampling (ODS) is widely used in many fields...
research
03/21/2022

Choosing good subsamples for regression modelling

A common problem in health research is that we have a large database wit...
research
04/13/2022

Recurrent event analysis in the presence of real-time high frequency data via random subsampling

Digital monitoring studies collect real-time high frequency data via mob...
research
02/16/2023

Augmented two-step estimating equations with nuisance functionals and complex survey data

Statistical inference in the presence of nuisance functionals with compl...

Please sign up or login with your details

Forgot password? Click here to reset