Statistical Inference for Streamed Longitudinal Data

08/04/2022
by Lan Luo, et al.

Modern longitudinal data, for example from wearable devices, measure biological signals on a fixed set of participants at a diverging number of time points. Traditional statistical methods are not equipped to handle the computational burden of repeatedly analyzing the cumulatively growing dataset each time new data are collected. We propose a new estimation and inference framework for dynamic updating of point estimates and their standard errors across serially collected dependent datasets. The key technique is a decomposition of the extended score function of the quadratic inference function constructed over the cumulative longitudinal data into a sum of summary statistics over data batches. We show how this sum can be recursively updated without the need to access the whole dataset, resulting in a computationally efficient streaming procedure with minimal loss of statistical efficiency. We prove consistency and asymptotic normality of our streaming estimator as the number of data batches diverges, even as the number of independent participants remains fixed. Simulations highlight the advantages of our approach over traditional statistical methods that assume independence between data batches. Finally, we investigate the relationship between physical activity and several diseases through the analysis of accelerometry data from the National Health and Nutrition Examination Survey.
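
The idea of recursively updating a sum of per-batch summary statistics can be illustrated with a much simpler model than the one in the paper. The sketch below is not the authors' quadratic inference function procedure: it assumes ordinary least squares with independent errors, and the class name `StreamingLeastSquares` and its methods are hypothetical names chosen for illustration. It only shows that point estimates and standard errors can be refreshed after each batch from accumulated sums, without revisiting earlier data. Because it treats observations as independent, its standard errors would not be appropriate for the dependent batches the paper targets.

```python
import numpy as np


class StreamingLeastSquares:
    """Hypothetical illustration: OLS from accumulated per-batch sums.

    Assumes independent errors; this is NOT the QIF-based estimator
    described in the abstract.
    """

    def __init__(self, n_features):
        self.xtx = np.zeros((n_features, n_features))  # running sum of X'X
        self.xty = np.zeros(n_features)                # running sum of X'y
        self.yty = 0.0                                 # running sum of y'y
        self.n = 0                                     # observations seen so far

    def update(self, X, y):
        """Fold one batch into the summaries; the batch can then be discarded."""
        X = np.asarray(X, dtype=float)
        y = np.asarray(y, dtype=float)
        self.xtx += X.T @ X
        self.xty += X.T @ y
        self.yty += float(y @ y)
        self.n += X.shape[0]

    def estimate(self):
        """Return (coefficients, standard errors) using only the summaries."""
        beta = np.linalg.solve(self.xtx, self.xty)
        p = beta.shape[0]
        # Residual sum of squares at the OLS solution: y'y - beta'X'y.
        rss = self.yty - beta @ self.xty
        sigma2 = rss / (self.n - p)
        cov = sigma2 * np.linalg.inv(self.xtx)
        return beta, np.sqrt(np.diag(cov))


# Simulated data (assumed for the demo), streamed in 6 batches of 100 rows.
rng = np.random.default_rng(0)
X = rng.normal(size=(600, 3))
y = X @ np.array([1.0, -2.0, 0.5]) + rng.normal(size=600)

model = StreamingLeastSquares(n_features=3)
for X_batch, y_batch in zip(np.array_split(X, 6), np.array_split(y, 6)):
    model.update(X_batch, y_batch)

beta_stream, se_stream = model.estimate()

# Reference fit on the full dataset, for comparison only.
beta_full, *_ = np.linalg.lstsq(X, y, rcond=None)
```

In this simplified setting the streamed estimate matches the full-data fit up to floating-point error, because the sufficient statistics are exactly additive across batches. The paper's setting is harder: the batches are dependent, so the additive decomposition has to be built for the extended score function rather than for these simple cross-products.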

