Conditional variable screening for ultra-high dimensional longitudinal data with time interactions

06/15/2023
by Andrea Bratsberg, et al.

In recent years, it has become possible to gather large amounts of genomic data at a fast rate, creating situations where the number of variables greatly exceeds the number of observations. In these situations, most models that can handle a moderately high dimension become computationally infeasible. Hence, variables need to be pre-screened to reduce the dimension efficiently and accurately to a more moderate scale. Much work has gone into developing such screening procedures for independent outcomes. Far less has been done for high-dimensional longitudinal data, in which the observations can no longer be assumed to be independent. In addition, many of these longitudinal studies aim to capture possible interactions between the genomic variable and time. This calls for new screening procedures for high-dimensional longitudinal data that focus on interactions with time. In this work, we propose a novel conditional screening procedure that ranks variables according to the likelihood value at the maximum likelihood estimates in a semi-marginal linear mixed model, where the genomic variable and its interaction with time are included in the model. To our knowledge, this is the first conditional screening approach for clustered data. We prove that this approach enjoys the sure screening property, and we assess the finite-sample performance of the method through simulations, comparing it with an existing screening approach based on generalized estimating equations.
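
The ranking idea described above can be illustrated with a minimal sketch. It is not the authors' implementation, and it rests on several assumptions that the abstract does not state: a random-intercept model, a balanced design with common time points, time-invariant genomic variables, a conditioning set consisting only of the intercept and time, and maximum likelihood approximated on a grid over the variance ratio `lam`. The names `profile_loglik`, `screen`, and `lam_grid` are hypothetical.

```python
import numpy as np

def profile_loglik(y, X, lam_grid):
    """Largest profiled Gaussian log-likelihood over lam_grid for a
    random-intercept model. y is (n, m), X is (n, m, p); balanced design.
    lam is the ratio of random-intercept variance to residual variance."""
    n, m = y.shape
    N = n * m
    Xs = X.sum(axis=1)                      # (n, p) per-subject column sums
    ys = y.sum(axis=1)                      # (n,) per-subject response sums
    XtX = np.einsum("imp,imq->pq", X, X)
    Xty = np.einsum("imp,im->p", X, y)
    yty = float((y ** 2).sum())
    best = -np.inf
    for lam in lam_grid:
        # Inverse of (I + lam * 11') is I - c * 11' with c as below.
        c = lam / (1.0 + m * lam)
        A = XtX - c * Xs.T @ Xs             # X' W X
        b = Xty - c * Xs.T @ ys             # X' W y
        beta = np.linalg.solve(A, b)        # generalized least squares fit
        rss = yty - c * float(ys @ ys) - float(b @ beta)
        sigma2 = rss / N                    # profiled residual variance
        ll = -0.5 * (N * np.log(2 * np.pi * sigma2)
                     + n * np.log1p(m * lam) + N)
        best = max(best, ll)
    return best

def screen(y, t, G, d, lam_grid=None):
    """Rank the columns of G (n, p) by the maximized likelihood of a model
    with intercept, time, the variable, and its interaction with time.
    Returns the indices of the d top-ranked variables and all scores."""
    if lam_grid is None:
        # Assumed grid; lam = 0 corresponds to no random intercept.
        lam_grid = np.concatenate(([0.0], np.logspace(-3, 2, 30)))
    n, m = y.shape
    ones = np.ones((n, m))
    tt = np.broadcast_to(t, (n, m))
    scores = np.empty(G.shape[1])
    for j in range(G.shape[1]):
        g = np.repeat(G[:, [j]], m, axis=1)          # variable repeated over time
        X = np.stack([ones, tt, g, g * tt], axis=2)  # (n, m, 4) design
        scores[j] = profile_loglik(y, X, lam_grid)
    order = np.argsort(-scores)                      # highest likelihood first
    return order[:d], scores
```

A call such as `screen(y, t, G, d=20)` would return the 20 variables whose semi-marginal models reach the highest likelihood. The grid search is a simplification chosen to keep the sketch dependency-light; a full mixed-model fit with richer random effects would replace `profile_loglik`.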
