Addressing patient heterogeneity in disease predictive model development

01/07/2021
by Xu Gao, et al.

This paper addresses patient heterogeneity associated with prediction problems in biomedical applications. We propose a systematic hypothesis testing approach to determine whether patient subgroup structure exists and, if it does, the number of subgroups in the patient population. A mixture of generalized linear models is used to model the relationship between the disease outcome and patient characteristics and clinical factors, including targeted biomarker profiles. We construct a test statistic based on the expectation-maximization (EM) algorithm and derive its asymptotic distribution under the null hypothesis. An important computational advantage of the test is that the parameter estimates required under the complex alternative hypothesis can be obtained through a small number of EM iterations, rather than by optimizing the objective function. We demonstrate the finite-sample performance of the proposed test in terms of type-I error rate and power, using extensive simulation studies. The applicability of the proposed method is illustrated through an application to a multi-center prostate cancer study.
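To make the idea of "a small number of EM iterations" concrete, here is a minimal sketch of EM for a two-component mixture of generalized linear models. It is not the paper's test statistic or estimation procedure. It assumes a Gaussian outcome with identity link (chosen only because the M-step then has a closed form), and the function name `em_mixture_glm`, the simulated data, and all parameter values are hypothetical.

```python
import numpy as np

def em_mixture_glm(X, y, n_iter=5, seed=0):
    """Run a fixed, small number of EM iterations for a two-component
    mixture of Gaussian linear models (an illustrative assumption)."""
    rng = np.random.default_rng(seed)
    n, p = X.shape
    beta = rng.normal(size=(2, p))      # random starting coefficients
    sigma2 = np.full(2, y.var())        # starting residual variances
    pi = np.array([0.5, 0.5])           # starting mixing proportions
    loglik = []                         # log-likelihood before each update
    for _ in range(n_iter):
        # E-step: posterior subgroup membership probabilities
        resid = y[:, None] - X @ beta.T
        log_dens = -0.5 * (np.log(2 * np.pi * sigma2) + resid**2 / sigma2)
        log_w = np.log(pi) + log_dens
        m = log_w.max(axis=1, keepdims=True)
        log_norm = m + np.log(np.exp(log_w - m).sum(axis=1, keepdims=True))
        loglik.append(float(log_norm.sum()))
        resp = np.exp(log_w - log_norm)
        # M-step: weighted least squares within each component
        for k in range(2):
            w = resp[:, k]
            XtW = X.T * w
            beta[k] = np.linalg.solve(XtW @ X, XtW @ y)
            r = y - X @ beta[k]
            sigma2[k] = max((w * r**2).sum() / w.sum(), 1e-8)
        pi = resp.mean(axis=0)
    return beta, sigma2, pi, loglik

# Simulated example: two hidden subgroups with different covariate effects
rng = np.random.default_rng(1)
n = 400
X = np.column_stack([np.ones(n), rng.normal(size=n)])
z = rng.integers(0, 2, size=n)                      # hidden subgroup labels
true_beta = np.array([[0.0, 2.0], [1.0, -2.0]])
y = (X * true_beta[z]).sum(axis=1) + 0.5 * rng.normal(size=n)

beta, sigma2, pi, loglik = em_mixture_glm(X, y, n_iter=5)
print(pi, loglik)
```

Each EM iteration cannot decrease the observed-data log-likelihood, so the values stored in `loglik` form a non-decreasing sequence. A non-Gaussian outcome (for example, a binary disease indicator with a logit link) would replace the closed-form M-step with a weighted GLM fit.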
