Incorporating External Risk Information with the Cox Model under Population Heterogeneity: Applications to Trans-Ancestry Polygenic Hazard Scores

02/22/2023
by Di Wang, et al.

Polygenic hazard score (PHS) models designed for European ancestry (EUR) individuals provide ample information regarding survival risk discrimination. Incorporating such information can improve risk discrimination in a small internal non-EUR cohort. However, because the external EUR-based model and the internal individual-level data come from different populations, ignoring population heterogeneity can introduce substantial bias. In this paper, we develop a Kullback-Leibler-based Cox model (CoxKL) that integrates internal individual-level time-to-event data with external risk scores derived from published prediction models while accounting for population heterogeneity. Partial-likelihood-based KL information is used to measure the discrepancy between the external risk information and the internal data. We establish the asymptotic properties of the CoxKL estimator. Simulation studies show that the integrated model produced by the proposed CoxKL method achieves improved estimation efficiency and prediction accuracy. We apply the proposed method to develop a trans-ancestry PHS model for prostate cancer and find that integrating a previously published EUR-based PHS with internal genotype data from African ancestry (AFR) males yields considerable improvement in prostate cancer risk discrimination.
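The abstract does not state the CoxKL objective, so the following is only a minimal sketch of one way a partial-likelihood-based KL penalty could be set up, not the authors' implementation. It assumes that, at each observed event time, the external risk scores define a softmax distribution over the risk set, and that the usual Cox log partial likelihood is penalized by eta times the KL divergence from that external distribution to the internal one. Under this assumption the penalized objective reduces to an ordinary partial likelihood in which each event's covariate vector is blended with its externally weighted risk-set mean. The function names, the tuning weight `eta`, the Newton solver, and the simulated data are all hypothetical.

```python
import numpy as np


def risk_set_means(scores, Z, at_risk):
    """Row i: softmax(scores)-weighted mean of Z over subjects at risk at time i."""
    w = at_risk * np.exp(scores - scores.max())[None, :]
    w = w / w.sum(axis=1, keepdims=True)
    return w @ Z, w


def fit_coxkl_sketch(Z, time, event, ext_risk, eta, n_iter=50, tol=1e-8):
    """Newton ascent on an assumed KL-penalized Cox partial likelihood.

    Assumed objective (not taken from the paper's text):
        sum_i event_i * [ Z_blend_i @ beta - log sum_{j in R_i} exp(Z_j @ beta) ]
    with Z_blend_i = (Z_i + eta * Z_tilde_i) / (1 + eta), where Z_tilde_i is the
    risk-set mean of Z under the external scores. eta = 0 is the plain Cox fit.
    Ties are not handled specially; event times are assumed distinct.
    """
    n, p = Z.shape
    # at_risk[i, j] is True when subject j is still under observation at time[i].
    at_risk = time[None, :] >= time[:, None]
    Z_tilde, _ = risk_set_means(ext_risk, Z, at_risk)
    Z_blend = (Z + eta * Z_tilde) / (1.0 + eta)

    beta = np.zeros(p)
    for _ in range(n_iter):
        mean, w = risk_set_means(Z @ beta, Z, at_risk)
        # Gradient: blended covariate minus model-implied risk-set mean, over events.
        grad = (event[:, None] * (Z_blend - mean)).sum(axis=0)
        # Information matrix: sum over events of the risk-set covariance of Z.
        second = np.einsum("ij,jk,jl->ikl", w, Z, Z)
        cov = second - mean[:, :, None] * mean[:, None, :]
        info = (event[:, None, None] * cov).sum(axis=0)
        step = np.linalg.solve(info, grad)
        beta = beta + step
        if np.max(np.abs(step)) < tol:
            break
    return beta


# Simulated illustration only: no real genotype or survival data are used.
rng = np.random.default_rng(0)
n = 150
Z = rng.normal(size=(n, 2))
beta_true = np.array([0.5, -0.5])   # hypothetical internal-population effects
beta_ext = np.array([1.0, 0.0])     # hypothetical external-model effects
event_time = rng.exponential(1.0 / np.exp(Z @ beta_true))
censor_time = rng.exponential(2.0, size=n)
time = np.minimum(event_time, censor_time)
event = (event_time <= censor_time).astype(float)
ext_risk = Z @ beta_ext             # external risk score for each internal subject

beta_internal = fit_coxkl_sketch(Z, time, event, ext_risk, eta=0.0)
beta_blend = fit_coxkl_sketch(Z, time, event, ext_risk, eta=1.0)
beta_external_limit = fit_coxkl_sketch(Z, time, event, ext_risk, eta=1e6)
print(beta_internal, beta_blend, beta_external_limit)
```

In this sketch, `eta` controls how much weight the external scores receive: `eta=0` ignores them, while a very large `eta` pulls the estimate toward the coefficients implied by the external model. A practical analysis would choose `eta` by a data-driven criterion such as cross-validation, which is one way to limit the influence of an external model that fits the internal population poorly.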


research
06/12/2021

Regression inference for multiple populations by integrating summary-level data using stacked imputations

There is a growing need for flexible general frameworks that integrate i...
research
10/12/2022

Bregman Divergence-Based Data Integration with Application to Polygenic Risk Score (PRS) Heterogeneity Adjustment

Polygenic risk scores (PRS) have recently received much attention for ge...
research
08/10/2022

KL-divergence Based Deep Learning for Discrete Time Model

Neural Network (Deep Learning) is a modern model in Artificial Intellige...
research
10/20/2020

An ensemble meta-prediction framework to integrate multiple external models into a current study

Disease risk prediction models are used throughout clinical biomedicine....
research
05/10/2022

Improving genetic risk prediction across diverse population by disentangling ancestry representations

Risk prediction models using genetic data have seen increasing traction ...
research
08/27/2021

Targeting Underrepresented Populations in Precision Medicine: A Federated Transfer Learning Approach

The limited representation of minorities and disadvantaged populations i...
research
03/04/2020

Risk Projection for Time-to-event Outcome Leveraging Summary Statistics With Source Individual-level Data

Predicting risks of chronic diseases has become increasingly important i...
