Boosting the concordance index for survival data - a unified framework to derive and evaluate biomarker combinations

07/24/2013
by   Andreas Mayr, et al.
0

The development of molecular signatures for the prediction of time-to-event outcomes is a methodologically challenging task in bioinformatics and biostatistics. Although there are numerous approaches for the derivation of marker combinations and their evaluation, the underlying methodology often suffers from the problem that different optimization criteria are mixed during the feature selection, estimation and evaluation steps. This might result in marker combinations that are only suboptimal regarding the evaluation criterion of interest. To address this issue, we propose a unified framework to derive and evaluate biomarker combinations. Our approach is based on the concordance index for time-to-event data, which is a non-parametric measure to quantify the discrimatory power of a prediction rule. Specifically, we propose a component-wise boosting algorithm that results in linear biomarker combinations that are optimal with respect to a smoothed version of the concordance index. We investigate the performance of our algorithm in a large-scale simulation study and in two molecular data sets for the prediction of survival in breast cancer patients. Our numerical results show that the new approach is not only methodologically sound but can also lead to a higher discriminatory power than traditional approaches for the derivation of gene signatures.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/24/2023

Tutorial on survival modelling with omics data

Identification of genomic, molecular and clinical markers predictive of ...
research
04/10/2023

hist2RNA: An efficient deep learning architecture to predict gene expression from breast cancer histopathology images

Gene expression can be used to subtype breast cancer with improved predi...
research
02/01/2021

Computing the Hazard Ratios Associated with Explanatory Variables Using Machine Learning Models of Survival Data

Purpose: The application of Cox Proportional Hazards (CoxPH) models to s...
research
04/04/2018

Non-parametric cure rate estimation under insufficient follow-up using extremes

An important research topic in survival analysis is related to the model...
research
07/08/2014

iGPSe: A Visual Analytic System for Integrative Genomic Based Cancer Patient Stratification

Background: Cancers are highly heterogeneous with different subtypes. Th...
research
01/18/2010

Increasing stability and interpretability of gene expression signatures

Motivation : Molecular signatures for diagnosis or prognosis estimated f...
research
07/11/2022

Multi-Study Boosting: Theoretical Considerations for Merging vs. Ensembling

Cross-study replicability is a powerful model evaluation criterion that ...

Please sign up or login with your details

Forgot password? Click here to reset