A Pan-Cancer and Polygenic Bayesian Hierarchical Model for the Effect of Somatic Mutations on Survival

10/08/2019
by   Sarah Samorodnitsky, et al.
0

We built a novel Bayesian hierarchical survival model based on the somatic mutation profile of patients across 50 genes and 27 cancer types. The pan-cancer quality allows for the model to "borrow" information across cancer types, motivated by the assumption that similar mutation profiles may have similar (but not necessarily identical) effects on survival across different tissues-of-origin or tumor types. The effect of a mutation at each gene was allowed to vary by cancer type while the mean effect of each gene was shared across cancers. Within this framework we considered four parametric survival models (normal, log-normal, exponential, and Weibull), and we compared their performance via a cross-validation approach in which we fit each model on training data and estimate the log-posterior predictive likelihood on test data. The log-normal model gave the best fit, and we investigated the partial effect of each gene on survival via a forward selection procedure. Through this we determined that mutations at TP53 and FAT4 were together the most useful for predicting patient survival. We validated the model via simulation to ensure that our algorithm for posterior computation gave nominal coverage rates. The code used for this analysis can be found at http://github.com/sarahsamorodnitsky/Pan-Cancer-Survival-Modeling , and the results are at http://ericfrazerlock.com/surv_figs/SurvivalDisplay.html .

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/20/2021

Study of the Parent-of-origin effect in monogenic diseases with variable age of onset. Application on ATTRv

In genetic diseases with variable age of onset, an accurate estimation o...
research
02/02/2022

MPVNN: Mutated Pathway Visible Neural Network Architecture for Interpretable Prediction of Cancer-specific Survival Risk

Survival risk prediction using gene expression data is important in maki...
research
05/21/2020

Correlated Mixed Membership Modeling of Somatic Mutations

Recent studies of cancer somatic mutation profiles seek to identify muta...
research
02/08/2021

Data-driven design of targeted gene panels for estimating immunotherapy biomarkers

We introduce a novel data-driven framework for the design of targeted ge...
research
12/06/2021

Bayesian Structural Equation Modeling in Multiple Omics Data Integration with Application to Circadian Genes

It is well known that the integration among different data-sources is re...
research
04/12/2013

Identifying cancer subtypes in glioblastoma by combining genomic, transcriptomic and epigenomic data

We present a nonparametric Bayesian method for disease subtype discovery...
research
07/05/2013

Supervised Learning and Anti-learning of Colorectal Cancer Classes and Survival Rates from Cellular Biology Parameters

In this paper, we describe a dataset relating to cellular and physical c...

Please sign up or login with your details

Forgot password? Click here to reset