Identifying cancer subtypes in glioblastoma by combining genomic, transcriptomic and epigenomic data

04/12/2013
by   Richard S. Savage, et al.
0

We present a nonparametric Bayesian method for disease subtype discovery in multi-dimensional cancer data. Our method can simultaneously analyse a wide range of data types, allowing for both agreement and disagreement between their underlying clustering structure. It includes feature selection and infers the most likely number of disease subtypes, given the data. We apply the method to 277 glioblastoma samples from The Cancer Genome Atlas, for which there are gene expression, copy number variation, methylation and microRNA data. We identify 8 distinct consensus subtypes and study their prognostic value for death, new tumour events, progression and recurrence. The consensus subtypes are prognostic of tumour recurrence (log-rank p-value of 3.6 × 10^-4 after correction for multiple hypothesis tests). This is driven principally by the methylation data (log-rank p-value of 2.0 × 10^-3) but the effect is strengthened by the other 3 data types, demonstrating the value of integrating multiple data types. Of particular note is a subtype of 47 patients characterised by very low levels of methylation. This subtype has very low rates of tumour recurrence and no new events in 10 years of follow up. We also identify a small gene expression subtype of 6 patients that shows particularly poor survival outcomes. Additionally, we note a consensus subtype that showly a highly distinctive data signature and suggest that it is therefore a biologically distinct subtype of glioblastoma. The code is available from https://sites.google.com/site/multipledatafusion/

READ FULL TEXT

page 7

page 11

page 12

research
08/30/2018

Gaussian process regression for survival time prediction with genome-wide gene expression

Predicting the survival time of a cancer patient based on his/her genome...
research
11/01/2019

Analysis of Genomic and Transcriptomic Variations as Prognostic Signature for Lung Adenocarcinoma

Lung cancer is the leading cause of the largest number of deaths worldwi...
research
11/01/2019

Meta-Analysis of Genomic and Transcriptomic Variations in Lung Adenocarcinoma

Lung cancer is the leading cause of the largest number of deaths worldwi...
research
11/10/2017

A Novel Bayesian Multiple Testing Approach to Deregulated miRNA Discovery Harnessing Positional Clustering

MicroRNAs (miRNAs) are endogenous, small non-coding RNAs that function a...
research
10/08/2019

A Pan-Cancer and Polygenic Bayesian Hierarchical Model for the Effect of Somatic Mutations on Survival

We built a novel Bayesian hierarchical survival model based on the somat...
research
01/17/2019

Learning a Generative Model of Cancer Metastasis

We introduce a Unified Disentanglement Network (UFDN) trained on The Can...
research
10/02/2022

Modeling of Whole Genomic Sequencing Implementation using System Dynamics and Game Theory

Biomarker testing is a laboratory test in oncology that is used in the s...

Please sign up or login with your details

Forgot password? Click here to reset