Multi-Kernel LS-SVM Based Bio-Clinical Data Integration: Applications to Ovarian Cancer

04/10/2017
by   Jaya Thomas, et al.
0

The medical research facilitates to acquire a diverse type of data from the same individual for particular cancer. Recent studies show that utilizing such diverse data results in more accurate predictions. The major challenge faced is how to utilize such diverse data sets in an effective way. In this paper, we introduce a multiple kernel based pipeline for integrative analysis of high-throughput molecular data (somatic mutation, copy number alteration, DNA methylation and mRNA) and clinical data. We apply the pipeline on Ovarian cancer data from TCGA. After multiple kernels have been generated from the weighted sum of individual kernels, it is used to stratify patients and predict clinical outcomes. We examine the survival time, vital status, and neoplasm cancer status of each subtype to verify how well they cluster. We have also examined the power of molecular and clinical data in predicting dichotomized overall survival data and to classify the tumor grade for the cancer samples. It was observed that the integration of various data types yields higher log-rank statistics value. We were also able to predict clinical status with higher accuracy as compared to using individual data types.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/28/2021

A Hierarchical Spike-and-Slab Model for Pan-Cancer Survival Using Pan-Omic Data

Pan-omics, pan-cancer analysis has advanced our understanding of the mol...
research
12/16/2019

Deep learning-based survival prediction for multiple cancer types using histopathology images

Prognostic information at diagnosis has important implications for cance...
research
11/17/2016

A Multi-Modal Graph-Based Semi-Supervised Pipeline for Predicting Cancer Survival

Cancer survival prediction is an active area of research that can help p...
research
03/11/2018

A pathway-based kernel boosting method for sample classification using genomic data

The analysis of cancer genomic data has long suffered "the curse of dime...
research
12/21/2018

Pan-Cancer Epigenetic Biomarker Selection from Blood Samples Using SAS

A key focus in current cancer research is the discovery of cancer biomar...
research
08/24/2019

Geographically Weighted Cox Regression for Prostate Cancer Survival Data in Louisiana

The Cox proportional hazard model is one of the most popular tools in an...
research
01/23/2023

Maximum Mean Discrepancy Kernels for Predictive and Prognostic Modeling of Whole Slide Images

How similar are two images? In computational pathology, where Whole Slid...

Please sign up or login with your details

Forgot password? Click here to reset