Towards multiple kernel principal component analysis for integrative analysis of tumor samples

01/02/2017
by   Nora K. Speicher, et al.
0

Personalized treatment of patients based on tissue-specific cancer subtypes has strongly increased the efficacy of the chosen therapies. Even though the amount of data measured for cancer patients has increased over the last years, most cancer subtypes are still diagnosed based on individual data sources (e.g. gene expression data). We propose an unsupervised data integration method based on kernel principal component analysis. Principal component analysis is one of the most widely used techniques in data analysis. Unfortunately, the straight-forward multiple-kernel extension of this method leads to the use of only one of the input matrices, which does not fit the goal of gaining information from all data sources. Therefore, we present a scoring function to determine the impact of each input matrix. The approach enables visualizing the integrated data and subsequent clustering for cancer subtype identification. Due to the nature of the method, no free parameters have to be set. We apply the methodology to five different cancer data sets and demonstrate its advantages in terms of results and usability.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/20/2015

Kernel principal component analysis network for image classification

In order to classify the nonlinear feature with linear classifier and im...
research
02/19/2014

A Statistical Approach to Set Classification by Feature Selection with Applications to Classification of Histopathology Images

Set classification problems arise when classification tasks are based on...
research
11/18/2020

Voxelwise principal component analysis of dynamic [S-methyl-11C]methionine PET data in glioma patients

Recent works have demonstrated the added value of dynamic amino acid pos...
research
02/12/2020

Structure-Property Maps with Kernel Principal Covariates Regression

Data analysis based on linear methods, which look for correlations betwe...
research
04/17/2008

Information Preserving Component Analysis: Data Projections for Flow Cytometry Analysis

Flow cytometry is often used to characterize the malignant cells in leuk...
research
08/14/2022

Virgo: Scalable Unsupervised Classification of Cosmological Shock Waves

Cosmological shock waves are essential to understanding the formation of...
research
12/16/2015

Streaming Kernel Principal Component Analysis

Kernel principal component analysis (KPCA) provides a concise set of bas...

Please sign up or login with your details

Forgot password? Click here to reset