Identifying Outliers using Influence Function of Multiple Kernel Canonical Correlation Analysis

06/01/2016
by   Md. Ashad Alam, et al.
0

Imaging genetic research has essentially focused on discovering unique and co-association effects, but typically ignoring to identify outliers or atypical objects in genetic as well as non-genetics variables. Identifying significant outliers is an essential and challenging issue for imaging genetics and multiple sources data analysis. Therefore, we need to examine for transcription errors of identified outliers. First, we address the influence function (IF) of kernel mean element, kernel covariance operator, kernel cross-covariance operator, kernel canonical correlation analysis (kernel CCA) and multiple kernel CCA. Second, we propose an IF of multiple kernel CCA, which can be applied for more than two datasets. Third, we propose a visualization method to detect influential observations of multiple sources of data based on the IF of kernel CCA and multiple kernel CCA. Finally, the proposed methods are capable of analyzing outliers of subjects usually found in biomedical applications, in which the number of dimension is large. To examine the outliers, we use the stem-and-leaf display. Experiments on both synthesized and imaging genetics data (e.g., SNP, fMRI, and DNA methylation) demonstrate that the proposed visualization can be applied effectively.

READ FULL TEXT

page 10

page 11

research
05/09/2017

Influence Function and Robust Variant of Kernel Canonical Correlation Analysis

Many unsupervised kernel methods rely on the estimation of the kernel co...
research
09/15/2016

Learning Schizophrenia Imaging Genetics Data Via Multiple Kernel Canonical Correlation Analysis

Kernel and Multiple Kernel Canonical Correlation Analysis (CCA) are empl...
research
02/17/2016

Robust Kernel (Cross-) Covariance Operators in Reproducing Kernel Hilbert Space toward Kernel Methods

To the best of our knowledge, there are no general well-founded robust m...
research
06/01/2016

Gene-Gene association for Imaging Genetics Data using Robust Kernel Canonical Correlation Analysis

In genome-wide interaction studies, to detect gene-gene interactions, mo...
research
03/05/2015

Pyrcca: regularized kernel canonical correlation analysis in Python and its applications to neuroimaging

Canonical correlation analysis (CCA) is a valuable method for interpreti...
research
11/06/2018

Robust multiple-set linear canonical analysis based on minimum covariance determinant estimator

By deriving influence functions related to multiple-set linear canonical...
research
02/24/2017

Characterizing Classes of Potential Outliers through Traffic Data Set Data Signature 2D nMDS Projection

This paper presents a formal method for characterizing the potential out...

Please sign up or login with your details

Forgot password? Click here to reset