FDR-Corrected Sparse Canonical Correlation Analysis with Applications to Imaging Genomics

05/11/2017
by Alexej Gossmann, et al.

Reducing the number of false positive discoveries is one of the most pressing issues in the life sciences. It is especially important in neuroimaging and genomics, where datasets are typically high-dimensional, meaning that the number of explanatory variables exceeds the sample size. The false discovery rate (FDR) is a criterion that addresses this issue, and it has therefore become a popular tool for multiple hypothesis testing. Canonical correlation analysis (CCA) is a statistical technique for analyzing the cross-correlation between two sets of measurements collected on the same samples (e.g., brain imaging and genomic data for the same mental illness patients), and sparse CCA extends the classical method to high-dimensional settings. Here we propose a way of applying the FDR concept to sparse CCA, together with a method to control the FDR. The proposed FDR correction directly influences the sparsity of the solution, adapting it to the unknown true sparsity level. Theoretical derivations as well as simulation studies show that our procedure keeps the FDR of the canonical vectors below a user-specified target level. We apply the proposed method to an imaging genomics dataset from the Philadelphia Neurodevelopmental Cohort. Our results link brain connectivity profiles, derived from brain activity during an emotion identification task as measured by functional magnetic resonance imaging (fMRI), to the corresponding subjects' genomic data.
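
To make the notion of "the FDR of the canonical vectors" more concrete, the following minimal sketch computes the false discovery proportion (FDP) of one estimated sparse vector; the FDR is the expected value of this quantity over repeated samples. It assumes a simulation setting where the true support is known and treats every nonzero entry as a discovery. The function name `false_discovery_proportion` and the toy vector `u_hat` are hypothetical, and the paper's exact definition of a false discovery may differ from this simplified one.

```python
import numpy as np


def false_discovery_proportion(estimate, true_support):
    """FDP = (# selected entries outside the true support) / max(# selected, 1).

    Assumption: an entry counts as "selected" when it is exactly nonzero,
    and `true_support` lists the indices of the truly relevant variables.
    """
    selected = np.flatnonzero(np.asarray(estimate) != 0)
    truth = {int(i) for i in true_support}
    false_positives = sum(1 for i in selected if int(i) not in truth)
    # max(..., 1) makes the FDP zero when nothing is selected.
    return false_positives / max(len(selected), 1)


# Toy example: variables 0 and 2 are truly relevant; the estimate also picks 4.
u_hat = np.array([0.7, 0.0, -0.5, 0.0, 0.1, 0.0])
fdp = false_discovery_proportion(u_hat, true_support=[0, 2])
print(f"FDP = {fdp:.3f}")
```

In a simulation study, averaging this proportion over many replicated datasets gives an empirical estimate of the FDR, which can then be compared against the user-specified target level.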

research
04/21/2020

Imbalanced Sparse Canonical Correlation Analysis

Classical canonical correlation analysis (CCA) requires matrices to be l...
research
04/01/2019

Multimodal Sparse Classifier for Adolescent Brain Age Prediction

The study of healthy brain development helps to better understand the br...
research
05/29/2016

A simple and provable algorithm for sparse diagonal CCA

Given two sets of variables, derived from a common set of samples, spars...
research
03/01/2021

Tangent functional canonical correlation analysis for densities and shapes, with applications to multimodal imaging data

It is quite common for functional data arising from imaging data to assu...
research
10/17/2014

Randomized Structural Sparsity via Constrained Block Subsampling for Improved Sensitivity of Discriminative Voxel Identification

In this paper, we consider voxel selection for functional Magnetic Reson...
research
04/09/2018

Cluster Failure Revisited: Impact of First Level Design and Data Quality on Cluster False Positive Rates

Methodological research rarely generates a broad interest, yet our work ...
research
08/22/2023

Computational Inference for Directions in Canonical Correlation Analysis

Canonical Correlation Analysis (CCA) is a method for analyzing pairs of ...
