Probabilistic Canonical Correlation Analysis for Sparse Count Data

05/11/2020
by Lin Qiu, et al.

Canonical correlation analysis (CCA) is a classical multivariate technique for exploring the relationship between two sets of continuous variables. CCA has applications in many fields, such as genomics and neuroimaging. It can extract meaningful features and use them in subsequent analysis. Although some sparse CCA methods have been developed to deal with high-dimensional problems, they are designed specifically for continuous data and do not account for integer-valued data from next-generation sequencing platforms, which exhibit very low counts for some important features. We propose a model-based probabilistic approach, PSCCA, for estimating correlations and canonical correlations between two sparse count data sets. With PSCCA we show that correlations and canonical correlations estimated at the natural parameter level are more appropriate than traditional estimates computed from the raw data. Simulation studies demonstrate that PSCCA outperforms standard correlation approaches and other sparse CCA approaches in estimating the true correlations and canonical correlations at the natural parameter level. We further apply PSCCA to study the association between miRNA and mRNA expression data sets from a squamous cell lung cancer study, finding that PSCCA uncovers a larger number of strongly correlated pairs than standard correlation and other sparse CCA approaches.
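To illustrate why a correlation computed on raw sparse counts can differ from the correlation at the natural parameter level, here is a minimal simulation sketch. It is not the PSCCA model or its estimator. It assumes, purely for illustration, that each count is Poisson with a log rate (the natural parameter) drawn from a bivariate normal distribution. The function name `simulate_counts` and all parameter values are hypothetical.

```python
import numpy as np


def simulate_counts(n, rho, mean_log_rate, sd, seed=0):
    """Draw correlated log rates, then Poisson counts given those rates.

    Assumed toy model (not taken from the paper):
      theta ~ Normal(mean_log_rate, sd^2) per feature, correlation rho
      count ~ Poisson(exp(theta))
    """
    rng = np.random.default_rng(seed)
    cov = sd**2 * np.array([[1.0, rho], [rho, 1.0]])
    theta = rng.multivariate_normal([mean_log_rate, mean_log_rate], cov, size=n)
    counts = rng.poisson(np.exp(theta))
    return theta, counts


# A low mean log rate produces many zeros, mimicking sparse sequencing counts.
theta, counts = simulate_counts(n=5000, rho=0.8, mean_log_rate=-1.0, sd=1.0)

# Correlation at the natural parameter level (known only in a simulation).
latent_corr = np.corrcoef(theta[:, 0], theta[:, 1])[0, 1]

# Pearson correlation computed directly on the observed counts.
raw_corr = np.corrcoef(counts[:, 0], counts[:, 1])[0, 1]

zero_fraction = (counts == 0).mean()

print(f"fraction of zero counts:        {zero_fraction:.2f}")
print(f"correlation of log rates:       {latent_corr:.2f}")
print(f"Pearson correlation of counts:  {raw_corr:.2f}")
```

Under this toy model the Poisson sampling noise inflates the variance of each count without adding covariance, so the raw-count correlation is pulled toward zero relative to the correlation of the log rates. A model-based estimator would need to recover the latter from the counts alone.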

