Sparse semiparametric canonical correlation analysis for data of mixed types

07/13/2018
by   Grace Yoon, et al.
0

Canonical correlation analysis investigates linear relationships between two sets of variables, but often works poorly on modern data sets due to high-dimensionality and mixed data types (continuous/binary/zero-inflated). We propose a new approach for sparse canonical correlation analysis of mixed data types that does not require explicit parametric assumptions. Our main contribution is the use of truncated latent Gaussian copula to model the data with excess zeroes, which allows us to derive a rank-based estimator of latent correlation matrix without the estimation of marginal transformation functions. The resulting semiparametric sparse canonical correlation analysis method works well in high-dimensional settings as demonstrated via numerical studies, and application to the analysis of association between gene expression and micro RNA data of breast cancer patients.

READ FULL TEXT
research
05/11/2020

Probabilistic Canonical Correlation Analysis for Sparse Count Data

Canonical correlation analysis (CCA) is a classical and important multiv...
research
11/24/2013

Sparse CCA via Precision Adjusted Iterative Thresholding

Sparse Canonical Correlation Analysis (CCA) has received considerable at...
research
07/29/2022

Exponential canonical correlation analysis with orthogonal variation

Canonical correlation analysis (CCA) is a standard tool for studying ass...
research
11/19/2015

Canonical Autocorrelation Analysis

We present an extension of sparse Canonical Correlation Analysis (CCA) d...
research
08/20/2021

latentcor: An R Package for estimating latent correlations from mixed data types

We present `latentcor`, an R package for correlation estimation from dat...
research
06/10/2013

Discriminative extended canonical correlation analysis for pattern set matching

In this paper we address the problem of matching sets of vectors embedde...

Please sign up or login with your details

Forgot password? Click here to reset