Fast computation of latent correlations

06/24/2020
by   Grace Yoon, et al.
0

Latent Gaussian copula models provide a powerful means to perform multi-view data integration since these models can seamlessly express dependencies between mixed variable types (binary, continuous, zero-inflated) via latent Gaussian correlations. The estimation of these latent correlations, however, comes at considerable computational cost, having prevented the routine use of these models on high-dimensional data. Here, we propose a new computational approach for estimating latent correlations via a hybrid multi-linear interpolation and optimization scheme. Our approach speeds up the current state of the art computation by several orders of magnitude, thus allowing fast computation of latent Gaussian copula models even when the number of variables p is large. We provide theoretical guarantees for the approximation error of our numerical scheme and support its excellent performance on simulated and real-world data. We illustrate the practical advantages of our method on high-dimensional sparse quantitative and relative abundance microbiome data as well as multi-view data from The Cancer Genome Atlas Project. Our method is implemented in the R package mixedCCA, available at https://github.com/irinagain/mixedCCA.

READ FULL TEXT

page 23

page 24

research
08/20/2021

latentcor: An R Package for estimating latent correlations from mixed data types

We present `latentcor`, an R package for correlation estimation from dat...
research
09/30/2022

Parea: multi-view ensemble clustering for cancer subtype discovery

Multi-view clustering methods are essential for the stratification of pa...
research
04/11/2014

Model Based Clustering of High-Dimensional Binary Data

We propose a mixture of latent trait models with common slope parameters...
research
04/12/2022

Multi-View Breast Cancer Classification via Hypercomplex Neural Networks

Traditionally, deep learning-based methods for breast cancer classificat...
research
04/27/2021

Multi-view Deep One-class Classification: A Systematic Exploration

One-class classification (OCC), which models one single positive class a...
research
09/10/2020

Finding Stable Groups of Cross-Correlated Features in Multi-View data

Multi-view data, in which data of different types are obtained from a co...
research
07/08/2021

Encoding Domain Information with Sparse Priors for Inferring Explainable Latent Variables

Latent variable models are powerful statistical tools that can uncover r...

Please sign up or login with your details

Forgot password? Click here to reset