Generalized R^2 Measures for a Mixture of Bivariate Linear Dependences

11/25/2018
by   Jingyi Jessica Li, et al.
0

Motivated by the pressing needs for capturing complex but interperetable variable relationships in scientific research, here we develop new mathematical foundation and statistical methodologies to generalize the squared Pearson correlation, i.e., the R^2, to capture a mixture of linear dependences between two real-valued random variables. We define the population and sample generalized R^2 measures under the supervised and unsupervised scenarios, and we derive the asymptotic distributions of the sample measures to enable computationally efficient statistical inference of the population measures. To compute the sample generalized R^2 measure under the unsupervised scenario, we develop a K-lines clustering algorithm and investigate its connection to gradient descent and expectation-maximization algorithms. Our simulation results provide additional numerical verification of the theoretical results. Two real data genomic applications demonstrate the effectiveness of the generalized R^2 measures in capturing interpretable gene-gene relationships that are likely missed by existing association measures. The estimation and inference procedures are implemented in an R package gR2.

READ FULL TEXT
research
03/03/2022

From local to global gene co-expression estimation using single-cell RNA-seq data

In genomics studies, the investigation of the gene relationship often br...
research
11/18/2020

Matrix compatibility and correlation mixture representation of generalized Gini's gamma

Representations of measures of concordance in terms of Pearson's correla...
research
09/17/2020

Statistical Inference for High-Dimensional Vector Autoregression with Measurement Error

High-dimensional vector autoregression with measurement error is frequen...
research
07/01/2021

Sparse GCA and Thresholded Gradient Descent

Generalized correlation analysis (GCA) is concerned with uncovering line...
research
10/26/2017

From Distance Correlation to Multiscale Generalized Correlation

Understanding and developing a correlation measure that can detect gener...
research
06/17/2023

Distributed Semi-Supervised Sparse Statistical Inference

This paper is devoted to studying the semi-supervised sparse statistical...
research
08/14/2020

Recursive linearization method for inverse medium scattering problems with complex mixture Gaussian error learning

This paper is concerned with the numerical errors that have appeared in ...

Please sign up or login with your details

Forgot password? Click here to reset