Direct covariance matrix estimation with compositional data

12/19/2022
by   Aaron J. Molstad, et al.
0

Compositional data arise in many areas of research in the natural and biomedical sciences. One prominent example is in the study of the human gut microbiome, where one can measure the relative abundance of many distinct microorganisms in a subject's gut. Often, practitioners are interested in learning how the dependencies between microbes vary across distinct populations or experimental conditions. In statistical terms, the goal is to estimate a covariance matrix for the (latent) log-abundances of the microbes in each of the populations. However, the compositional nature of the data prevents the use of standard estimators for these covariance matrices. In this article, we propose an estimator of multiple covariance matrices which allows for information sharing across distinct populations of samples. Compared to some existing estimators, which estimate the covariance matrices of interest indirectly, our estimator is direct, ensures positive definiteness, and is the solution to a convex optimization problem. We compute our estimator using a proximal-proximal gradient descent algorithm. Asymptotic properties of our estimator reveal that it can perform well in high-dimensional settings. Through simulation studies, we demonstrate that our estimator can outperform existing estimators. We show that our method provides more reliable estimates than competitors in an analysis of microbiome data from subjects with chronic fatigue syndrome.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/23/2014

Convex Banding of the Covariance Matrix

We introduce a new sparse estimator of the covariance matrix for high-di...
research
08/13/2020

Linear pooling of sample covariance matrices

We consider covariance matrix estimation in a setting, where there are m...
research
04/17/2023

Sparse Positive-Definite Estimation for Large Covariance Matrices with Repeated Measurements

In many fields of biomedical sciences, it is common that random variable...
research
03/01/2020

Estimating Multiple Precision Matrices with Cluster Fusion Regularization

We propose a penalized likelihood framework for estimating multiple prec...
research
09/11/2022

Large covariance matrix estimation via penalized log-det heuristics

This paper provides a comprehensive estimation framework for large covar...
research
01/02/2016

Joint Estimation of Precision Matrices in Heterogeneous Populations

We introduce a general framework for estimation of inverse covariance, o...
research
06/05/2020

Reliable Covariance Estimation

Covariance or scatter matrix estimation is ubiquitous in most modern sta...

Please sign up or login with your details

Forgot password? Click here to reset