CARE: Large Precision Matrix Estimation for Compositional Data

09/13/2023
by   Shucong Zhang, et al.
0

High-dimensional compositional data are prevalent in many applications. The simplex constraint poses intrinsic challenges to inferring the conditional dependence relationships among the components forming a composition, as encoded by a large precision matrix. We introduce a precise specification of the compositional precision matrix and relate it to its basis counterpart, which is shown to be asymptotically identifiable under suitable sparsity assumptions. By exploiting this connection, we propose a composition adaptive regularized estimation (CARE) method for estimating the sparse basis precision matrix. We derive rates of convergence for the estimator and provide theoretical guarantees on support recovery and data-driven parameter tuning. Our theory reveals an intriguing trade-off between identification and estimation, thereby highlighting the blessing of dimensionality in compositional data analysis. In particular, in sufficiently high dimensions, the CARE estimator achieves minimax optimality and performs as well as if the basis were observed. We further discuss how our framework can be extended to handle data containing zeros, including sampling zeros and structural zeros. The advantages of CARE over existing methods are illustrated by simulation studies and an application to inferring microbial ecological networks in the human gut.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/20/2020

Robust Covariance Estimation for High-dimensional Compositional Data with Application to Microbial Communities Analysis

Microbial communities analysis is drawing growing attention due to the r...
research
06/25/2021

MARS: A second-order reduction algorithm for high-dimensional sparse precision matrices estimation

Estimation of the precision matrix (or inverse covariance matrix) is of ...
research
07/07/2021

High dimensional precision matrix estimation under weak sparsity

In this paper, we estimate the high dimensional precision matrix under t...
research
02/15/2018

Robust and sparse Gaussian graphical modeling under cell-wise contamination

Graphical modeling explores dependences among a collection of variables ...
research
12/19/2018

Optimal covariance matrix estimation for high-dimensional noise in high-frequency data

In this paper, we consider efficiently learning the structural informati...
research
07/13/2022

Compositional Sparsity, Approximation Classes, and Parametric Transport Equations

Approximating functions of a large number of variables poses particular ...
research
05/12/2023

Robust score matching for compositional data

The restricted polynomially-tilted pairwise interaction (RPPI) distribut...

Please sign up or login with your details

Forgot password? Click here to reset