LinDA: Linear Models for Differential Abundance Analysis of Microbiome Compositional Data

04/01/2021
by Huijuan Zhou, et al.

One fundamental statistical task in microbiome data analysis is differential abundance analysis, which aims to identify microbial taxa whose abundance covaries with a variable of interest. Although the main interest lies in the change in absolute abundance, i.e., the number of microbial cells per unit area or volume at an ecological site such as the human gut, the data from a sequencing experiment reflect only the relative abundances of taxa in a sample. Microbiome data are therefore compositional in nature. Analyzing such compositional data is challenging because a change in the absolute abundance of one taxon changes the relative abundances of other taxa, which makes false positive control difficult. Here we present a simple yet robust and highly scalable approach to tackle compositional effects in differential abundance analysis. The method requires only the application of established statistical tools. It fits linear regression models on the centered log-ratio transformed data, identifies a bias term due to the transformation and the compositional effect, and corrects the bias using the mode of the regression coefficients. Owing to its algorithmic simplicity, our method is 100-1000 times faster than the state-of-the-art method ANCOM-BC. Under mild assumptions, we prove its asymptotic FDR control property, making it the first differential abundance method that enjoys a theoretical FDR control guarantee. The proposed method is very flexible and can be extended to mixed-effect models for the analysis of correlated microbiome data. Using comprehensive simulations and real data applications, we demonstrate that our method achieves the best overall performance in terms of FDR control and power among competing methods. We implemented the proposed method in the R package LinDA (https://github.com/zhouhj1994/LinDA).
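
The three steps named in the abstract (centered log-ratio transform, per-taxon linear regression, mode-based bias correction) can be illustrated with a minimal Python sketch. This is not the LinDA package or its API: the function names, the pseudo-count of 0.5, the single-covariate regression, the grid-based Gaussian kernel density mode estimator, and the simulated data are all assumptions made here for illustration. The sketch also assumes that most taxa are not differentially abundant, so that the mode of the raw coefficients sits near the shared bias.

```python
import math
import random
import statistics

def clr(counts, pseudo=0.5):
    """Centered log-ratio transform of one sample (pseudo-count is an assumption)."""
    logs = [math.log(c + pseudo) for c in counts]
    center = statistics.fmean(logs)
    return [v - center for v in logs]

def ols_slope(x, y):
    """Slope of the simple linear regression y ~ 1 + x."""
    mx, my = statistics.fmean(x), statistics.fmean(y)
    sxx = sum((a - mx) ** 2 for a in x)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    return sxy / sxx

def kde_mode(values, grid_size=512):
    """Mode of a Gaussian kernel density estimate evaluated on a grid.

    Illustrative stand-in for a mode estimator; Silverman-style bandwidth.
    """
    n = len(values)
    h = 1.06 * statistics.stdev(values) * n ** (-0.2)
    if h == 0:  # all values identical
        return values[0]
    lo, hi = min(values), max(values)
    grid = [lo + (hi - lo) * i / (grid_size - 1) for i in range(grid_size)]

    def density(g):
        return sum(math.exp(-0.5 * ((g - v) / h) ** 2) for v in values)

    return max(grid, key=density)

def bias_corrected_slopes(count_rows, covariate):
    """Regress each CLR-transformed taxon on the covariate, then subtract the mode."""
    clr_rows = [clr(row) for row in count_rows]
    n_taxa = len(count_rows[0])
    raw = [ols_slope(covariate, [row[j] for row in clr_rows]) for j in range(n_taxa)]
    bias = kde_mode(raw)
    return [b - bias for b in raw], bias

# Simulated example (hypothetical data): 30 samples, 40 taxa, binary covariate.
# The first 5 taxa change in absolute abundance (log-scale effect 2.0); the rest do not.
random.seed(0)
n_samples, n_taxa = 30, 40
x = [0.0] * 15 + [1.0] * 15
true_beta = [2.0] * 5 + [0.0] * 35

counts = []
for i in range(n_samples):
    scale = random.uniform(0.5, 2.0)  # unknown sample-specific scaling; CLR removes it
    row = []
    for j in range(n_taxa):
        log_abs = math.log(1000.0) + true_beta[j] * x[i] + random.gauss(0.0, 0.1)
        row.append(round(scale * math.exp(log_abs)))
    counts.append(row)

corrected, bias = bias_corrected_slopes(counts, x)
print("estimated bias:", round(bias, 3))
print("corrected slopes, first 8 taxa:", [round(b, 2) for b in corrected[:8]])
```

In this simulation the raw CLR slopes of the unchanged taxa are shifted away from zero by roughly the negative mean of the true effects, which is the compositional bias the abstract refers to. Subtracting the estimated mode moves those slopes back toward zero while leaving the differential taxa near their true effect. Hypothesis testing and FDR control, which the method also covers, are omitted from this sketch.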

