A nonparametric spatial test to identify factors that shape a microbiome

06/16/2018
by   Susheela P. Singh, et al.
0

The advent of high-throughput sequencing technologies has made data from DNA material readily available, leading to a surge of microbiome-related research establishing links between markers of microbiome health and specific outcomes. However, to harness the power of microbial communities we must understand not only how they affect us, but also how they can be influenced to improve outcomes. This area has been dominated by methods that reduce community composition to summary metrics, which can fail to fully exploit the complexity of community data. Recently, methods have been developed to model the abundance of taxa in a community, but they can be computationally intensive and do not account for spatial effects underlying microbial settlement. These spatial effects are particularly relevant in the microbiome setting because we expect communities that are close together to be more similar than those that are far apart. In this paper, we propose a flexible Bayesian spike-and-slab variable selection model for presence-absence indicators that accounts for spatial dependence and cross-dependence between taxa while reducing dimensionality in both directions. We show by simulation that in the presence of spatial dependence, popular distance-based hypothesis testing methods fail to preserve their advertised size, and the proposed method improves variable selection. Finally, we present an application of our method to an indoor fungal community found with homes across the contiguous United States.

READ FULL TEXT
research
02/23/2021

Identifying Gene-environment interactions with robust marginal Bayesian variable selection

In high-throughput genetics studies, an important aim is to identify gen...
research
09/17/2018

Spatial Variable Selection and An Application to Virginia Lyme Disease Emergence

Lyme disease is an infectious disease that is caused by a bacterium call...
research
10/15/2019

New Development of Bayesian Variable Selection Criteria for Spatial Point Process with Applications

Selecting important spatial-dependent variables under the nonhomogeneous...
research
03/22/2022

Bayesian outcome selection modelling

Psychiatric and social epidemiology often involves assessing the effects...
research
06/17/2020

Using machine learning to identify nontraditional spatial dependence in occupancy data

Occupancy data are spatially referenced contaminated binary responses us...
research
01/05/2021

Evaluating Fairness in the Presence of Spatial Autocorrelation

Fairness considerations for spatial data often get confounded by the und...
research
05/28/2019

Global forensic geolocation with deep neural networks

An important problem in forensic analyses is identifying the provenance ...

Please sign up or login with your details

Forgot password? Click here to reset