Learning Seasonal Phytoplankton Communities with Topic Models

11/19/2017
by   Arnold Kalmbach, et al.
0

In this work we develop and demonstrate a probabilistic generative model for phytoplankton communities. The proposed model takes counts of a set of phytoplankton taxa in a timeseries as its training data, and models communities by learning sparse co-occurrence structure between the taxa. Our model is probabilistic, where communities are represented by probability distributions over the species, and each time-step is represented by a probability distribution over the communities. The proposed approach uses a non-parametric, spatiotemporal topic model to encourage the communities to form an interpretable representation of the data, without making strong assumptions about the communities. We demonstrate the quality and interpretability of our method by its ability to improve performance of a simplistic regression model. We show that simple linear regression is sufficient to predict the community distribution learned by our method, and therefore the taxon distributions, from a set of naively chosen environment variables. In contrast, a similar regression model is insufficient to predict the taxon distributions directly or through PCA with the same level of accuracy.

READ FULL TEXT

page 1

page 3

page 5

page 7

research
06/20/2018

Non-Parametric Calibration of Probabilistic Regression

The task of calibration is to retrospectively adjust the outputs from a ...
research
05/28/2020

Model selection for ecological community data using tree shrinkage priors

Researchers and managers model ecological communities to infer the bioti...
research
09/28/2018

On wavelets to select the parametric form of a regression model

Let Y be a response variable related with a set of explanatory variables...
research
12/09/2021

Fair Structure Learning in Heterogeneous Graphical Models

Inference of community structure in probabilistic graphical models may n...
research
04/26/2022

Multivariate and regression models for directional data based on projected Pólya trees

Projected distributions have proved to be useful in the study of circula...
research
03/09/2023

Probabilistic 3d regression with projected huber distribution

Estimating probability distributions which describe where an object is l...
research
03/18/2014

Communication Communities in MOOCs

Massive Open Online Courses (MOOCs) bring together thousands of people f...

Please sign up or login with your details

Forgot password? Click here to reset