Estimation and inference in metabolomics with non-random missing data and latent factors

09/05/2019
by   Chris McKennan, et al.
0

High throughput metabolomics data are fraught with both non-ignorable missing observations and unobserved factors that influence a metabolite's measured concentration, and it is well known that ignoring either of these complications can compromise estimators. However, current methods to analyze these data can only account for the missing data or unobserved factors, but not both. We therefore developed MetabMiss, a statistically rigorous method to account for both non-random missing data and latent factors in high throughput metabolomics data. Our methodology does not require the practitioner specify a probability model for the missing data, and makes investigating the relationship between the metabolome and tens, or even hundreds, of phenotypes computationally tractable. We demonstrate the fidelity of MetabMiss's estimates using both simulated and real metabolomics data. An R package that implements our method is available from https://github.com/chrismckennan/MetabMiss.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/17/2018

Estimating and accounting for unobserved covariates in high dimensional correlated data

Many high dimensional and high-throughput biological datasets have compl...
research
10/23/2020

Learning from missing data with the Latent Block Model

Missing data can be informative. Ignoring this information can lead to m...
research
10/13/2017

Bayes factors for partially observed stochastic epidemic models

We consider the problem of model choice for stochastic epidemic models g...
research
09/08/2020

Data Requests and Scenarios for Data Design of Unobserved Events in Corona-related Confusion Using TEEDA

Due to the global violence of the novel coronavirus, various industries ...
research
09/14/2022

Typesafe Coordinate Systems in High-Throughput Sequencing Applications

High-throughput sequencing file formats and tools encode coordinate inte...
research
11/13/2021

A Hybrid EM Algorithm for Linear Two-Way Interactions with Missing Data

We study an EM algorithm for estimating product-term regression models w...

Please sign up or login with your details

Forgot password? Click here to reset