Statistical Methods and Workflow for Analyzing Human Metabolomics Data

10/10/2017
by Joseph Antonelli, et al.

High-throughput metabolomics investigations, when conducted in large human cohorts, represent a potentially powerful tool for elucidating the biochemical diversity and mechanisms underlying human health and disease. Large-scale metabolomics data, generated using targeted or nontargeted platforms, are increasingly common. Appropriate statistical analysis of these complex high-dimensional data is critical for extracting meaningful results from such large-scale human metabolomics studies. Herein, we consider the main statistical analytical approaches that have been employed in human metabolomics studies. Based on the lessons learned and collective experience to date in the field, we propose a step-by-step framework for pursuing statistical analyses of human metabolomics data. We discuss the range of options and potential approaches that may be employed at each stage of data management, analysis, and interpretation, and offer guidance on analytical considerations that are important for implementing an analysis workflow. Certain pervasive analytical challenges facing human metabolomics warrant ongoing research. Addressing these challenges will allow for greater standardization in the field and lead to analytical advances in metabolomics investigations with the potential to elucidate novel mechanisms underlying human health and disease.


