Predicting pathways for old and new metabolites through clustering

11/28/2022
by   Thiru Siddharth, et al.
0

The diverse metabolic pathways are fundamental to all living organisms, as they harvest energy, synthesize biomass components, produce molecules to interact with the microenvironment, and neutralize toxins. While discovery of new metabolites and pathways continues, the prediction of pathways for new metabolites can be challenging. It can take vast amounts of time to elucidate pathways for new metabolites; thus, according to HMDB only 60 get assigned to pathways. Here, we present an approach to identify pathways based on metabolite structure. We extracted 201 features from SMILES annotations, and identified new metabolites from PubMed abstracts and HMDB. After applying clustering algorithms to both groups of features, we quantified correlations between metabolites, and found the clusters accurately linked 92 of known metabolites to their respective pathways. Thus, this approach could be valuable for predicting metabolic pathways for new metabolites.

READ FULL TEXT

page 2

page 7

page 8

research
02/09/2019

Clustering Bioactive Molecules in 3D Chemical Space with Unsupervised Deep Learning

Unsupervised clustering has broad applications in data stratification, p...
research
07/05/2021

Clustering Structure of Microstructure Measures

This paper builds the clustering model of measures of market microstruct...
research
11/01/2021

Living Literature Reviews

Literature reviews have long played a fundamental role in synthesizing t...
research
01/01/2020

Toward Generalized Clustering through an One-Dimensional Approach

After generalizing the concept of clusters to incorporate clusters that ...
research
06/18/2019

From Clustering to Cluster Explanations via Neural Networks

A wealth of algorithms have been developed to extract natural cluster st...
research
09/01/2019

Categorical Co-Frequency Analysis: Clustering Diagnosis Codes to Predict Hospital Readmissions

Accurately predicting patients' risk of 30-day hospital readmission woul...
research
02/15/2018

Reducing over-clustering via the powered Chinese restaurant process

Dirichlet process mixture (DPM) models tend to produce many small cluste...

Please sign up or login with your details

Forgot password? Click here to reset