Prefix-tree Decoding for Predicting Mass Spectra from Molecules

03/11/2023
by Samuel Goldman, et al.

Computational predictions of mass spectra from molecules have enabled the discovery of clinically relevant metabolites. However, such predictive tools are still limited as they occupy one of two extremes: they either (a) fragment molecules combinatorially, with overly rigid constraints on potential rearrangements and poor time complexity, or (b) decode lossy and nonphysical discretized spectrum vectors. In this work, we introduce a new intermediate strategy for predicting mass spectra from molecules by treating mass spectra as sets of chemical formulae, which are themselves multisets of atoms. After first encoding an input molecular graph, we decode a set of chemical subformulae, each of which specifies a predicted peak in the mass spectrum; the intensities of these peaks are predicted by a second model. Our key insight is to overcome the combinatorial possibilities for chemical subformulae by decoding the formula set using a prefix tree structure, atom-type by atom-type, representing a general method for ordered multiset decoding. We show promising empirical results on mass spectra prediction tasks.
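To make the prefix-tree idea concrete, the following is a minimal sketch of decoding subformulae atom-type by atom-type. It is an illustration only and not the authors' model: the element ordering `ELEMENT_ORDER`, the function `decode_subformulae`, and the hand-written rule `toy_keep` are all hypothetical names, and `toy_keep` merely stands in for whatever learned decision a real decoder would make at each tree node. Each level of the tree fixes the count of one element, and a branch is expanded only if its prefix is kept, so most of the combinatorial space is never enumerated.

```python
from typing import Callable, Dict, List, Tuple

# Assumed fixed element ordering; each tree level decides the count of one element.
ELEMENT_ORDER = ["C", "H", "N", "O"]

Prefix = Tuple[int, ...]
KeepFn = Callable[[Prefix, str, int], bool]


def decode_subformulae(parent: Dict[str, int], keep: KeepFn) -> List[Dict[str, int]]:
    """Expand a prefix tree over element counts, one element per level.

    `parent` is the formula of the input molecule; a subformula can never
    contain more atoms of an element than the parent does.
    `keep(prefix, element, count)` decides whether the child node that
    extends `prefix` with `count` atoms of `element` is expanded further.
    """
    prefixes: List[Prefix] = [()]  # root of the tree: no counts chosen yet
    for element in ELEMENT_ORDER:
        max_count = parent.get(element, 0)
        next_prefixes: List[Prefix] = []
        for prefix in prefixes:
            for count in range(max_count + 1):
                if keep(prefix, element, count):
                    next_prefixes.append(prefix + (count,))
        prefixes = next_prefixes  # pruned branches are never revisited
    # Each surviving full-length prefix is one predicted subformula.
    return [dict(zip(ELEMENT_ORDER, p)) for p in prefixes]


def toy_keep(prefix: Prefix, element: str, count: int) -> bool:
    """Hypothetical stand-in for a learned per-node decision."""
    if element == "C":
        return count >= 1                  # require at least one carbon
    if element == "H":
        return count == 2 * prefix[0] + 1  # tie hydrogens to the carbon count
    return True                            # accept any N and O count


ethanol = {"C": 2, "H": 6, "O": 1}
fragments = decode_subformulae(ethanol, toy_keep)
for fragment in fragments:
    print(fragment)
```

For this toy parent formula, exhaustive enumeration would visit 3 × 7 × 1 × 2 = 42 candidate subformulae, whereas the pruned tree keeps only the prefixes accepted at each level. A second model, as described above, would then assign an intensity to each surviving subformula; that step is not sketched here.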


