Gradients of Generative Models for Improved Discriminative Analysis of Tandem Mass Spectra

09/04/2019
by John T. Halloran, et al.

Tandem mass spectrometry (MS/MS) is a high-throughput technology used to identify the proteins in a complex biological sample, such as a drop of blood. The process outputs a collection of spectra, each of which is representative of a peptide (protein subsequence) present in the original complex sample. In this work, we leverage the log-likelihood gradients of generative models to improve the identification of such spectra. In particular, we show that the gradient of a recently proposed dynamic Bayesian network (DBN) may be naturally employed by a kernel-based discriminative classifier. The resulting Fisher kernel substantially improves upon recent attempts to combine generative and discriminative models for post-processing analysis, outperforming all other methods on the evaluated datasets. We extend the improved accuracy offered by the Fisher kernel framework to other search algorithms by introducing Theseus, a DBN representing a large number of widely used MS/MS scoring functions. Furthermore, with gradient ascent and max-product inference at hand, we use Theseus to learn model parameters without any supervision.
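To make the Fisher kernel idea concrete, the following is a minimal sketch under a deliberately simple assumption: the generative model is a univariate Gaussian rather than the DBN or Theseus described above, and the function names `fisher_score` and `fisher_kernel` are hypothetical. It only illustrates the general construction, in which each observation is mapped to the gradient of its log-likelihood with respect to the model parameters, and two observations are compared through an inner product of those gradients weighted by the inverse Fisher information.

```python
def fisher_score(x, mu, sigma):
    """Gradient of log N(x | mu, sigma^2) with respect to (mu, sigma).

    Toy stand-in for the log-likelihood gradient of a generative model;
    the Gaussian is an assumption made purely for illustration.
    """
    d_mu = (x - mu) / sigma**2
    d_sigma = -1.0 / sigma + (x - mu) ** 2 / sigma**3
    return (d_mu, d_sigma)


def fisher_kernel(x, y, mu, sigma):
    """Fisher kernel K(x, y) = U_x^T I^{-1} U_y for the toy Gaussian model."""
    ux = fisher_score(x, mu, sigma)
    uy = fisher_score(y, mu, sigma)
    # For parameters (mu, sigma) the Fisher information is
    # diag(1 / sigma^2, 2 / sigma^2), so its inverse is diagonal too.
    inv_info = (sigma**2, sigma**2 / 2.0)
    return sum(w * a * b for w, a, b in zip(inv_info, ux, uy))


if __name__ == "__main__":
    # Kernel value between two observations under a standard normal model.
    print(fisher_kernel(1.0, 2.0, mu=0.0, sigma=1.0))
```

In a setting like the one the abstract describes, the resulting kernel values (or the gradient vectors themselves, used as features) would be handed to a kernel-based discriminative classifier; how the paper does this for its DBN is not shown here.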


