Using Signal Processing in Tandem With Adapted Mixture Models for Classifying Genomic Signals

11/03/2022
by   Saish Jaiswal, et al.
0

Genomic signal processing has been used successfully in bioinformatics to analyze biomolecular sequences and gain varied insights into DNA structure, gene organization, protein binding, sequence evolution, etc. But challenges remain in finding the appropriate spectral representation of a biomolecular sequence, especially when multiple variable-length sequences need to be handled consistently. In this study, we address this challenge in the context of the well-studied problem of classifying genomic sequences into different taxonomic units (strain, phyla, order, etc.). We propose a novel technique that employs signal processing in tandem with Gaussian mixture models to improve the spectral representation of a sequence and subsequently the taxonomic classification accuracies. The sequences are first transformed into spectra, and projected to a subspace, where sequences belonging to different taxons are better distinguishable. Our method outperforms a similar state-of-the-art method on established benchmark datasets by an absolute margin of 6.06 accuracy.

READ FULL TEXT
research
12/12/2017

Encoding DNA sequences by integer chaos game representation

DNA sequences are fundamental for encoding genetic information. The gene...
research
08/03/2022

Graph Signal Processing for Heterogeneous Change Detection Part II: Spectral Domain Analysis

This is the second part of the paper that provides a new strategy for th...
research
05/13/2018

Enhanced Signal Recovery via Sparsity Inducing Image Priors

Parsimony in signal representation is a topic of active research. Sparse...
research
06/20/2019

Adversarial Self-Paced Learning for Mixture Models of Hawkes Processes

We propose a novel adversarial learning strategy for mixture models of H...
research
07/28/2017

An Open Source C++ Implementation of Multi-Threaded Gaussian Mixture Models, k-Means and Expectation Maximisation

Modelling of multivariate densities is a core component in many signal p...
research
07/13/2021

Fast approximations of the Jeffreys divergence between univariate Gaussian mixture models via exponential polynomial densities

The Jeffreys divergence is a renown symmetrization of the statistical Ku...

Please sign up or login with your details

Forgot password? Click here to reset