Optimizing scoring function of dynamic programming of pairwise profile alignment using derivative free neural network

08/30/2017
by Kazunori D Yamada, et al.

A profile comparison method using position-specific scoring matrices (PSSMs) is one of the most accurate alignment methods. Currently, cosine similarity and the correlation coefficient are used as scoring functions in dynamic programming to calculate the similarity between PSSMs. However, it is unclear whether these functions are optimal for profile alignment. At the very least, by definition, they cannot capture non-linear relationships between profiles. In this study, we therefore sought a novel scoring function better suited to profile comparison than the existing ones. First, we implemented a new derivative-free neural network by combining a conventional neural network with an evolutionary strategy optimization method. Next, using this framework, we optimized the scoring function for aligning remote sequence pairs. Nepal, the pairwise profile aligner with the novel scoring function, significantly improved both alignment sensitivity and precision compared to aligners with the existing functions. Nepal improved alignment quality because it was adapted to remote sequence alignment and because its similarity score has greater expressive power. The novel scoring function can be implemented as a simple matrix operation and easily incorporated into other aligners. With our scoring function, the performance of homology detection and/or multiple sequence alignment for remote homologous sequences could be further improved.
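To make the contrast in the abstract concrete, the following is a minimal sketch of the two existing scoring functions (cosine similarity and the correlation coefficient between two PSSM columns) next to a small feed-forward network that maps the same pair of columns to a score. It is not the authors' implementation: the function names, the single hidden layer, the ReLU activation, and the way the two columns are concatenated are all assumptions made for illustration, and the abstract does not specify Nepal's actual architecture or weights.

```python
import math


def cosine_similarity(p, q):
    """Cosine similarity between two PSSM columns (equal-length vectors).

    Assumes neither vector is all zeros; otherwise this divides by zero.
    """
    dot = sum(a * b for a, b in zip(p, q))
    norm_p = math.sqrt(sum(a * a for a in p))
    norm_q = math.sqrt(sum(b * b for b in q))
    return dot / (norm_p * norm_q)


def correlation_coefficient(p, q):
    """Pearson correlation, computed as the cosine of the mean-centered vectors.

    Assumes neither vector is constant; otherwise the centered norm is zero.
    """
    mean_p = sum(p) / len(p)
    mean_q = sum(q) / len(q)
    centered_p = [a - mean_p for a in p]
    centered_q = [b - mean_q for b in q]
    return cosine_similarity(centered_p, centered_q)


def nn_score(p, q, w_hidden, b_hidden, w_out, b_out):
    """Hypothetical non-linear scoring function (illustrative only).

    The two columns are concatenated, passed through one hidden layer with
    a ReLU activation (an assumption), and reduced to a single score.
    w_hidden is a list of rows, one row of weights per hidden unit.
    """
    x = list(p) + list(q)
    hidden = [
        max(0.0, sum(w * v for w, v in zip(row, x)) + b)
        for row, b in zip(w_hidden, b_hidden)
    ]
    return sum(w * h for w, h in zip(w_out, hidden)) + b_out
```

In a dynamic programming aligner, any of these three functions could fill the role of the score for matching position i of one profile against position j of the other. Because `nn_score` does not use gradients here, its weights could in principle be tuned by a derivative-free optimizer such as an evolutionary strategy, which is the general idea the abstract describes.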

