Matrices inducing generalized metric on sequences

03/15/2023
by   Eloi Araujo, et al.
0

Sequence comparison is a basic task to capture similarities and differences between two or more sequences of symbols, with countless applications such as in computational biology. An alignment is a way to compare sequences, where a giving scoring function determines the degree of similarity between them. Many scoring functions are obtained from scoring matrices. However,not all scoring matrices induce scoring functions which are distances, since the scoring function is not necessarily a metric. In this work we establish necessary and sufficient conditions for scoring matrices to induce each one of the properties of a metric in weighted edit distances. For a subset of scoring matrices that induce normalized edit distances, we also characterize each class of scoring matrices inducing normalized edit distances. Furthermore, we define an extended edit distance, which takes into account a set of editing operations that transforms one sequence into another regardless of the existence of a usual corresponding alignment to represent them, describing a criterion to find a sequence of edit operations whose weight is minimum. Similarly, we determine the class of scoring matrices that induces extended edit distances for each of the properties of a metric.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/06/2013

Towards Normalizing the Edit Distance Using a Genetic Algorithms Based Scheme

The normalized edit distance is one of the distances derived from the ed...
research
05/07/2019

Transparent pronunciation scoring using articulatorily weighted phoneme edit distance

For researching effects of gamification in foreign language learning for...
research
07/04/2021

Algorithms for normalized multiple sequence alignments

Sequence alignment supports numerous tasks in bioinformatics, natural la...
research
08/30/2017

Optimizing scoring function of dynamic programming of pairwise profile alignment using derivative free neural network

A profile comparison method with position-specific scoring matrix (PSSM)...
research
11/08/2016

An Automated System for Essay Scoring of Online Exams in Arabic based on Stemming Techniques and Levenshtein Edit Operations

In this article, an automated system is proposed for essay scoring in Ar...
research
10/05/2016

A tentative model for dimensionless phoneme distance from binary distinctive features

This work proposes a tentative model for the calculation of dimensionles...
research
08/29/2014

Binary matrices of optimal autocorrelations as alignment marks

We define a new class of binary matrices by maximizing the peak-sidelobe...

Please sign up or login with your details

Forgot password? Click here to reset