Towards Normalizing the Edit Distance Using a Genetic Algorithms Based Scheme

by   Muhammad Marwan Muhammad Fuad, et al.

The normalized edit distance is one of the distances derived from the edit distance. It is useful in some applications because it takes into account the lengths of the two strings compared. The normalized edit distance is not defined in terms of edit operations but rather in terms of the edit path. In this paper we propose a new derivative of the edit distance that also takes into consideration the lengths of the two strings, but the new distance is related directly to the edit distance. The particularity of the new distance is that it uses the genetic algorithms to set the values of the parameters it uses. We conduct experiments to test the new distance and we obtain promising results.



page 6


Constant-factor approximation of near-linear edit distance in near-linear time

We show that the edit distance between two strings of length n can be co...

Soft edit distance for differentiable comparison of symbolic sequences

Edit distance, also known as Levenshtein distance, is an essential way t...

Algorithme de recherche approximative dans un dictionnaire fondé sur une distance d'édition définie par blocs

We propose an algorithm for approximative dictionary lookup, where alter...

Discovering Lexical Similarity Through Articulatory Feature-based Phonetic Edit Distance

Lexical Similarity (LS) between two languages uncovers many interesting ...

An Integrated Soft Computing Approach to a Multi-biometric Security Model

The abstract of the thesis consists of three sections, videlicet, Moti...

MinJoin: Efficient Edit Similarity Joins via Local Hash Minimums

In this paper we study edit similarity joins, in which we are given a se...

Imagined-Trailing-Whitespace-Agnostic Levenshtein Distance For Plaintext Table Detection

The standard algorithm for Levenshtein distance, treats trailing whitesp...
This week in AI

Get the week's most popular data science and artificial intelligence research sent straight to your inbox every Saturday.