Longest Common Subsequence: Tabular vs. Closed-Form Equation Computation of Subsequence Probability

06/23/2022
by Alireza Abdi, et al.

The Longest Common Subsequence (LCS) problem is to find the longest subsequence common to all strings in a given set. The LCS problem is NP-hard, which has motivated considerable effort to find better solutions with heuristic methods. The baseline for the best-known heuristic functions is a tabular, probabilistic approach that approximates the length of the LCS in each iteration. Combining beam search with tabular, probability-based heuristics has led to a large number of proposed algorithms and improvements for the LCS problem. In this work, we introduce, for the first time, a closed-form equation for the probabilistic table calculation. We also present equivalent forms of the closed-form equation and prove all of them. The closed-form equation opens new avenues for analysis and further approximation. Using these theorems together with beam search, we propose an analytic method for estimating the length of the LCS of the remaining subsequence. Furthermore, we present another heuristic function based on the coefficient of variation. The results show that our proposed methods outperform state-of-the-art methods on the LCS problem.
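
The abstract does not state the table recurrence or the closed-form equation, so the following is only a minimal sketch under stated assumptions. It assumes the table entry P(k, n) is the probability that a fixed string of length k is a subsequence of a uniformly random string of length n over an alphabet of size sigma, filled with the recurrence P(k, n) = (1/sigma) P(k-1, n-1) + (1 - 1/sigma) P(k, n-1). The non-tabular function uses the standard binomial-tail identity for that probability; it may or may not match the closed form proved in the paper. The names `prob_table` and `prob_binomial_tail` are hypothetical.

```python
from math import comb


def prob_table(max_k, max_n, sigma):
    """Fill P[k][n] by dynamic programming (assumed recurrence, see lead-in).

    P[k][n] is taken to be the probability that a fixed string of length k
    is a subsequence of a uniformly random string of length n over an
    alphabet with sigma symbols.
    """
    p = 1.0 / sigma
    P = [[0.0] * (max_n + 1) for _ in range(max_k + 1)]
    # The empty string is a subsequence of every string.
    for n in range(max_n + 1):
        P[0][n] = 1.0
    # P[k][0] stays 0.0 for k >= 1, which also forces P[k][n] = 0 when k > n.
    for k in range(1, max_k + 1):
        for n in range(1, max_n + 1):
            # Either the next random symbol matches the next needed symbol
            # (probability p) or it does not (probability 1 - p).
            P[k][n] = p * P[k - 1][n - 1] + (1.0 - p) * P[k][n - 1]
    return P


def prob_binomial_tail(k, n, sigma):
    """Same quantity without a table: probability of at least k matches
    among n independent trials, each succeeding with probability 1/sigma."""
    if k > n:
        return 0.0
    p = 1.0 / sigma
    return sum(comb(n, i) * p**i * (1.0 - p) ** (n - i) for i in range(k, n + 1))


if __name__ == "__main__":
    table = prob_table(5, 10, sigma=4)
    print(table[3][10], prob_binomial_tail(3, 10, sigma=4))
```

The table costs O(k·n) time and memory once and then answers any lookup in constant time, while the direct sum needs no precomputation; which is preferable inside a beam search depends on how many distinct (k, n) pairs are queried.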


research
12/03/2022

The Chvátal-Sankoff problem: Understanding random string comparison through stochastic processes

Given two equally long, uniformly random binary strings, the expected le...
research
12/03/2022

Longest Common Substring in Longest Common Subsequence's Solution Service: A Novel Hyper-Heuristic

The Longest Common Subsequence (LCS) is the problem of finding a subsequ...
research
03/28/2019

Probabilistic Analysis of Facility Location on Random Shortest Path Metrics

The facility location problem is an NP-hard optimization problem. Theref...
research
10/23/2019

Closed-Form Full Map Posteriors for Robot Localization with Lidar Sensors

A popular class of lidar-based grid mapping algorithms computes for each...
research
09/09/2015

A Topological Approach to Meta-heuristics: Analytical Results on the BFS vs. DFS Algorithm Selection Problem

Search is a central problem in artificial intelligence, and BFS and DFS ...
research
03/27/2013

A Polynomial Time Algorithm for Finding Bayesian Probabilities from Marginal Constraints

A method of calculating probability values from a system of marginal con...
