Parallel Longest Common SubSequence Analysis In Chapel

09/16/2023
by   Soroush Vahidi, et al.

The longest common subsequence (LCS) problem is one of the most important problems in the field of string algorithms. It is NP-hard for an arbitrary number of strings but can be solved in polynomial time for a fixed number of strings. In this paper, we select a typical parallel LCS algorithm and integrate it into our large-scale string analysis algorithm library to support different types of large string analysis. Specifically, we use the high-level parallel language Chapel to integrate Lu and Liu's parallel LCS algorithm into Arkouda, an open-source framework. Through Arkouda, data scientists can run large string analytics on back-end high-performance computing resources from a front-end Python interface. The Chapel-enabled parallel LCS algorithm identifies the longest common subsequences of two strings, and experimental results show how the number of parallel resources and the length of the input strings affect the algorithm's performance.
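
For readers unfamiliar with the two-string case, the following is a minimal sketch of the textbook sequential dynamic-programming solution, which runs in time proportional to the product of the two string lengths. It is not Lu and Liu's parallel algorithm and not the Chapel or Arkouda implementation described in the paper; the function name `lcs` and its signature are assumptions made here for illustration only.

```python
def lcs(a: str, b: str) -> str:
    """Return one longest common subsequence of a and b.

    Illustrative textbook DP only; `lcs` is a hypothetical helper,
    not part of Arkouda or the paper's Chapel code.
    """
    m, n = len(a), len(b)
    # table[i][j] = length of an LCS of a[:i] and b[:j]
    table = [[0] * (n + 1) for _ in range(m + 1)]
    for i in range(1, m + 1):
        for j in range(1, n + 1):
            if a[i - 1] == b[j - 1]:
                table[i][j] = table[i - 1][j - 1] + 1
            else:
                table[i][j] = max(table[i - 1][j], table[i][j - 1])

    # Walk back from the bottom-right corner to recover one LCS.
    out = []
    i, j = m, n
    while i > 0 and j > 0:
        if a[i - 1] == b[j - 1]:
            out.append(a[i - 1])
            i -= 1
            j -= 1
        elif table[i - 1][j] >= table[i][j - 1]:
            i -= 1
        else:
            j -= 1
    return "".join(reversed(out))


if __name__ == "__main__":
    print(lcs("ABCBDAB", "BDCABA"))
```

Each table cell depends only on its left, upper, and upper-left neighbours, which is the data dependency that parallel LCS algorithms in general have to work around.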

research
08/02/2023

An Algorithm for the Constrained Longest Common Subsequence and Substring Problem

Let Σ be an alphabet. For two strings X, Y, and a constrained string P o...
research
07/17/2018

On Computing Centroids According to the p-Norms of Hamming Distance Vectors

In this paper we consider the p-Norm Hamming Centroid problem which asks...
research
11/02/2021

From Strings to Data Science: a Practical Framework for Automated String Handling

Many machine learning libraries require that string features be converte...
research
10/04/2022

Designing a parallel suffix sort

Suffix sort plays a critical role in various computational algorithms in...
research
06/03/2020

LCP-Aware Parallel String Sorting

When lexicographically sorting strings, it is not always necessary to in...
research
11/29/2017

A critical analysis of string APIs: The case of Pharo

Most programming languages, besides C, provide a native abstraction for ...
research
07/25/2023

A Compact DAG for Storing and Searching Maximal Common Subsequences

Maximal Common Subsequences (MCSs) between two strings X and Y are subse...
