A Survey of the State-of-the-Art Parallel Multiple Sequence Alignment Algorithms on Multicore Systems

05/30/2018
by   Sara Shehab, et al.
0

Evolutionary modeling applications are the best way to provide full information to support in-depth understanding of evaluation of organisms. These applications mainly depend on identifying the evolutionary history of existing organisms and understanding the relations between them, which is possible through the deep analysis of their biological sequences. Multiple Sequence Alignment (MSA) is considered an important tool in such applications, where it gives an accurate representation of the relations between different biological sequences. In literature, many efforts have been put into presenting a new MSA algorithm or even improving existing ones. However, little efforts on optimizing parallel MSA algorithms have been done. Nowadays, large datasets become a reality, and big data become a primary challenge in various fields, which should be also a new milestone for new bioinformatics algorithms. This survey presents four of the state-of-the-art parallel MSA algorithms, TCoffee, MAFFT, MSAProbs, and M2Align. We provide a detailed discussion of each algorithm including its strengths, weaknesses, and implementation details and the effectiveness of its parallel implementation compared to the other algorithms, taking into account the MSA accuracy on two different datasets, BAliBASE and OXBench.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/04/2021

Algorithms for normalized multiple sequence alignments

Sequence alignment supports numerous tasks in bioinformatics, natural la...
research
11/23/2010

Evolutionary distances in the twilight zone -- a rational kernel approach

Phylogenetic tree reconstruction is traditionally based on multiple sequ...
research
07/25/2022

Pairwise sequence alignment at arbitrarily large evolutionary distance

Ancestral sequence reconstruction is a key task in computational biology...
research
07/04/2018

Analyzing Big Datasets of Genomic Sequences: Fast and Scalable Collection of k-mer Statistics

Distributed approaches based on the map-reduce programming paradigm have...
research
03/01/2022

Counting stars: a survey on flexible Skyline Query approaches

Nowadays, as the quantity of data to process began to rise, so did the n...
research
04/29/2023

Maximum Match Subsequence Alignment Algorithm Finely Grained (MMSAA FG)

Sequence alignment is common nowadays as it is used in many fields to de...

Please sign up or login with your details

Forgot password? Click here to reset