Analysis of the Relationships among Longest Common Subsequences, Shortest Common Supersequences and Patterns and its application on Pattern Discovery in Biological Sequences

03/13/2009
by   Kang Ning, et al.
0

For a set of mulitple sequences, their patterns,Longest Common Subsequences (LCS) and Shortest Common Supersequences (SCS) represent different aspects of these sequences profile, and they can all be used for biological sequence comparisons and analysis. Revealing the relationship between the patterns and LCS,SCS might provide us with a deeper view of the patterns of biological sequences, in turn leading to better understanding of them. However, There is no careful examinaton about the relationship between patterns, LCS and SCS. In this paper, we have analyzed their relation, and given some lemmas. Based on their relations, a set of algorithms called the PALS (PAtterns by Lcs and Scs) algorithms are propsoed to discover patterns in a set of biological sequences. These algorithms first generate the results for LCS and SCS of sequences by heuristic, and consequently derive patterns from these results. Experiments show that the PALS algorithms perform well (both in efficiency and in accuracy) on a variety of sequences. The PALS approach also provides us with a solution for transforming between the heuristic results of SCS and LCS.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/15/2012

Classification of Approaches and Challenges of Frequent Subgraphs Mining in Biological Networks

Understanding the structure and dynamics of biological networks is one o...
research
04/04/2022

Explicit and Implicit Pattern Relation Analysis for Discovering Actionable Negative Sequences

Real-life events, behaviors and interactions produce sequential data. An...
research
09/02/2020

A Study of Opacity Ranges for Transparent Overlays in 3D Landscapes

When visualizing data in a realistically rendered 3D virtual environment...
research
03/06/2022

An Interactive Gameplay to Crowdsource Multiple Sequence Alignment of Genome Sequences: Genenigma

Comparative genomics is a field of research that compares genomes of dif...
research
09/26/2022

ImmunoLingo: Linguistics-based formalization of the antibody language

Apparent parallels between natural language and biological sequence have...
research
05/30/2018

A Survey of the State-of-the-Art Parallel Multiple Sequence Alignment Algorithms on Multicore Systems

Evolutionary modeling applications are the best way to provide full info...
research
12/22/2015

Keeping it Short and Simple: Summarising Complex Event Sequences with Multivariate Patterns

We study how to obtain concise descriptions of discrete multivariate seq...

Please sign up or login with your details

Forgot password? Click here to reset