Pairwise sequence alignment at arbitrarily large evolutionary distance

07/25/2022
by   Brandon Legried, et al.
0

Ancestral sequence reconstruction is a key task in computational biology. It consists in inferring a molecular sequence at an ancestral species of a known phylogeny, given descendant sequences at the tip of the tree. In addition to its many biological applications, it has played a key role in elucidating the statistical performance of phylogeny estimation methods. Here we establish a formal connection to another important bioinformatics problem, multiple sequence alignment, where one attempts to best align a collection of molecular sequences under some mismatch penalty score by inserting gaps. Our result is counter-intuitive: we show that perfect pairwise sequence alignment with high probability is possible in principle at arbitrary large evolutionary distances - provided the phylogeny is known and dense enough. We use techniques from ancestral sequence reconstruction in the taxon-rich setting together with the probabilistic analysis of sequence evolution models involving insertions and deletions.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/23/2010

Evolutionary distances in the twilight zone -- a rational kernel approach

Phylogenetic tree reconstruction is traditionally based on multiple sequ...
research
07/18/2017

Efficient and consistent inference of ancestral sequences in an evolutionary model with insertions and deletions under dense taxon sampling

In evolutionary biology, the speciation history of living organisms is r...
research
09/26/2016

Robust Time-Series Retrieval Using Probabilistic Adaptive Segmental Alignment

Traditional pairwise sequence alignment is based on matching individual ...
research
05/30/2018

A Survey of the State-of-the-Art Parallel Multiple Sequence Alignment Algorithms on Multicore Systems

Evolutionary modeling applications are the best way to provide full info...
research
02/29/2016

Bioinformatics and Classical Literary Study

This paper describes the Quantitative Criticism Lab, a collaborative ini...
research
11/02/2018

Optimal Sequence Length Requirements for Phylogenetic Tree Reconstruction with Indels

We consider the phylogenetic tree reconstruction problem with insertions...
research
07/10/2023

A Linear Time Quantum Algorithm for Pairwise Sequence Alignment

Sequence Alignment is the process of aligning biological sequences in or...

Please sign up or login with your details

Forgot password? Click here to reset