Incomplete Directed Perfect Phylogeny in Linear Time

10/12/2020
by   Giulia Bernardini, et al.
0

Reconstructing the evolutionary history of a set of species is a central task in computational biology. In real data, it is often the case that some information is missing: the Incomplete Directed Perfect Phylogeny (IDPP) problem asks, given a collection of species described by a set of binary characters with some unknown states, to complete the missing states in such a way that the result can be explained with a perfect directed phylogeny. Pe'er et al. proposed a solution that takes Õ(nm) time for n species and m characters. Their algorithm relies on pre-existing dynamic connectivity data structures: a computational study recently conducted by Fernández-Baca and Liu showed that, in this context, complex data structures perform worse than simpler ones with worse asymptotic bounds. This gives us the motivation to look into the particular properties of the dynamic connectivity problem in this setting, so as to avoid the use of sophisticated data structures as a blackbox. Not only are we successful in doing so, and give a much simpler Õ(nm)-time algorithm for the IDPP problem; our insights into the specific structure of the problem lead to an asymptotically faster algorithm, that runs in optimal O(nm) time.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/19/2018

A third strike against perfect phylogeny

Perfect phylogenies are fundamental in the study of evolutionary trees b...
research
05/03/2020

Efficiently Testing Simon's Congruence

Simon's congruence ∼_k is defined as follows: two words are ∼_k-equivale...
research
05/02/2020

Pointer-Machine Algorithms for Fully-Online Construction of Suffix Trees and DAWGs on Multiple Strings

We deal with the problem of maintaining the suffix tree indexing structu...
research
01/06/2023

Simpler and faster algorithms for detours in planar digraphs

In the directed detour problem one is given a digraph G and a pair of ve...
research
11/30/2020

Vertex Sparsification for Edge Connectivity in Polynomial Time

An important open question in the area of vertex sparsification is wheth...
research
11/25/2019

Listing Conflicting Triples in Optimal Time

Different sources of information might tell different stories about the ...

Please sign up or login with your details

Forgot password? Click here to reset