Combinatorial and computational investigations of Neighbor-Joining bias

07/18/2020
by   Ruth Davidson, et al.
0

The Neighbor-Joining algorithm is a popular distance-based phylogenetic method that computes a tree metric from a dissimilarity map arising from biological data. Realizing dissimilarity maps as points in Euclidean space, the algorithm partitions the input space into polyhedral regions indexed by the combinatorial type of the trees returned. A full combinatorial description of these regions has not been found yet; different sequences of Neighbor-Joining agglomeration events can produce the same combinatorial tree, therefore associating multiple geometric regions to the same algorithmic output. We resolve this confusion by defining agglomeration orders on trees, leading to a bijection between distinct regions of the output space and weighted Motzkin paths. As a result, we give a formula for the number of polyhedral regions depending only on the number of taxa. We conclude with a computational comparison between these polyhedral regions, to unveil biases introduced in any implementation of the algorithm.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/11/2018

Geometric comparison of phylogenetic trees with different leaf sets

The metric space of phylogenetic trees defined by Billera, Holmes, and V...
research
11/06/2020

An Efficient Scheme for the Generation of Ordered Trees in Constant Amortized Time

Trees are useful entities allowing to model data structures and hierarch...
research
08/03/2017

Polynomial tuning of multiparametric combinatorial samplers

Boltzmann samplers and the recursive method are prominent algorithmic fr...
research
05/18/2019

Covering Metric Spaces by Few Trees

A tree cover of a metric space (X,d) is a collection of trees, so that ...
research
02/26/2020

Tuning as convex optimisation: a polynomial tuner for multi-parametric combinatorial samplers

Combinatorial samplers are algorithmic schemes devised for the approxima...
research
04/24/2023

A Transfer Principle: Universal Approximators Between Metric Spaces From Euclidean Universal Approximators

We build universal approximators of continuous maps between arbitrary Po...
research
01/24/2007

Contains and Inside relationships within combinatorial Pyramids

Irregular pyramids are made of a stack of successively reduced graphs em...

Please sign up or login with your details

Forgot password? Click here to reset