Tree congruence: quantifying similarity between dendrogram topologies

09/11/2019
by Steven U. Vidovic, et al.

Tree congruence metrics are typically global indices that describe the similarity or dissimilarity between dendrograms. This study focuses principally on topological congruence metrics that quantify the similarity between two dendrograms and can give a normalised score between 0 and 1. Specifically, this article describes and tests two metrics: the Clade Retention Index (CRI) and the MASTxCF, which is derived from the combined information available from a maximum agreement subtree and a strict consensus. The two metrics were developed to study differences between evolutionary trees, but their applications are multidisciplinary: they can be used on hierarchical cluster diagrams derived from analyses in science, technology, maths or social sciences disciplines. A comprehensive but non-exhaustive review of other tree congruence metrics is provided, and nine metrics are further analysed. In total, 1,620 pairwise analyses of simulated dendrograms (which could be derived from any type of analysis) were conducted and are compared in Pac-man piechart matrices. Kendall's tau-b is used to demonstrate the concordance of the different metrics, and Spearman's rho rank correlations are used to support these findings. The results support the use of the CRI and MASTxCF as part of a suite of metrics, but it is recommended that permutation metrics such as SPR distances, as well as weighted metrics, be disregarded for the specific purpose of measuring similarity.
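
To make the concordance step concrete, the following is a minimal pure-Python sketch of Kendall's tau-b between the scores that two congruence metrics assign to the same set of tree pairs. It is an illustration only, not the authors' code: the function name `kendall_tau_b` and the score lists `metric_a` and `metric_b` are hypothetical, and the numbers are made up rather than taken from the study.

```python
from itertools import combinations
from math import sqrt


def kendall_tau_b(x, y):
    """Kendall's tau-b for two equal-length sequences of scores.

    tau_b = (C - D) / sqrt((C + D + Tx) * (C + D + Ty)), where C and D are
    the concordant and discordant pair counts, Tx counts pairs tied only
    in x, and Ty counts pairs tied only in y.
    """
    if len(x) != len(y):
        raise ValueError("x and y must have the same length")

    concordant = discordant = ties_x = ties_y = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        dx, dy = x1 - x2, y1 - y2
        if dx == 0 and dy == 0:
            continue  # tied in both variables: contributes to no term
        if dx == 0:
            ties_x += 1
        elif dy == 0:
            ties_y += 1
        elif dx * dy > 0:
            concordant += 1
        else:
            discordant += 1

    denom = sqrt((concordant + discordant + ties_x)
                 * (concordant + discordant + ties_y))
    if denom == 0:
        return float("nan")  # undefined when one variable is constant
    return (concordant - discordant) / denom


# Hypothetical normalised scores (0 to 1) from two metrics on five tree pairs.
metric_a = [0.10, 0.40, 0.40, 0.75, 1.00]
metric_b = [0.20, 0.35, 0.50, 0.50, 0.90]
print(kendall_tau_b(metric_a, metric_b))
```

The tau-b variant is used here because it corrects for ties, which are likely when several tree pairs receive the same normalised score. A value near 1 would indicate that the two metrics rank the tree pairs similarly, and a value near -1 that they rank them in opposite order.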


