Ancestral state reconstruction with large numbers of sequences and edge-length estimation

03/31/2021
by   Lam Si Tung Ho, et al.
0

Likelihood-based methods are widely considered the best approaches for reconstructing ancestral states. Although much effort has been made to study properties of these methods, previous works often assume that both the tree topology and edge lengths are known. In some scenarios the tree topology might be reasonably well known for the taxa under study. When sequence length is much smaller than the number of species, however, edge lengths are not likely to be accurately estimated. We study the consistency of the maximum likelihood and empirical Bayes estimators of ancestral state of discrete traits in such settings under a star tree. We prove that the likelihood-based reconstruction is consistent under symmetric models but can be inconsistent under non-symmetric models. We show, however, that a simple consistent estimator for the ancestral states is available under non-symmetric models. The results illustrate that likelihood methods can unexpectedly have undesirable properties as the number of sequences considered get very large. Broader implications of the results are discussed.

READ FULL TEXT
research
11/02/2018

Optimal Sequence Length Requirements for Phylogenetic Tree Reconstruction with Indels

We consider the phylogenetic tree reconstruction problem with insertions...
research
09/12/2019

A taxonomy of estimator consistency on discrete estimation problems

We describe a four-level hierarchy mapping both all discrete estimation ...
research
09/24/2020

Reciprocal Maximum Likelihood Degrees of Brownian Motion Tree Models

We give an explicit formula for the reciprocal maximum likelihood degree...
research
09/27/2021

Non-destructive methods for assessing tree fiber length distributions in standing trees

One of the main concerns of silviculture and forest management focuses o...
research
03/10/2019

On the convergence of the maximum likelihood estimator for the transition rate under a 2-state symmetric model

Maximum likelihood estimators are used extensively to estimate unknown p...
research
06/15/2022

Reconstructing Ultrametric Trees from Noisy Experiments

The problem of reconstructing evolutionary trees or phylogenies is of gr...
research
03/07/2018

Long-branch attraction in species tree estimation: inconsistency of partitioned likelihood and topology-based summary methods

With advances in sequencing technologies, there are now massive amounts ...

Please sign up or login with your details

Forgot password? Click here to reset