Gain-loss-duplication models on a phylogeny: exact algorithms for computing the likelihood and its gradient

07/23/2021
by   Miklos Csuros, et al.
0

Gene gain-loss-duplication models are commonly based on continuous-time birth-death processes. Employed in a phylogenetic context, such models have been increasingly popular in studies of gene content evolution across multiple genomes. While the applications are becoming more varied and demanding, bioinformatics methods for probabilistic inference on copy numbers (or integer-valued evolutionary characters, in general) are scarce. We describe a flexible probabilistic framework for phylogenetic gene-loss-duplication models. The framework is based on a novel elementary representation by dependent random variables with well-characterized conditional distributions: binomial, Pólya (negative binomial), and Poisson. The corresponding graphical model yields exact numerical procedures for computing the likelihood and the posterior distribution of ancestral copy numbers. The resulting algorithms take quadratic time in the total number of copies. In addition, we show how the likelihood gradient can be computed by a linear-time algorithm.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/08/2021

Inference and forecasting for continuous-time integer-valued trawl processes and their use in financial economics

This paper develops likelihood-based methods for estimation, inference, ...
research
08/09/2014

Robust Graphical Modeling with t-Distributions

Graphical Gaussian models have proven to be useful tools for exploring n...
research
01/10/2019

On large deviations for sums of discrete m-dependent random variables

The ratio P(S_n=x)/P(Z_n=x) is investigated for three cases: (a) when S_...
research
09/19/2010

Robust graphical modeling of gene networks using classical and alternative T-distributions

Graphical Gaussian models have proven to be useful tools for exploring n...
research
05/29/2019

Gradients do grow on trees: a linear-time O( N )-dimensional gradient for statistical phylogenetics

Calculation of the log-likelihood stands as the computational bottleneck...
research
02/26/2020

Comparing copy-number profiles under multi-copy amplifications and deletions

During cancer progression, malignant cells accumulate somatic mutations ...

Please sign up or login with your details

Forgot password? Click here to reset