Relaxed random walks at scale

06/11/2019
by Alexander A. Fisher et al.

Relaxed random walk (RRW) models of trait evolution introduce branch-specific rate multipliers that modulate the variance of a standard Brownian diffusion process along a phylogeny, allowing them to model overdispersed biological data more accurately. Increased taxonomic sampling challenges inference under RRWs because the number of unknown parameters grows with the number of taxa. To solve this problem, we present a scalable method that efficiently fits RRWs and infers this branch-specific variation in a Bayesian framework. We develop a Hamiltonian Monte Carlo (HMC) sampler that approximates the high-dimensional, correlated posterior by exploiting a closed-form evaluation of the gradient of the trait data log-likelihood with respect to all branch-rate multipliers simultaneously. Remarkably, the computational complexity of this gradient calculation scales only linearly with the number of taxa under study. We compare the efficiency of our HMC sampler to the previously standard univariable Metropolis-Hastings approach while studying the spatial emergence of the West Nile virus in North America in the early 2000s. Our method achieves a more than 300-fold speed increase over the univariable approach. Additionally, we demonstrate the scalability of our method by applying the RRW to study the correlation between mammalian adult body mass and litter size in a phylogenetic tree with 2306 tips.
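To make the role of the gradient concrete, the following is a minimal sketch of a generic HMC sampler with a leapfrog integrator, written in Python with NumPy. It is not the authors' implementation: the target is a toy two-dimensional correlated Gaussian standing in for the posterior, and the names `leapfrog`, `hmc_sample`, `step_size`, and `n_steps`, along with all tuning values, are assumptions chosen for illustration. The point it illustrates is that each proposal updates all coordinates jointly using the gradient of the log-density, in contrast to a univariable scheme that updates one parameter at a time.

```python
import numpy as np


def leapfrog(q, p, grad_log_density, step_size, n_steps):
    """Simulate Hamiltonian dynamics with the leapfrog integrator."""
    q = q.copy()
    p = p.copy()
    # Half step for momentum, then alternating full steps.
    p += 0.5 * step_size * grad_log_density(q)
    for _ in range(n_steps - 1):
        q += step_size * p
        p += step_size * grad_log_density(q)
    q += step_size * p
    # Final half step for momentum.
    p += 0.5 * step_size * grad_log_density(q)
    return q, p


def hmc_sample(log_density, grad_log_density, q0, n_samples,
               step_size, n_steps, rng):
    """Draw samples with HMC using an identity mass matrix."""
    q = np.asarray(q0, dtype=float)
    samples = np.empty((n_samples, q.size))
    n_accept = 0
    for i in range(n_samples):
        p0 = rng.standard_normal(q.size)  # refresh momentum
        q_new, p_new = leapfrog(q, p0, grad_log_density, step_size, n_steps)
        # Hamiltonian = negative log-density + kinetic energy.
        h_old = -log_density(q) + 0.5 * (p0 @ p0)
        h_new = -log_density(q_new) + 0.5 * (p_new @ p_new)
        # Metropolis correction for the integrator's discretization error.
        if np.log(rng.uniform()) < h_old - h_new:
            q = q_new
            n_accept += 1
        samples[i] = q
    return samples, n_accept / n_samples


# Toy target (an assumption, not the RRW posterior): a zero-mean Gaussian
# with correlation 0.9, whose gradient is available in closed form.
covariance = np.array([[1.0, 0.9], [0.9, 1.0]])
precision = np.linalg.inv(covariance)


def log_density(q):
    return -0.5 * (q @ precision @ q)


def grad_log_density(q):
    return -precision @ q


rng = np.random.default_rng(0)
samples, accept_rate = hmc_sample(
    log_density, grad_log_density, q0=np.zeros(2),
    n_samples=5000, step_size=0.15, n_steps=10, rng=rng,
)
print("acceptance rate:", accept_rate)
print("sample correlation:", np.corrcoef(samples.T)[0, 1])
```

In this sketch every leapfrog step costs one gradient evaluation, so the cost of a proposal is dominated by how cheaply the gradient can be computed. That is why a gradient whose cost grows only linearly with the number of taxa matters for scaling HMC to large trees.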


