Self-contained Beta-with-Spikes Approximation for Inference Under a Wright-Fisher Model

03/08/2023
by   Juan Guerrero Montero, et al.
0

We construct a reliable estimation of evolutionary parameters within the Wright-Fisher model, which describes changes in allele frequencies due to selection and genetic drift, from time-series data. Such data exists for biological populations, for example via artificial evolution experiments, and for the cultural evolution of behavior, such as linguistic corpora that document historical usage of different words with similar meanings. Our method of analysis builds on a Beta-with-Spikes approximation to the distribution of allele frequencies predicted by the Wright-Fisher model. We introduce a self-contained scheme for estimating the parameters in the approximation, and demonstrate its robustness with synthetic data, especially in the strong-selection and near-extinction regimes where previous approaches fail. We further apply to allele frequency data for baker's yeast (Saccharomyces cerevisiae), finding a significant signal of selection in cases where independent evidence supports such a conclusion. We further demonstrate the possibility of detecting time-points at which evolutionary parameters change in the context of a historical spelling reform in the Spanish language.

READ FULL TEXT
research
05/25/2023

Reliable identification of selection mechanisms in language change

Language change is a cultural evolutionary process in which variants of ...
research
11/07/2017

Bayesian Inference of Selection in the Wright-Fisher Diffusion Model

The increasing availability of population-level allele frequency data ac...
research
09/07/2021

Mutation frequency time series reveal complex mixtures of clones in the world-wide SARS-CoV-2 viral population

We compute the allele frequencies of the alpha (B.1.1.7), beta (B.1.351)...
research
02/03/2020

Phylogenetic signal in phonotactics

Phylogenetic methods have broad potential in linguistics beyond tree inf...
research
06/02/2018

Quantifying the dynamics of topical fluctuations in language

The availability of large diachronic corpora has provided the impetus fo...
research
12/22/2022

Small time approximation in Wright-Fisher diffusion

Wright-Fisher model has been widely used to represent random variation i...
research
07/21/2021

A Statistical Model of Word Rank Evolution

The availability of large linguistic data sets enables data-driven appro...

Please sign up or login with your details

Forgot password? Click here to reset