Efficient Reconstruction of Stochastic Pedigrees: Some Steps From Theory to Practice

04/10/2022
by Elchanan Mossel, et al.

In an extant population, how much information do extant individuals provide on the pedigree of their ancestors? Recent work by Kim, Mossel, Ramnarayan and Turner (2020) studied this question under a number of simplifying assumptions, including random mating, fixed-length inheritance blocks and a sufficiently large founding population. They showed that under these conditions, if the average number of offspring is a sufficiently large constant, then it is possible to recover a large fraction of the pedigree structure and genetic content by an algorithm they named REC-GEN. We are interested in studying the performance of REC-GEN on simulated data generated according to the model. As a first step, we improve the running time of the algorithm. However, we observe that even the faster version of the algorithm performs poorly in all of our simulations at recovering the pedigree beyond 2 generations. We claim that this is due to the inbreeding present in any setting where the algorithm can be run, even on simulated data. To support the claim, we show that a main step of the algorithm, called ancestral reconstruction, performs accurately in an idealized setting with no inbreeding but performs poorly in random mating populations. To overcome the poor behavior of REC-GEN, we introduce a heuristic based on Belief Propagation that accounts for the inbreeding and performs much better in our simulations.
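The kind of simulated data discussed above can be illustrated with a minimal sketch. It is not the authors' simulator: it assumes that each individual carries a single sequence of blocks, that every block is copied from a uniformly chosen parent, that couples are monogamous, and that every couple has the same number of children. The names `simulate_pedigree` and `fraction_with_related_parents` are hypothetical, and "related" here is only a crude proxy for inbreeding (the two parents share at least one founder ancestor).

```python
import random


def simulate_pedigree(num_founders, num_generations, children_per_couple,
                      num_blocks, seed=0):
    """Return a list of generations; generation 0 holds the founders."""
    rng = random.Random(seed)
    # Assumption: founder i carries the unique symbol i in every block.
    founders = [
        {"parents": None, "blocks": [i] * num_blocks, "founders": frozenset([i])}
        for i in range(num_founders)
    ]
    generations = [founders]
    for _ in range(num_generations):
        prev = generations[-1]
        order = list(range(len(prev)))
        rng.shuffle(order)  # random mating: pair individuals uniformly at random
        current = []
        for k in range(0, len(order) - 1, 2):
            a, b = order[k], order[k + 1]
            for _child in range(children_per_couple):
                # Each block is inherited independently from one of the two parents.
                blocks = [prev[rng.choice((a, b))]["blocks"][j]
                          for j in range(num_blocks)]
                current.append({
                    "parents": (a, b),  # indices into the previous generation
                    "blocks": blocks,
                    "founders": prev[a]["founders"] | prev[b]["founders"],
                })
        generations.append(current)
    return generations


def fraction_with_related_parents(generations, g):
    """Fraction of generation g whose two parents share a founder ancestor."""
    prev, cur = generations[g - 1], generations[g]
    related = sum(
        1 for child in cur
        if prev[child["parents"][0]]["founders"] & prev[child["parents"][1]]["founders"]
    )
    return related / len(cur)


if __name__ == "__main__":
    gens = simulate_pedigree(200, 4, 3, 10, seed=1)
    for g in range(1, len(gens)):
        print(g, len(gens[g]), round(fraction_with_related_parents(gens, g), 3))
```

In this sketch the first generation never has related parents, since founders are distinct by construction. Any shared ancestry appears only in later generations, which is where one would look for the effect of inbreeding on reconstruction.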


