A unifying framework for the modelling and analysis of STR DNA samples arising in forensic casework

02/27/2018
by Robert George Cowell, et al.

This paper presents a new framework for analysing forensic DNA samples using probabilistic genotyping. Specifically, it presents a mathematical framework for specifying and combining the steps involved in producing forensic casework electropherograms of short tandem repeat (STR) loci from DNA samples. It is applicable to both high and low template DNA samples, that is, samples containing either high or low amounts of DNA. A specific model is developed within the framework, by way of particular modelling assumptions and approximations, and its interpretive power is demonstrated on examples using simulated data and data from a publicly available dataset. The framework relies heavily on univariate and multivariate probability generating functions, which are shown to provide succinct and elegant mathematical scaffolding for modelling the key steps in the process. A significant development in this paper is a set of new numerical methods for accurately and efficiently evaluating the probability distribution of amplicons arising from the polymerase chain reaction process, which is modelled as a discrete multi-type branching process. Source code in the scripting languages Python, R and Julia is provided to illustrate these methods. These new developments will also be of general interest to those working outside forensic DNA interpretation, the focus of this paper.
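To make the role of probability generating functions concrete, the following is a minimal sketch and not the paper's method. It assumes a single-type simplification of the branching process in which every molecule is copied independently with probability `p` in each cycle, so the one-cycle offspring PGF is f(s) = (1 - p)s + p s^2. The n-cycle PGF is then the n-fold composition of f, and the amplicon count distribution can be recovered by evaluating that PGF at roots of unity and applying a discrete Fourier inversion. The function names, the parameter `p`, and the naive O(N^2) inversion are all illustrative assumptions; the paper's model is multi-type and its numerical methods are more refined.

```python
import cmath
import math


def offspring_pgf(s, p):
    """One-cycle PGF for a single molecule (assumed model):
    it stays single with probability 1 - p, or becomes two with probability p."""
    return (1.0 - p) * s + p * s * s


def amplicon_pgf(s, n_cycles, p, n_start=1):
    """PGF of the molecule count after n_cycles, starting from n_start molecules."""
    value = s
    for _ in range(n_cycles):
        value = offspring_pgf(value, p)  # n-fold composition of the offspring PGF
    return value ** n_start  # independent lineages multiply their PGFs


def amplicon_distribution(n_cycles, p, n_start=1):
    """Recover P(count = m) by evaluating the PGF at roots of unity
    and inverting with a naive discrete Fourier transform."""
    size = n_start * 2 ** n_cycles + 1  # counts range over 0 .. n_start * 2^n
    roots = [cmath.exp(2j * math.pi * k / size) for k in range(size)]
    values = [amplicon_pgf(s, n_cycles, p, n_start) for s in roots]
    probs = []
    for m in range(size):
        total = sum(
            v * cmath.exp(-2j * math.pi * k * m / size)
            for k, v in enumerate(values)
        )
        probs.append(max(total.real / size, 0.0))  # clip tiny negative round-off
    return probs


# Hypothetical usage: 5 cycles at an assumed amplification efficiency of 0.8.
dist = amplicon_distribution(n_cycles=5, p=0.8)
mean = sum(m * q for m, q in enumerate(dist))
```

Under this assumed model the mean count after n cycles from one molecule is (1 + p)^n, which gives a quick check on the recovered distribution. For realistic cycle numbers the support grows as 2^n, so the naive inversion above is only practical for small n.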


