Distribution of Base Pair Alternations in a Periodic DNA Chain: Application of Polya Counting to a Physical System

05/16/2018
by   Malcolm Hillebrand, et al.
0

In modeling DNA chains, the number of alternations between Adenine-Thymine (AT) and Guanine-Cytosine (GC) base pairs can be considered as a measure of the heterogeneity of the chain, which in turn could affect its dynamics. A probability distribution function of the number of these alternations is derived for circular or periodic DNA. Since there are several symmetries to account for in the periodic chain, necklace counting methods are used. In particular, Polya's Enumeration Theorem is extended for the case of a group action that preserves partitioned necklaces. This, along with the treatment of generating functions as formal power series, allows for the direct calculation of the number of possible necklaces with a given number of AT base pairs, GC base pairs and alternations. The theoretically obtained probability distribution functions of the number of alternations are accurately reproduced by Monte Carlo simulations and fitted by Gaussians. The effect of the number of base pairs on the characteristics of these distributions is also discussed, as well as the effect of the ratios of the numbers of AT and GC base pairs.

READ FULL TEXT

Authors

page 1

page 2

page 3

page 4

02/27/2018

A unifying framework for the modelling and analysis of STR DNA samples arising in forensic casework

This paper presents a new framework for analysing forensic DNA samples u...
01/10/2022

An examination of the spillage distribution

We examine a family of discrete probability distributions that describes...
02/15/2021

Expansions in Cantor real bases

We introduce and study series expansions of real numbers with an arbitra...
12/07/2020

Ratio of counts vs ratio of rates in Poisson processes

The often debated issue of `ratios of small numbers of events' is approa...
07/31/2016

Identification of repeats in DNA sequences using nucleotide distribution uniformity

Repetitive elements are important in genomic structures, functions and r...
09/11/2019

A sub-critical branching process model for application to analysing Y haplotype DNA mixtures

The treatment of short-tandem-repeat (STR) loci on the Y chromosome pres...
This week in AI

Get the week's most popular data science and artificial intelligence research sent straight to your inbox every Saturday.