The Capacity of Some Pólya String Models

08/18/2018
by Ohad Elishco, et al.

We study random string-duplication systems, which we call Pólya string models. These are motivated by DNA storage in living organisms and by certain random mutation processes that affect their genomes. Unlike previous works, which study the combinatorial capacity of string-duplication systems or various string statistics, this work provides the exact capacity, or bounds on it, for several probabilistic models. In particular, we study the capacity of noisy string-duplication systems, including the tandem-duplication, end-duplication, and interspersed-duplication systems. Interesting connections are drawn between some of these systems and the signature of random permutations, as well as the beta distribution common in population genetics.


research
12/22/2021

On the Reverse-Complement String-Duplication System

Motivated by DNA storage in living organisms, and by known biological mu...
research
12/05/2018

Evolution of k-mer Frequencies and Entropy in Duplication and Substitution Mutation Systems

Genomic evolution can be viewed as string-editing processes driven by mu...
research
06/24/2019

Dynamic Palindrome Detection

Lately, there is a growing interest in dynamic string matching problems....
research
04/12/2018

Unique Reconstruction of Coded Strings from Multiset Substring Spectra

The problem of reconstructing strings from their substring spectra has a...
research
02/28/2021

On Problems Dual to Unification: The String-Rewriting Case

In this paper, we investigate problems which are dual to the unification...
research
10/31/2021

Computing Matching Statistics on Repetitive Texts

Computing the matching statistics of a string P[1..m] with respect to a ...
research
04/06/2020

SOPanG 2: online searching over a pan-genome without false positives

The pan-genome can be stored as elastic-degenerate (ED) string, a recent...
