Some Enumeration Problems in the Duplication-Loss Model of Genome Rearrangement

02/01/2019
by   Mladen Kovačević, et al.
0

Tandem-duplication-random-loss (TDRL) is an important genome rearrangement operation studied in evolutionary biology. This paper investigates some of the formal properties of TDRL operations on the symmetric group (the space of permutations over an n -set). In particular, motivated by error correction and reconstruction problems in DNA-based data storage applications, we determine the size of "balls" of radius one in the TDRL metric, as well as the cardinality of the maximum intersection of two balls. The corresponding problems for the so-called mirror (or palindromic) TDRL rearrangement operations are also investigated.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/09/2020

Capacity-Approaching Constrained Codes with Error Correction for DNA-Based Data Storage

We propose coding techniques that limit the length of homopolymers runs,...
research
07/01/2023

Codes with Biochemical Constraints and Single Error Correction for DNA-Based Data Storage

In DNA-based data storage, DNA codes with biochemical constraints and er...
research
02/15/2023

Indel Error Correction Codes for DNA Digital Data Storage and Retrieval

A procedure for storage and retrieval of Digital information in DNA stri...
research
12/19/2021

Lerna: Transformer Architectures for Configuring Error Correction Tools for Short- and Long-Read Genome Sequencing

Sequencing technologies are prone to errors, making error correction (EC...
research
04/06/2022

SPIDER-WEB enables stable, repairable, and encryptible algorithms under arbitrary local biochemical constraints in DNA-based storage

DNA has been considered as a promising medium for storing digital inform...
research
12/30/2018

ATHENA: Automated Tuning of Genomic Error Correction Algorithms using Language Models

The performance of most error-correction algorithms that operate on geno...
research
06/27/2014

Reconstructing subclonal composition and evolution from whole genome sequencing of tumors

Tumors often contain multiple subpopulations of cancerous cells defined ...

Please sign up or login with your details

Forgot password? Click here to reset