Identification of repeats in DNA sequences using nucleotide distribution uniformity

07/31/2016
by   Changchuan Yin, et al.
0

Repetitive elements are important in genomic structures, functions and regulations, yet effective methods in precisely identifying repetitive elements in DNA sequences are not fully accessible, and the relationship between repetitive elements and periodicities of genomes is not clearly understood. We present an ab initio method to quantitatively detect repetitive elements and infer the consensus repeat pattern in repetitive elements. The method uses the measure of the distribution uniformity of nucleotides at periodic positions in DNA sequences or genomes. It can identify periodicities, consensus repeat patterns, copy numbers and perfect levels of repetitive elements. The results of using the method on different DNA sequences and genomes demonstrate efficacy and accuracy in identifying repeat patterns and periodicities. The complexity of the method is linear with respect to the lengths of the analyzed sequences.

READ FULL TEXT

page 5

page 6

research
05/11/2021

Constrained Consensus Sequence Algorithm for DNA Archiving

The paper describes an algorithm to compute a consensus sequence from a ...
research
12/12/2017

Encoding DNA sequences by integer chaos game representation

DNA sequences are fundamental for encoding genetic information. The gene...
research
11/18/2018

Prediction of Signal Sequences in Abiotic Stress Inducible Genes from Main Crops by Association Rule Mining

It is important to study on genes affecting to growing environment of ma...
research
08/13/2018

Clustering genomic words in human DNA using peaks and trends of distributions

In this work we seek clusters of genomic words in human DNA by studying ...
research
07/06/2019

Investigating some attributes of periodicity in DNA sequences via semi-Markov modelling

DNA segments and sequences have been studied thoroughly during the past ...
research
06/28/2000

Correlation over Decomposed Signals: A Non-Linear Approach to Fast and Effective Sequences Comparison

A novel non-linear approach to fast and effective comparison of sequence...
research
05/16/2018

Distribution of Base Pair Alternations in a Periodic DNA Chain: Application of Polya Counting to a Physical System

In modeling DNA chains, the number of alternations between Adenine-Thymi...

Please sign up or login with your details

Forgot password? Click here to reset