Fast detection of specific fragments against a set of sequences

08/06/2022
by   Marie-Pierre Béal, et al.
0

We design alignment-free techniques for comparing a sequence or word, called a target, against a set of words, called a reference. A target-specific factor of a target T against a reference R is a factor w of a word in T which is not a factor of a word of R and such that any proper factor of w is a factor of a word of R. We first address the computation of the set of target-specific factors of a target T against a reference R, where T and R are finite sets of sequences. The result is the construction of an automaton accepting the set of all considered target-specific factors. The construction algorithm runs in linear time according to the size of T∪ R. The second result consists in the design of an algorithm to compute all the occurrences in a single sequence T of its target-specific factors against a reference R. The algorithm runs in real-time on the target sequence, independently of the number of occurrences of target-specific factors.

READ FULL TEXT

page 5

page 8

research
03/26/2018

Unpopularity Factor in the Marriage and Roommates Problems

We develop an algorithm to compute an unpopularity factor of a given mat...
research
06/07/2018

Alignment-free sequence comparison using absent words

Sequence comparison is a prerequisite to virtually all comparative genom...
research
02/07/2021

Lie complexity of words

Given a finite alphabet Σ and a right-infinite word w over Σ, we define ...
research
11/02/2019

On The Study Of D-Optimal Saturated Designs For Mean, Main Effects and F_1-Two-Factor Interactions For 2^k-Factorial Experiments

The goal of this paper is to develop methods for the construction of sat...
research
08/07/2018

Circular critical exponents for Thue-Morse factors

We prove various results about the largest exponent of a repetition in a...
research
07/13/2023

Reconciling the Theory of Factor Sequences

Factor Sequences are stochastic double sequences (y_it: i ∈ℕ, t ∈ℤ) inde...
research
07/19/2023

Fundamental Limits of Reference-Based Sequence Reordering

The problem of reconstructing a sequence of independent and identically ...

Please sign up or login with your details

Forgot password? Click here to reset