Wheeler maps

08/18/2023
by Andrej Balaz, et al.

Motivated by challenges in pangenomic read alignment, we propose a generalization of Wheeler graphs that we call Wheeler maps. A Wheeler map stores a text T[1..n] and an assignment of tags to the characters of T such that we can preprocess a pattern P[1..m] and then, given i and j, quickly return all the distinct tags labeling the first characters of the occurrences of P[i..j] in T. For the applications that most interest us, characters with long common contexts are likely to have the same tag, so we consider the number t of runs in the list of tags sorted by their characters' positions in the Burrows-Wheeler Transform (BWT) of T. We show how, given a straight-line program with g rules for T, we can build an O(g + r + t)-space Wheeler map, where r is the number of runs in the BWT of T, with which we can preprocess a pattern P[1..m] in O(m log n) time and then return the k distinct tags for P[i..j] in optimal O(k) time for any given i and j. We show various further results related to prioritizing the most frequent tags.
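
To make the query and the parameter t concrete, here is a minimal brute-force sketch in Python. It is not the O(g + r + t)-space structure from the abstract and does not achieve the stated query time; it only illustrates what a Wheeler map is asked to return. The function names, the `$` sentinel, and the reading of "tags sorted by their characters' positions in the BWT" (the tag at BWT position p is the tag of the character stored at that position) are assumptions made for illustration.

```python
def distinct_tags(text, tags, pattern, i, j):
    """Distinct tags on the first characters of the occurrences of P[i..j] in T.

    Positions i and j are 1-based and inclusive, as in the abstract.
    Brute-force scan: an illustration of the query, not the paper's method.
    """
    if not 1 <= i <= j <= len(pattern):
        raise ValueError("need 1 <= i <= j <= len(pattern)")
    sub = pattern[i - 1:j]
    seen, found = set(), []
    start = text.find(sub)
    while start != -1:
        tag = tags[start]  # tag labeling the first character of this occurrence
        if tag not in seen:
            seen.add(tag)
            found.append(tag)
        start = text.find(sub, start + 1)  # allow overlapping occurrences
    return found


def tags_in_bwt_order(text, tags):
    """Return the tags listed in BWT order and the number t of runs in that list.

    Assumes text ends with a unique sentinel '$' that sorts before every
    other character. The suffix array is built naively by sorting suffixes.
    """
    n = len(text)
    sa = sorted(range(n), key=lambda k: text[k:])
    # BWT position p holds text[sa[p] - 1] (cyclically), so it carries that
    # character's tag under the interpretation assumed here.
    ordered = [tags[sa[p] - 1] for p in range(n)]
    runs = sum(1 for p in range(n) if p == 0 or ordered[p] != ordered[p - 1])
    return ordered, runs
```

For example, with T = `banana$` and tags `u, v, w, v, w, x, e` on its seven characters, querying P[2..2] of P = `xana` (the substring `a`) returns the tags `v` and `x`, and the tag list in BWT order has 5 runs.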

research
10/14/2021

Tagged Documents Co-Clustering

Tags are short sequences of words allowing to describe textual and non-t...
research
01/16/2020

Generalised Pattern Matching Revisited

In the problem of Generalised Pattern Matching (GPM) [STOC'94, Muthukris...
research
05/02/2020

Single Model Ensemble using Pseudo-Tags and Distinct Vectors

Model ensemble techniques often increase task performance in neural netw...
research
06/30/2022

Computing the Parameterized Burrows–Wheeler Transform Online

Parameterized strings are a generalization of strings in that their char...
research
08/16/2023

An Efficient Early-breaking Estimation and Tree-splitting Missing RFID Tag Identification Protocol

Recent statistics have demonstrated that missing items have become the m...
research
02/16/2021

A Read Range Maximization Approach for Ultra-Wideband SAW Tags based on Interdigital Transducer (IDT) as a Reflector

Radio Frequency Identification (RFID) technology using readers and ID ta...
research
11/17/2017

Metric Map Merging using RFID Tags & Topological Information

A map merging component is crucial for the proper functionality of a mul...
