Motivated by challenges in pangenomic read alignment, we propose a
gener...
A wavelet forest for a text T [1..n] over an alphabet σ takes n H_0
(T) ...
Recently, Conte et al. generalized the longest-common prefix (LCP) array...
FM-indexes are a crucial data structure in DNA alignment, for example, b...
It is not difficult to think of applications that can be modelled as gra...
Matching statistics were introduced to solve the approximate string matc...
Given a straight-line program with g rules for a text T [1..n], we can
b...
We describe how, given a text T [1..n] and a positive constant ϵ,
we can...
Consider a text T [1..n] prefixed by a reference sequence R = T
[1..ℓ]. ...
MONI (Rossi et al., 2022) can store a pangenomic dataset T in small spac...
In 1985 Hopcroft, Joseph and Whitesides introduced the problem of foldin...
There now exist compact indexes that can efficiently list all the occurr...
The Burrows-Wheeler Transform (BWT) is often taught in undergraduate cou...
We describe a new tool, KATKA, that stores a phylogenetic tree T such th...
We consider the space needed to store a searchable partial-sums data
str...
As nowadays Machine Learning (ML) techniques are generating huge data
co...
Maximal exact matches (MEMs) have been widely used in bioinformatics at ...
Experts would probably have guessed that compressed sparse bitvectors we...
We propose a natural variation of the Hopcroft, Joseph and Whitesides' (...
Gagie and Nekrich (2009) gave an algorithm for adaptive prefix-free codi...
We show how an Euler tour for a tree on n vertices with maximum degree d...
The r-index (Gagie et al., JACM 2020) represented a breakthrough in
comp...
Let G be a Wheeler graph and r be the number of runs in a Burrows-Wheele...
Computing the matching statistics of patterns with respect to a text is ...
Prefix-free parsing (PFP) was introduced by Boucher et al. (2019) as a
p...
The k^2-tree is a successful compact representation of binary relations
...
Shape grammars are well known in the graphics and design communities but...
The r-index is a tool for compressed indexing of genomic databases for
e...
Data compression is a powerful tool for managing massive but repetitive
...
Converting a set of sequencing reads into a lossless compact data struct...
While short read aligners, which predominantly use the FM-index, are abl...
The Burrows-Wheeler Transform (BWT) is an important technique both in da...
We present RCT, a new compact data structure to represent trajectories o...
The advent of high-throughput sequencing has resulted in massive genomic...
Indexing highly repetitive texts --- such as genomic databases, software...
We present the first solution to τ-majorities on tree paths. Given a
tre...
De novo DNA assembly is a fundamental task in Bioinformatics, and findin...
High-throughput sequencing technologies have led to explosive growth of
...
The Block Tree (BT) is a novel compact data structure designed to compre...
We consider the problem of decompressing the Lempel-Ziv 77 representatio...
Lempel-Ziv 1977 (LZ77) parsing, matching statistics and the Burrows-Whee...
In this paper we give an infinite family of strings for which the length...