Grammar Compressed Sequences with Rank/Select Support

11/20/2019
by Alberto Ordóñez, et al.

Sequence representations supporting not only direct access to their symbols, but also rank/select operations, are a fundamental building block in many compressed data structures. Several recent applications need to represent highly repetitive sequences, and classical statistical compression proves ineffective. We introduce, instead, grammar-based representations for repetitive sequences, which use up to 6 compressed representations, and support direct access and rank/select operations within tens of microseconds. We demonstrate the impact of our structures in text indexing applications.
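To make the three operations concrete, here is a minimal sketch of access, rank and select answered directly on a grammar without expanding it. It is not the structure proposed in the paper. It assumes a toy straight-line program in which every rule is either a single terminal character or a pair of rule names, and it stores a full symbol counter per nonterminal, which a space-conscious structure would avoid. The class name `SLP` and the rule names are hypothetical.

```python
from collections import Counter


class SLP:
    """Toy straight-line program (illustrative only).

    rules maps a rule name to either a one-character string (terminal)
    or a (left, right) tuple of rule names.
    """

    def __init__(self, rules, start):
        self.rules = rules
        self.start = start
        self.length = {}   # expanded length of each rule
        self.counts = {}   # symbol counts of each rule's expansion
        for name in rules:
            self._summarize(name)

    def _summarize(self, name):
        if name in self.length:
            return
        rhs = self.rules[name]
        if isinstance(rhs, str):
            self.length[name] = 1
            self.counts[name] = Counter(rhs)
        else:
            left, right = rhs
            self._summarize(left)
            self._summarize(right)
            self.length[name] = self.length[left] + self.length[right]
            self.counts[name] = self.counts[left] + self.counts[right]

    def access(self, i):
        """Return the symbol at 0-based position i."""
        if not 0 <= i < self.length[self.start]:
            raise IndexError(i)
        name = self.start
        while not isinstance(self.rules[name], str):
            left, right = self.rules[name]
            if i < self.length[left]:
                name = left
            else:
                i -= self.length[left]
                name = right
        return self.rules[name]

    def rank(self, c, i):
        """Return the number of occurrences of c in the first i symbols."""
        if not 0 <= i <= self.length[self.start]:
            raise IndexError(i)
        name, total = self.start, 0
        while i > 0:
            rhs = self.rules[name]
            if isinstance(rhs, str):
                total += 1 if rhs == c else 0
                break
            left, right = rhs
            if i < self.length[left]:
                name = left
            else:
                # The whole left child lies inside the prefix.
                total += self.counts[left][c]
                i -= self.length[left]
                name = right
        return total

    def select(self, c, j):
        """Return the 0-based position of the j-th occurrence of c (j >= 1)."""
        if not 1 <= j <= self.counts[self.start][c]:
            raise ValueError(j)
        name, pos = self.start, 0
        while not isinstance(self.rules[name], str):
            left, right = self.rules[name]
            if j <= self.counts[left][c]:
                name = left
            else:
                j -= self.counts[left][c]
                pos += self.length[left]
                name = right
        return pos


# Grammar for the repetitive string "abababab" using five rules.
rules = {
    "A": "a",
    "B": "b",
    "X": ("A", "B"),
    "Y": ("X", "X"),
    "S": ("Y", "Y"),
}
slp = SLP(rules, "S")
print(slp.access(3))       # b
print(slp.rank("a", 5))    # 3
print(slp.select("b", 3))  # 5
```

Each query walks a single root-to-leaf path, so its cost in this sketch is proportional to the grammar height rather than to the length of the expanded sequence.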


