Improved Compressed String Dictionaries

11/19/2019
by Nieves R. Brisaboa, et al.

We introduce a new family of compressed data structures to efficiently store and query large string dictionaries in main memory. Our main technique is a combination of hierarchical Front-coding with ideas from longest-common-prefix computation in suffix arrays. Our data structures yield relevant space-time tradeoffs in real-world dictionaries. We focus on two domains where string dictionaries are extensively used and efficient compression is required: URL collections, a key element in Web graphs and applications such as Web mining; and collections of URIs and literals, the basic components of RDF datasets. Our experiments show that our data structures achieve better compression than the state-of-the-art alternatives while providing very competitive query times.
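To make the underlying idea concrete, the following is a minimal sketch of plain Front-coding over a sorted list of strings. It is not the hierarchical variant the abstract refers to, and it does not include the longest-common-prefix ideas borrowed from suffix arrays. The function names (`lcp`, `front_encode`, `front_decode`) and the sample URLs are illustrative assumptions, not taken from the paper.

```python
from typing import List, Tuple


def lcp(a: str, b: str) -> int:
    """Length of the longest common prefix of a and b."""
    n = min(len(a), len(b))
    i = 0
    while i < n and a[i] == b[i]:
        i += 1
    return i


def front_encode(sorted_strings: List[str]) -> List[Tuple[int, str]]:
    """Plain Front-coding: store each string as (shared prefix length
    with the previous string, remaining suffix). Input must be sorted."""
    encoded = []
    prev = ""
    for s in sorted_strings:
        k = lcp(prev, s)
        encoded.append((k, s[k:]))
        prev = s
    return encoded


def front_decode(encoded: List[Tuple[int, str]]) -> List[str]:
    """Rebuild the original strings by reusing the previous string's prefix."""
    decoded = []
    prev = ""
    for k, suffix in encoded:
        s = prev[:k] + suffix
        decoded.append(s)
        prev = s
    return decoded


# Hypothetical sample data: a tiny sorted URL collection.
urls = sorted([
    "http://a.org/",
    "http://a.org/x",
    "http://a.org/xy",
    "http://b.org/",
])
encoded = front_encode(urls)
```

Because neighbouring strings in a sorted URL or URI collection tend to share long prefixes, most entries in `encoded` carry only a short suffix, which is where the space saving comes from. In practice, Front-coded dictionaries are usually split into buckets whose first string is stored in full, so that a lookup only has to decode one bucket; that bucketing is omitted here for brevity.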


research
04/06/2020

Indexing Highly Repetitive String Collections

Two decades ago, a breakthrough in indexing string collections made it p...
research
06/14/2019

Dynamic Path-Decomposed Tries

A keyword dictionary is an associative array whose keys are strings. Rec...
research
05/26/2023

CARAMEL: A Succinct Read-Only Lookup Table via Compressed Static Functions

Lookup tables are a fundamental structure in many data processing and sy...
research
04/26/2022

A Review of In-Memory Space-Efficient Data Structures for Temporal Graphs

Temporal graphs model relationships among entities over time. Recent stu...
research
12/02/2022

Trie-Compressed Intersectable Sets

We introduce space- and time-efficient algorithms and data structures fo...
research
02/26/2020

Revisiting compact RDF stores based on k2-trees

We present a new compact representation to efficiently store and query l...
research
11/13/2020

A grammar compressor for collections of reads with applications to the construction of the BWT

We describe a grammar for DNA sequencing reads from which we can compute...
