Analysis of Indexing Structures for Immutable Data (Full Version)

03/04/2020
by   Cong Yue, et al.
0

In emerging applications such as blockchains and collaborative data analytics, there are strong demands for data immutability, multi-version accesses, and tamper-evident controls. This leads to three new index structures for immutable data, namely Merkle Patricia Trie (MPT), Merkle Bucket Tree (MBT), and Pattern-Oriented-Split Tree (POS-Tree). Although these structures have been adopted in real applications, there is no systematic evaluation of their pros and cons in the literature. This makes it difficult for practitioners to choose the right index structure for their applications, as there is only a limited understanding of the characteristics of each index. To alleviate the above deficiency, we present a comprehensive analysis of the existing index structures for immutable data, evaluating both their asymptotic and empirical performance. Specifically, we show that MPT, MBT, and POS-Tree are all instances of a recently proposed framework, dubbed Structurally Invariant and Reusable Indexes (SIRI). We propose to evaluate the SIRI instances based on five essential metrics: their efficiency for four index operations (i.e., lookup, update, comparison, and merge), as well as their deduplication ratios (i.e., the size of the index with deduplication over the size without deduplication). We establish the worst-case guarantees of each index in terms of these five metrics, and we experimentally evaluate all indexes in a large variety of settings. Based on our theoretical and empirical analysis, we conclude that POS-Tree is a favorable choice for indexing immutable data.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/04/2020

Analysis of Indexing Structures for Immutable Data

In emerging applications such as blockchains and collaborative data anal...
research
02/08/2022

OSM-tree: A Sortedness-Aware Index

Indexes facilitate efficient querying when the selection predicate is on...
research
09/22/2020

There is No Such Thing as an "Index"! or: The next 500 Indexing Papers

Index structures are a building block of query processing and computer s...
research
04/15/2004

The Persistent Buffer Tree : An I/O-efficient Index for Temporal Data

In a variety of applications, we need to keep track of the development o...
research
08/21/2020

Metrics and Ambits and Sprawls, Oh My

A follow-up to my previous tutorial on metric indexing, this paper walks...
research
01/22/2019

Just-in-Time Index Compilation

Creating or modifying a primary index is a time-consuming process, as th...
research
09/01/2018

Open Data Analytical Model for Human Development Index Optimization to Support Government Policy

The transparency nature of Open Data is beneficial for citizens to evalu...

Please sign up or login with your details

Forgot password? Click here to reset