Efficient Data Ingestion and Query Processing for LSM-Based Storage Systems

08/27/2018
by   Chen Luo, et al.
0

In recent years, the Log Structured Merge (LSM) tree has been widely adopted by NoSQL and NewSQL systems for its superior write performance. Despite its popularity, however, most existing work has focused on LSM-based key-value stores with only a primary LSM-tree index; auxiliary structures, which are critical for supporting ad-hoc queries, have received much less attention. In this paper, we focus on efficient data ingestion and query processing for general-purpose LSM-based storage systems. We first propose and evaluate a series of optimizations for efficient batched point lookups, significantly improving the range of applicability of LSM-based secondary indexes. We then present several new and efficient maintenance strategies for LSM-based storage systems. Finally, we have implemented and experimentally evaluated the proposed techniques in the context of the Apache AsterixDB system, and we present the results here.

READ FULL TEXT

page 12

page 13

research
06/23/2019

On Performance Stability in LSM-based Storage Systems

The Log-Structured Merge-Tree (LSM-tree) has been widely adopted for use...
research
05/01/2023

An Update-intensive LSM-based R-tree Index

Many applications require update-intensive workloads on spatial objects,...
research
03/12/2018

A Modular Design for Geo-Distributed Querying

Most distributed storage systems provide limited abilities for querying ...
research
04/04/2020

On the Efficient Design of LSM Stores

In the last decade, key-value data storage systems have gained significa...
research
01/09/2018

Search on Secondary Attributes in Geo-Distributed Systems

In the age of big data, more and more applications need to query and ana...
research
04/19/2023

Tutorial: The Ubiquitous Skiplist, its Variants, and Applications in Modern Big Data Systems

The Skiplist, or skip list, originally designed as an in-memory data str...
research
02/09/2022

Constructing and Analyzing the LSM Compaction Design Space (Updated Version)

Log-structured merge (LSM) trees offer efficient ingestion by appending ...

Please sign up or login with your details

Forgot password? Click here to reset