AutoShard – Declaratively Managing Hot Spot Data Objects in NoSQL Document Stores

11/01/2021
by   Stefanie Scherzinger, et al.
0

NoSQL document stores are becoming increasingly popular as backends in web development. Not only do they scale out to large volumes of data, many systems are even custom-tailored for this domain: NoSQL document stores like Google Cloud Datastore have been designed to support massively parallel reads, and even guarantee strong consistency in updating single data objects. However, strongly consistent updates cannot be implemented arbitrarily fast in large-scale distributed systems. Consequently, data objects that experience high-frequent writes can turn into severe performance bottlenecks. In this paper, we present AutoShard, a ready-to-use object mapper for Java applications running against NoSQL document stores. AutoShard's unique feature is its capability to gracefully shard hot spot data objects to avoid write contention. Using AutoShard, developers can easily handle hot spot data objects by adding minimally intrusive annotations to their application code. Our experiments show the significant impact of sharding on both the write throughput and the execution time.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/22/2021

Columnar Formats for Schemaless LSM-based Document Stores

In the last decade, document store database systems have gained more tra...
research
04/20/2023

"HOT" ChatGPT: The promise of ChatGPT in detecting and discriminating hateful, offensive, and toxic comments on social media

Harmful content is pervasive on social media, poisoning online communiti...
research
05/06/2021

Download time analysis for distributed storage systems with node failures

We consider a distributed storage system which stores several hot (popul...
research
06/12/2018

Performance evaluation for CRUD operations in asynchronously replicated document oriented database

NoSQL databases are becoming increasingly popular as more developers see...
research
06/02/2020

Web Document Categorization Using Naive Bayes Classifier and Latent Semantic Analysis

A rapid growth of web documents due to heavy use of World Wide Web neces...
research
02/25/2021

Fragmented Objects: Boosting Concurrency of Shared Large Objects

This work examines strategies to handle large shared data objects in dis...
research
06/15/2020

Triggerflow: Trigger-based Orchestration of Serverless Workflows

As more applications are being moved to the Cloud thanks to serverless c...

Please sign up or login with your details

Forgot password? Click here to reset