Round-Hashing for Data Storage: Distributed Servers and External-Memory Tables

05/08/2018
by   Roberto Grossi, et al.
0

This paper proposes round-hashing, which is suitable for data storage on distributed servers and for implementing external-memory tables in which each lookup retrieves at most a single block of external memory, using a stash. For data storage, round-hashing is like consistent hashing as it avoids a full rehashing of the keys when new servers are added. Experiments show that the speed to serve requests is tenfold or more than the state of the art. In distributed data storage, this guarantees better throughput for serving requests and, moreover, greatly reduces decision times for which data should move to new servers as rescanning data is much faster.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/08/2018

Asymptotic Miss Ratio of LRU Caching with Consistent Hashing

To efficiently scale data caching infrastructure to support emerging big...
research
12/23/2018

AnchorHash: A Scalable Consistent Hash

Consistent hashing (CH) is a central building block in many networking a...
research
02/18/2015

Cross-Modality Hashing with Partial Correspondence

Learning a hashing function for cross-media search is very desirable due...
research
09/04/2023

Objcache: An Elastic Filesystem over External Persistent Storage for Container Clusters

Container virtualization enables emerging AI workloads such as model ser...
research
06/06/2022

Managing Bufferbloat in Storage Systems

Today, companies and data centers are moving towards distributed and ser...
research
07/03/2020

Mapping Datasets to Object Storage System

Access libraries such as ROOT and HDF5 allow users to interact with data...
research
07/16/2021

DxHash: A Scalable Consistent Hash Based on the Pseudo-Random Sequence

Consistent hasing has played a fundamental role as a data router and a l...

Please sign up or login with your details

Forgot password? Click here to reset