Tuning the Tail Latency of Distributed Queries Using Replication

12/20/2022
by Nathan Ng, et al.

Querying graph data with low latency is an important requirement in application domains such as social networks and knowledge graphs. Graph queries perform multiple hops between vertices. When data is partitioned and stored across multiple servers, a query executing at one server often needs to hop to vertices stored on another server. Such distributed traversals are a performance bottleneck for low-latency queries. To reduce query latency, one can replicate remote data to make distributed traversals unnecessary, but replication is expensive and should be minimized. In this paper, we introduce the problem of finding data replication schemes that satisfy arbitrary user-defined query latency constraints with minimal replication cost. We propose a novel workload model to express data access causality, propose a family of heuristics, and introduce non-trivial sufficient conditions for their correctness. Our evaluation on two representative benchmarks shows that our algorithms enable fine-tuning query latency with data replication and can find sweet spots in the latency/replication design space.
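The trade-off described above can be illustrated with a toy sketch. It is not the paper's workload model or any of its heuristics: the function `count_remote_hops`, the `home` and `replicas` mappings, and the four-vertex example are all hypothetical, and the sketch assumes that a query follows a fixed path, that it moves to a vertex's home server whenever the vertex is not available locally, and that replication cost is simply the number of replicas.

```python
from typing import Dict, List, Set


def count_remote_hops(
    path: List[str],
    home: Dict[str, int],
    replicas: Dict[str, Set[int]],
) -> int:
    """Count the hops along `path` that force the query to change server.

    Assumption (illustrative only): a vertex is local if the current server
    is its home or holds a replica of it; otherwise the query moves to the
    vertex's home server, which counts as one distributed traversal.
    """
    current = home[path[0]]  # the query starts where the first vertex lives
    remote = 0
    for vertex in path[1:]:
        is_local = home[vertex] == current or current in replicas.get(vertex, set())
        if not is_local:
            remote += 1
            current = home[vertex]
    return remote


# Hypothetical placement: vertices a and d on server 0, b and c on server 1.
home = {"a": 0, "b": 1, "c": 1, "d": 0}
path = ["a", "b", "c", "d"]

no_replication = count_remote_hops(path, home, {})                    # 0 replicas
one_replica = count_remote_hops(path, home, {"d": {1}})               # 1 replica
two_replicas = count_remote_hops(path, home, {"b": {0}, "c": {0}})    # 2 replicas

print(no_replication, one_replica, two_replicas)
```

Under these assumptions, each additional replica removes one distributed traversal from this path, which is the kind of latency/replication trade-off the abstract refers to. A latency constraint would then correspond to an upper bound on the number of remote hops.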


