Delivery, consistency, and determinism: rethinking guarantees in distributed stream processing

07/14/2019
by Artem Trofimov, et al.

Consistency requirements for state-of-the-art stream processing systems are defined in terms of delivery guarantees. Exactly-once is the strongest of these and the most desirable for end users. However, the concept raises several issues. Commonly used techniques for enforcing exactly-once incur significant performance overhead. Moreover, the notion of exactly-once is not formally defined and does not capture all the properties that stream processing systems supporting this guarantee actually provide. In this paper, we introduce a formal framework that allows us to define streaming guarantees in a more regular way. We demonstrate that the properties of delivery, consistency, and determinism are tightly connected within distributed stream processing. We also show that, given lightweight determinism, it is possible to provide exactly-once with almost no performance overhead. Experiments show that the proposed approach can significantly outperform alternative industrial solutions.

