Better Write Amplification for Streaming Data Processing

06/05/2023
by   Andrei Chulkov, et al.
0

Many current applications have to perform data processing in a streaming fashion. Doing so at a large scale requires a parallel system that must be equipped to handle straggling workers and different kinds of failures. YT is the main driver behind distributed systems at Yandex, home to its distributed file system, lock service, key-value storage, and internal MapReduce platform. We implement a new component of this system designed for performing streaming MapReduce operations, utilizing different core YT solutions to achieve fault-tolerance and exactly-once semantics while maintaining efficiency and low write amplification factors.

READ FULL TEXT

page 32

page 33

research
11/12/2018

On the Performance and Convergence of Distributed Stream Processing via Approximate Fault Tolerance

Fault tolerance is critical for distributed stream processing systems, y...
research
07/31/2019

Distributed Streaming Analytics on Large-scale Oceanographic Data using Apache Spark

Real-world data from diverse domains require real-time scalable analysis...
research
02/11/2021

Chiron: Optimizing Fault Tolerance in QoS-aware Distributed Stream Processing Jobs

Fault tolerance is a property which needs deeper consideration when deal...
research
11/25/2019

A Foundation of Lazy Streaming Graphs

A streaming graph system continuously processes a stream of operations o...
research
10/29/2021

Parallel-and-stream accelerator for computationally fast supervised learning

Two dominant distributed computing strategies have emerged to overcome t...
research
11/26/2019

Distributed graphs: in search of fast, low-latency, resource-efficient, semantics-rich Big-Data processing

Large graphs can be processed with single high-memory or distributed sys...
research
09/06/2021

Khaos: Dynamically Optimizing Checkpointing for Dependable Distributed Stream Processing

Distributed Stream Processing systems are becoming an increasingly essen...

Please sign up or login with your details

Forgot password? Click here to reset